commons-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Arjohn Kampman (JIRA)" <>
Subject [jira] [Created] (IMAGING-174) Support non-8BIM signatures in Photoshop segments
Date Mon, 19 Oct 2015 14:36:05 GMT
Arjohn Kampman created IMAGING-174:

             Summary: Support non-8BIM signatures in Photoshop segments
                 Key: IMAGING-174
             Project: Commons Imaging
          Issue Type: Bug
          Components: Format: JPEG
            Reporter: Arjohn Kampman

The code in IptcParser.parseAllBlocks(...) requires that all blocks have an "8BIM" signature.
However, we're frequently finding jpeg files that have "PHUT" signatures mixed in. Some sites
also report "AgHg" and "DCSR" signatures, for example: Although
the signature is not what the code expects, the block's data layout is still the same as for
8BIM. Please consider either parsing such blocks, or at least skip them with a warning. Currently,
the code throws an exception which prevents us from extracting any of the other metadata.
I'm attaching a sample image from the Enron Corpus that has two of these PHUT resource blocks.

This message was sent by Atlassian JIRA

View raw message