tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ray Gauss II (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TIKA-775) Embed Capabilities
Date Tue, 08 Nov 2011 02:32:51 GMT
Embed Capabilities

                 Key: TIKA-775
                 URL: https://issues.apache.org/jira/browse/TIKA-775
             Project: Tika
          Issue Type: Improvement
          Components: general, metadata
    Affects Versions: 1.0
         Environment: The default ExternalEmbedder requires that sed be installed.
            Reporter: Ray Gauss II
             Fix For: 1.0
         Attachments: tika-core-embed-patch.txt, tika-parsers-embed-patch.txt

This patch defines and implements the concept of embedding tika metadata into a file stream,
the reverse of extraction.

In the tika-core project an interface defining an Embedder and a generic sed ExternalEmbedder
implementation meant to be extended or configured are added.  These classes are essentially
a reverse flow of the existing Parser and ExternalParser classes.

In the tika-parsers project an ExternalEmbedderTest unit test is added which uses the default
ExternalEmbedder (calls sed) to embed a value placed in Metadata.DESCRIPTION then verify the
operation by parsing the resulting stream.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message