tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jukka Zitting (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-396) Parser Attachements from Outlook Messages
Date Wed, 14 Apr 2010 10:11:50 GMT

    [ https://issues.apache.org/jira/browse/TIKA-396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12856826#action_12856826

Jukka Zitting commented on TIKA-396:

In revision 933903 I modified the OutlookExtractor to use the parser instance in the ParseContext
instead of a hardcoded AutoDetectParser when parsing the attachments. This is similar to what
the PackageParser does, and allows better client-level control of the parsing process.

Note that there's now an extra "Invalid attachment id" line being printed to system out as
a part of the tika-parsers test suite. I guess this comes from POI.

> Parser Attachements from Outlook Messages
> -----------------------------------------
>                 Key: TIKA-396
>                 URL: https://issues.apache.org/jira/browse/TIKA-396
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.6
>         Environment: All environments.
>            Reporter: Dave Meikle
>            Assignee: Dave Meikle
> As raised by Albert Jensen on the tika-user mailing list[1], it would be good for the
Outlook Parser to iterate through the mails attachments and then extract their content.
> [1]http://mail-archives.apache.org/mod_mbox/lucene-tika-user/201003.mbox/%3C002701cacccf$16108b40$4231a1c0$@mail.dk%3E

This message is automatically generated by JIRA.
If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message