poi-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 51320] New: Determine whether parts other than QuillContents may contain useful text to extract and if so, support extraction from those
Date Fri, 03 Jun 2011 17:58:14 GMT
https://issues.apache.org/bugzilla/show_bug.cgi?id=51320

             Bug #: 51320
           Summary: Determine whether parts other than QuillContents may
                    contain useful text to extract and if so, support
                    extraction from those
           Product: POI
           Version: 3.2-FINAL
          Platform: PC
            Status: NEW
          Severity: normal
          Priority: P2
         Component: HPBF
        AssignedTo: dev@poi.apache.org
        ReportedBy: dgoldenberg@attivio.com
    Classification: Unclassified


Right now, only QuillContents is taken into account when extracting text.

It seems worth researching whether any useful text may be extraced from the
Main and the Escher parts.

This is related to 51317 - Need ability to stream and chunk data out of MS
Publisher documents. If any extra parts get exposed we'd ideally want streaming
available on it.

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Mime
View raw message