jackrabbit-dev mailing list archives

From "Jeremy Anderson (JIRA)" <j...@apache.org>
Subject [jira] Created: (JCR-2365) HTML Text Extractor does not extract or index numerics
Date Tue, 27 Oct 2009 12:46:59 GMT
HTML Text Extractor does not extract or index numerics
------------------------------------------------------

                 Key: JCR-2365
                 URL: https://issues.apache.org/jira/browse/JCR-2365
             Project: Jackrabbit Content Repository
          Issue Type: Bug
          Components: indexing, jackrabbit-text-extractors
    Affects Versions: 1.6.0
         Environment: Win XP-Pro; Win 2003 Enterprise 32bit
            Reporter: Jeremy Anderson


Numeric values such as addresses, dates, and financial figures are not extracted or indexed by the
current HTML Text Extractor. The same values are extracted correctly and are searchable when the
content is processed by the PlainTextExtractor.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

