lucene-solr-dev mailing list archives

From "Yonik Seeley (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-477) AnalysisRequestHandler
Date Tue, 12 Feb 2008 04:28:08 GMT

    [ https://issues.apache.org/jira/browse/SOLR-477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567949#action_12567949 ]

Yonik Seeley commented on SOLR-477:
-----------------------------------

I've been meaning to create a handler like this...
I'm not sure that Token should be added directly to XMLWriter and friends, though. It seems
more flexible for the handler to transform the token(s) into more primitive
types.  Or did you actually add a <token> tag to the XML?
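
To illustrate the alternative suggested above, here is a minimal sketch of a handler-side conversion from tokens to primitive types. Everything in it is an assumption made for illustration: SimpleToken is a hypothetical stand-in (not the Lucene Token class), and the map keys are invented names, not the fields used by the actual patch. The idea is that a response writer which already knows how to emit strings, integers, maps, and lists would need no token-specific code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class TokenFlattening {

    // Hypothetical stand-in for an analysis token; NOT the Lucene Token class.
    static final class SimpleToken {
        final String value;
        final String type;
        final int start;
        final int end;
        final int positionIncrement;

        SimpleToken(String value, String type, int start, int end, int positionIncrement) {
            this.value = value;
            this.type = type;
            this.start = start;
            this.end = end;
            this.positionIncrement = positionIncrement;
        }
    }

    // Flatten one token into an insertion-ordered map of strings and integers.
    // The key names are illustrative assumptions.
    static Map<String, Object> tokenToMap(SimpleToken t) {
        Map<String, Object> m = new LinkedHashMap<>();
        m.put("value", t.value);
        m.put("type", t.type);
        m.put("start", t.start);
        m.put("end", t.end);
        m.put("posInc", t.positionIncrement);
        return m;
    }

    // Flatten a token stream (modelled here as a list) into a list of maps,
    // preserving token order.
    static List<Map<String, Object>> tokensToList(List<SimpleToken> tokens) {
        List<Map<String, Object>> out = new ArrayList<>();
        for (SimpleToken t : tokens) {
            out.add(tokenToMap(t));
        }
        return out;
    }

    public static void main(String[] args) {
        List<SimpleToken> tokens = Arrays.asList(
                new SimpleToken("hello", "word", 0, 5, 1),
                new SimpleToken("world", "word", 6, 11, 1));
        // The handler would place this structure in the response instead of raw tokens.
        System.out.println(tokensToList(tokens));
    }
}
```

With this shape, supporting another output format would not require teaching each writer about tokens, since each one only ever sees maps, lists, strings, and integers.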


> AnalysisRequestHandler
> ----------------------
>
>                 Key: SOLR-477
>                 URL: https://issues.apache.org/jira/browse/SOLR-477
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>
> Being able to programmatically access tokenization information can be quite useful not
> only in Solr, but in other NLP applications where token vectors are necessary.
> The patch to follow creates an AnalysisRequestHandler which processes a document through
> the analysis process and returns a response filled with tokens, their offsets, position inc.,
> type and value.
> Patch also adds some character array processing to Xml and adds Token handling to XMLWriter.
> I only implemented Xml output, as I don't know JSON or the other types.  If someone else
> is so motivated, they can add those.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

