lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Grant Ingersoll (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENE-975) Position based TermVectorMapper
Date Wed, 08 Aug 2007 02:35:59 GMT
Position based TermVectorMapper
-------------------------------

                 Key: LUCENE-975
                 URL: https://issues.apache.org/jira/browse/LUCENE-975
             Project: Lucene - Java
          Issue Type: New Feature
          Components: Store
            Reporter: Grant Ingersoll
            Assignee: Grant Ingersoll
            Priority: Minor


As part of the new TermVectorMapper approach to TermVectors, the ensuing patch loads term
vectors and stores the term info by position.  This should let people directly index into
a term vector given a position.  Actually, it does it through Maps, b/c the array based bookkeeping
is a pain given the way positions are stored.  

The map looks like:
Map<String,   Map<Integer, TVPositionInfo>>

where the String is the field name, the integer is the position, and TVPositionInfo is a storage
mechanism for the terms and offsets that occur at a position.  It _should_ handle multiple
terms per position (which is always my downfall! )

I have not tested performance of this approach.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message