lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Karl Wettin <karl.wet...@gmail.com>
Subject Re: Lucene newbee quesiton- Term Positions
Date Sun, 07 Oct 2007 16:37:17 GMT


7 okt 2007 kl. 18.38 skrev Erick Erickson:

> I suspect that this is more work than you think, not to mention
> very slow. This is just due to the nature of an inverted
> index....
>
> To see what I mean, get a copy of Luke and have it
> reconstruct one of your documents and you'll see what the
> performance is like.


Also, I recently posted this transparent code for TermVectorMapper  
that will build the term vector space model if it was not cached  
(Field.TermVector.NO):

https://issues.apache.org/jira/secure/attachment/12366959/ 
LUCENE-1016.txt

-- 
karl





>
> I think Luke has all the example code you could ask for, that's
> the place I'd look first. See:
> http://lucene.apache.org/java/docs/contributions.html
>
> Why do you want to do this and is it really necessary? You
> could think about storing the entire document, then when you
> needed to count terms, just using one of the tokenizers and
> counting them yourself....
>
> Best
> Erick
>
> On 10/7/07, Developer Developer <devquestions@gmail.com> wrote:
>>
>> Hello,
>>
>> I have simple lucene 2.2 index created. I want to  list all the  
>> terms and
>> their positions in a document. how can I do it ?
>>
>> Can you please provide some sample code.
>>
>> Thanks !
>>


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message