lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bruce Ritchie" <br...@jivesoftware.com>
Subject RE: TFIDF Implementation
Date Tue, 14 Dec 2004 20:29:44 GMT
 
> > You can also see 'Books like this' example from here 
> > 
> https://secure.manning.com/catalog/view.php?book=hatcher2&item=source
> 
> Well done, uses a term vector, instead of reparsing the orig 
> doc, to form the similarity query. Also I like the way you 
> exclude the source doc in the query, I didn't think of doing 
> that in my code.

I agree, it's a good way to exclude the source doc.
 
> I don't trust calling vector.size() and vector.getTerms() 
> within the loop but I haven't looked at the code to see if it 
> calculates  the results each time or caches them...

>From the code I looked at, those calls don't recalculate on every call. 


Regards,

Bruce Ritchie

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message