lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doug Cutting <>
Subject Re: inter-term correlation [was Re: Vector Space Model in Lucene?]
Date Fri, 14 Nov 2003 20:41:08 GMT
Leo Galambos wrote:
> There are other (more trivial) problems as well. One geek from UFAL (our 
> NLP lab) reported, that it was a hard problem to find the boundaries, or 
> rather, to say whether a dot is a dot or something else, i.e. "blah, 
> i.e. blah" "i.b.m." "i.p. pavlov" "3.14" "28.10.2003" etc.
> On the other hand, I would rather like to know the model which is 
> implemented by Lucene. If it is not a vector model, what is it? ;-)

I would call it a vector space model.

The best description of how Lucene scores is:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message