lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chong, Herb" <HCho...@bloomberg.com>
Subject RE: Vector Space Model in Lucene?
Date Fri, 14 Nov 2003 20:15:34 GMT
it has its limitations. that is why i am looking at what it would take to solve some of them.
parsing documents to recognize sentences and storing sentence boundaries in the index would
solve the ones that are most limiting. superposing interterm correlation on top of Lucene
isn't very useful because then you have to build a new almost-duplicate index. since when
people type in more than one word as a query, they almost invariably mean phrases, a search
engine that doesn't take advantage of that isn't making use of fundamental linguistic knowledge.
what has to be stored isn't much, but implementing it can be.

Herb....

-----Original Message-----
From: Erik Hatcher [mailto:erik@ehatchersolutions.com]
Sent: Friday, November 14, 2003 3:08 PM
To: Lucene Users List
Subject: Re: Vector Space Model in Lucene?


I get the feeling you're looking for reasons that Lucene is inadequate. 
  This may be the case for the uses you're speaking of, but there is 
quite a bit of flexibility with Lucene in terms of Analysis, scoring, 
and custom Query implementations that all relate to what you've been 
speaking of.  And, of course, Lucene is a low-level component of which 
a higher level piece could be built around.

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message