lucene-java-user mailing list archives

From Otis Gospodnetic <>
Subject Contributing to Lucene (was RE: inter-term correlation [was Re: Vector Space Model in Lucene?])
Date Fri, 14 Nov 2003 23:45:19 GMT
Hello Herb,

I don't approve of the teasing, mean-spirited emails I saw from a few
people.  This one is a serious and polite email. :)

It sounds like you know NLP and see places where Lucene could be
improved.  Lucene is open source and free, and could benefit from
knowledgeable people like you.  Are you interested in contributing some
computational linguistics smarts?  That could be an improvement to the
Lucene core (as long as it doesn't make Lucene harder to use or its
code significantly more complex and harder to maintain), or something
that lives outside of Lucene's core: an add-on module, some kind of
extension, or even just an application built on top of Lucene.


--- "Chong, Herb" <> wrote:
> you're describing ad-hoc solutions to a problem that have an effect,
> but not one that is easily predictable. one can concoct all sorts of
> combinations of the query operators that would have something of the
> effect that i am describing. crossing sentence boundaries, however,
> can't be done without having some sentence boundaries as a reference.
> on top of this, there is a relatively simple concept which, if
> implemented, takes away all the ad-hocness of the solutions and
> replaces it with something that is both linguistically and
> mathematically sound and that, on top of that, won't materially make
> the engine core more complicated. that concept is that multiword queries
> are mostly multiword terms and they can't cross sentence boundaries
> according to the rules of English.
> Herb....
> -----Original Message-----
> From: Doug Cutting []
> Sent: Friday, November 14, 2003 3:33 PM
> To: Lucene Users List
> Subject: Re: inter-term correlation [was Re: Vector Space Model in
> Lucene?]
> Certainly there are lots of scoring algorithms that one cannot easily
> implement with Lucene.  I'm just not yet clear on what you need to do
> that Lucene cannot support.
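
The sentence-boundary idea in the quoted message can be illustrated with a
rough sketch.  This is plain Java, not Lucene code, and it is not anyone's
actual proposal from this thread: the class name, the SENTENCE_GAP constant
and the naive tokenizer are all assumptions made up for the example.  It
assigns each token a position, leaves a gap in the positions at every
sentence break, and only accepts a phrase whose words sit at consecutive
positions, so a phrase can never match across a sentence boundary.

```java
import java.util.ArrayList;
import java.util.List;

public class SentenceBoundaryPhrase {

    // Hypothetical gap added to token positions at each sentence break.
    // Any value >= 1 blocks exact phrases; a larger value would also
    // block loose proximity matches up to that distance.
    static final int SENTENCE_GAP = 100;

    /** A term together with its (gapped) position in the document. */
    static final class Token {
        final String term;
        final int position;

        Token(String term, int position) {
            this.term = term;
            this.position = position;
        }
    }

    /** Naive tokenizer: sentences end at '.', '!' or '?'; words are runs of letters/digits. */
    static List<Token> tokenize(String text) {
        List<Token> tokens = new ArrayList<>();
        int position = 0;
        for (String sentence : text.split("[.!?]+")) {
            boolean sawWord = false;
            for (String word : sentence.toLowerCase().split("[^a-z0-9]+")) {
                if (word.isEmpty()) {
                    continue; // leading separator produces an empty piece
                }
                tokens.add(new Token(word, position++));
                sawWord = true;
            }
            if (sawWord) {
                position += SENTENCE_GAP; // open a gap before the next sentence
            }
        }
        return tokens;
    }

    /** True if the query words occur at consecutive positions, i.e. inside one sentence. */
    static boolean matchesPhrase(String text, String... queryWords) {
        List<Token> tokens = tokenize(text);
        for (int i = 0; i + queryWords.length <= tokens.size(); i++) {
            int start = tokens.get(i).position;
            boolean ok = true;
            for (int j = 0; j < queryWords.length && ok; j++) {
                Token t = tokens.get(i + j);
                ok = t.term.equals(queryWords[j].toLowerCase())
                        && t.position == start + j;
            }
            if (ok) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        String doc = "He studied the vector space. Model training came later.";
        System.out.println(matchesPhrase(doc, "vector", "space")); // prints true
        System.out.println(matchesPhrase(doc, "space", "model"));  // prints false
    }
}
```

Without the gap, "space" and "model" would land on adjacent positions and
the second query would match even though the two words belong to different
sentences.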
