lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stefan Groschupf>
Subject Re: inter-term correlation [was Re: Vector Space Model in Lucene?]
Date Sat, 15 Nov 2003 02:12:51 GMT

>On Friday 14 November 2003 13:39, Chong, Herb wrote:
>>you're describing ad-hoc solutions to a problem that have an effect, but
>>not one that is easily predictable. one can concoct all sorts of
>>combinations of the query operators that would have something of the effect
>>that i am describing. crossing sentence boundaries, however, can't be done
I'm not sure understand you right. You wish to make language based 
queries to a IR system?
If I'm right may be I can help you!
Here is presentation by the MIT that does something similar.
That is a video > 1h. But the first talk will contain the part is 
interesting for you.
Leslie Pack Kaelbling: MIT Professor, Electrical Engineering and 
Computer Science hold it.
(the sequence with the two guys with a phone is interesting for you).
They just have 2 components around the "nl ir query". It is speech to 
text and text to speech.

What you can do is use a pos tagger (i.e. a maximum entropy model based 
or  Brill tagger if you just have english) and use a data mining 
algorithm for weight your terms.
May be you can use a hidden Markov model for that.

You can build this on top of lucene, shouldn't be that difficult.

But may be I understand you wrong.. ..


open technology:
open source:
open discussion:

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message