lucene-java-user mailing list archives

From mark harwood <>
Subject Re: frequent phrases
Date Thu, 09 Aug 2007 11:52:57 GMT
The "CollocationFinder" code attached to this may be better suited...

Again, not exactly sure of your use case.


----- Original Message ----
From: karl wettin <>
Sent: Thursday, 9 August, 2007 11:16:35 AM
Subject: Re: frequent phrases

On 9 Aug 2007, at 09:34, Akanksha Baid wrote:

> I was wondering if there is a "search based" method to find the top-k
> frequent phrases in a set of documents. (I do not have a particular
> phrase in mind, so PhraseQuery can probably be ruled out.)
> I have implemented something that works using term vectors and term
> positions, but the performance is not great so far, since I am basically
> iterating multiple times and hacking my way around. I was wondering if
> an API exists for finding frequent phrases and/or if someone could point
> me to some code for the same.
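
For reference, the brute-force version of the problem described above can be sketched in plain Java as a single pass that counts every n-word window. This is only an illustration under stated assumptions: it uses no Lucene API, it is not the CollocationFinder code mentioned elsewhere in this thread, and the class name FrequentPhrases, the method topK and the naive tokenization are all made up for the example.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Lucene: counts n-word phrases in raw text.
public class FrequentPhrases {

    /** Returns up to k of the most frequent n-word phrases across all documents. */
    public static List<Map.Entry<String, Integer>> topK(List<String> docs, int n, int k) {
        Map<String, Integer> counts = new HashMap<>();
        for (String doc : docs) {
            // Assumption: naive tokenization (lowercase, split on anything
            // that is not a letter or digit). A real setup would reuse the
            // same analyzer as the index.
            List<String> tokens = new ArrayList<>();
            for (String t : doc.toLowerCase().split("[^\\p{L}\\p{N}]+")) {
                if (!t.isEmpty()) {
                    tokens.add(t);
                }
            }
            // Slide a window of n tokens over the document, one pass only.
            for (int i = 0; i + n <= tokens.size(); i++) {
                String phrase = String.join(" ", tokens.subList(i, i + n));
                counts.merge(phrase, 1, Integer::sum);
            }
        }
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        // Highest count first; ties broken alphabetically so output is stable.
        entries.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        return new ArrayList<>(entries.subList(0, Math.min(k, entries.size())));
    }
}
```

Calling FrequentPhrases.topK(docs, 2, 10) would return the ten most frequent two-word phrases. Memory grows with the number of distinct phrases, so for a large collection this is a baseline to compare against rather than a replacement for an index-based approach.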

I think this is the closest thing available in the issue tracker:

