lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mike Sokolov <soko...@ifactory.com>
Subject Re: find meaningful words through Lucene
Date Wed, 27 Jun 2012 22:24:34 GMT
Maybe high frequency terms that are not evenly distributed throughout 
the corpus would be a better definition.  Discriminative terms.  I'm 
sure there is something in the machine learning literature about 
unsupervised clustering that would help here.  But I don't know what it 
is :)

-Mike

On 06/27/2012 05:09 AM, Ian Lea wrote:
> All words are important if they help people find what they want.
>
> Maybe you want high frequency terms.  See contrib class
> org.apache.lucene.misc.HighFreqTerms.
>
>
> --
> Ian.
>
>
> On Wed, Jun 27, 2012 at 3:04 AM, 齐保元<qibaoyuan@126.com>  wrote:
>    
>> meaningful just means the word is important than others,like keywords/keyphrase.
>>
>>
>>
>>
>>
>>      
>>> Please define meaningful.
>>>
>>> --
>>> Ian.
>>>
>>>
>>> On Tue, Jun 26, 2012 at 10:39 AM,<qibaoyuan@126.com>  wrote:
>>>        
>>>> hi,  does anyone knows how to extract meaningful words from Lucene index?
>>>>          
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>>
>>>        
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>    

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message