lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dave Kor" <dave...@gmail.com>
Subject Re: word frequency list?
Date Mon, 04 Sep 2006 04:22:26 GMT
There is the Berkeley Web Term Frequency database which contains over
30 million unique terms extracted from 50 million webpages.

http://elib.cs.berkeley.edu/docfreq/index.html

On 8/31/06, Jason Pump <jpump@mindspring.com> wrote:
> Is there a large list of words and their frequency in the english
> language? Obviously it would differ by corpus but I would like to see
> what's already available.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

-- 
Dave Kor, PhD Candidate
Center for Information Mining and Extraction
School of Computing
National University of Singapore.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message