lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From David Spencer <dave-lucene-u...@tropo.com>
Subject Re: I need 100 most frequently used words in different languages.
Date Thu, 12 May 2005 04:48:51 GMT
You could try downloading a copy of the wikipedia and processing the 
entries yourself. I don't know how well represented other languages are 
but there's lot of English.

Ahmet Aksoy wrote:

> Hi,
> I have a project which will be used in order to supply automatic 
> dictionary helps in different languages.
> I'm using Lucene for indexing, and searching the words in it.
> It is an open source project in java at address 
> http://belletmen.dev.java.net
> Now, I will prepare a function to find the natural language used in the 
> documentations.
> For this purpose, I'll use the 100 most frequently used words in 
> different languages.
> Now I have only Turkish, English, German, and Finnish lists.
> If you can help me, I'll be very glad.
> Regards.
> Ahmet Aksoy
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message