lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mead Lai <laiqi...@gmail.com>
Subject Re: Language Identifier with Lucene?
Date Mon, 24 Oct 2011 10:29:48 GMT
Luca,

I would like to know: how much language, your system could identify?
In my view, this difficult part in your system is: how to collect so many
languages/character in the world for *one person*...

Regards,
Mead


On Sun, Oct 23, 2011 at 1:27 AM, Petite Abeille <petite_abeille@me.com>wrote:

>
> On Oct 22, 2011, at 2:49 AM, Luca Rondanini wrote:
>
> > I usually use Nutch for this but, just for fun, I tried to create a
> language
> > identifier based on Lucene only.
>
> Talking of which:
>
> Google's Compact Language Detector
>
> http://blog.mikemccandless.com/2011/10/language-detection-with-googles-compact.html
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message