lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mathieu Lecarme <math...@garambrogne.net>
Subject Re: Lucene and Google Web 1T 5 Gram
Date Thu, 24 Apr 2008 07:58:30 GMT
Rafael Turk a écrit :
> Hi Mathieu,
>
> *What do you wont to do?*
>
> An spell checker and related keyword suggestion
>
>   
Here is a spell checker wich I try to finalize :
https://admin.garambrogne.net/projets/revuedepresse/browser/trunk/src/java

> If you wont an ngram => popularity map, just use a berkley DB, and use this
> information in your Lucene application. Lucene is a reversed index, Berkeley
> DB an index.
>
> *Great ideia! Berkeley DB is definitely a try, simple and effective, but
> I'll have to work the data previously. I was hopping to take advantage of
> Lucene's built in features*
>   
Lucene provides nice tools without the need to index. Analyzer and 
TokenFilter can help you, i guess.

M.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message