lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From maurits van wijland <m.vanwijl...@quicknet.nl>
Subject Re: Java TextCat 0.1
Date Fri, 07 Nov 2003 19:12:03 GMT
Hi all,

Incze,  do you choose the analyer when indexing and seraching? how?
Can you send an example code?

I have tried this with a naive bayes language guesser, but the problem i
found is that whren searching, the query words are to 'small' to accurately
predict a language...

So, how do you manage?

kind regards,

Maurits van Wijland


----- Original Message ----- 
From: "Incze Lajos" <incze@mail.matav.hu>
To: "Lucene Developers List" <lucene-dev@jakarta.apache.org>
Sent: Friday, November 07, 2003 2:31 AM
Subject: Re: Java TextCat 0.1


> On Thu, Nov 06, 2003 at 02:14:11PM +0100, Patrick Debois wrote:
> > Java interfacing with libtextcat. Might be of interest for you
(according
> > to the mailling lists)
> >
> > I've used it for choosing the correct analyzer in Lucene Snowball
> >
> > I will provide it on my website http://www.jedi.be/JTextCat/index.html
> >
> > Hope it does not violate any copyrights.
> >
> > ---------------------------------------------------------------------
>
> Have you seen this project?
>
> http://ngramj.sourceforge.net/
>
> (Pure java N-Gram lib, with a sample servlet.)
>
> incze
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-dev-help@jakarta.apache.org
>


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org


Mime
View raw message