lucene-dev mailing list archives

From Tatu Saloranta <t...@hypermall.net>
Subject Re: N-gram layer
Date Wed, 04 Feb 2004 06:09:46 GMT
On Tuesday 03 February 2004 02:18, karl wettin wrote:
> On Tue, 3 Feb 2004 09:54:19 +0100
>
> karl wettin <kalle@snigel.dnsalias.net> wrote:
> > test has a weight of 1731 in Swedish
> > test has a weight of 1726 in Danish
>
> Oh dear. Mine fails too.

Swedish, Danish and Norwegian are very similar to each other, so this is
probably one of the tougher cases to distinguish. Even more so for an example
like "jag heter Kalle", where one of the words is a proper noun rather than a
word of the language. I guess what I'm saying is that, this being a heuristic,
mixing up similar languages is less dangerous than mixing up more distant ones.
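
To make that concrete, here is a minimal sketch of treating near-ties as
"ambiguous" instead of forcing a pick. This is not code from the n-gram layer
being discussed: the class LanguageGuess, the method guess and the parameter
minRelativeMargin are all hypothetical, and it assumes that a higher weight
means a better match, which the thread does not actually state.

```java
import java.util.Map;

// Hypothetical helper, not part of Lucene or of the n-gram layer in this thread.
class LanguageGuess {

    /**
     * Returns the language with the highest weight, or null when the two best
     * weights are too close to call. Assumes weights are positive and that a
     * higher weight means a better match (an assumption, see above).
     */
    static String guess(Map<String, Integer> weights, double minRelativeMargin) {
        String best = null;
        int bestScore = Integer.MIN_VALUE;
        int secondScore = Integer.MIN_VALUE;

        // Single pass that tracks the best and second-best weights.
        for (Map.Entry<String, Integer> e : weights.entrySet()) {
            int score = e.getValue();
            if (score > bestScore) {
                secondScore = bestScore;
                bestScore = score;
                best = e.getKey();
            } else if (score > secondScore) {
                secondScore = score;
            }
        }

        // No candidates, or a non-positive best weight: nothing sensible to report.
        if (best == null || bestScore <= 0) {
            return null;
        }
        // Only one candidate: there is nothing to confuse it with.
        if (secondScore == Integer.MIN_VALUE) {
            return best;
        }

        // Relative gap between the winner and the runner-up.
        double margin = (bestScore - secondScore) / (double) bestScore;
        return margin >= minRelativeMargin ? best : null;
    }
}
```

With the two weights quoted above (1731 for Swedish, 1726 for Danish) the
relative gap is 5/1731, roughly 0.3%, so a 5% threshold would report the input
as ambiguous rather than picking Swedish. A caller could then fall back to
"some Scandinavian language", which is the less dangerous kind of mix-up.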

-+ Tatu +-

