lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mike Snare <mikesn...@gmail.com>
Subject Re: Why does the StandardTokenizer split hyphenated words?
Date Thu, 16 Dec 2004 12:46:23 GMT
> Not if these words are spelling variations of the same concept, which
> doesn't seem unlikely.
> 
> > In addition, why do we assume that a-1 is a "typical product name" but
> > a-b isn't?
> 
> Maybe for "a-b", but what about English words like "half-baked"?

Perhaps that's the difference in thinking, then.  I would imagine that
you would want to search on "half-baked" and not "half AND baked".

> Regards
> Daniel
> 
> --
> http://www.danielnaber.de
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> 
>

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message