lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Pasha Bizhan" <fc...@ok.ru>
Subject RE: How to include a multi-word synonym to a word when indexing?
Date Tue, 12 Apr 2005 04:27:26 GMT
Hi, 

> From: Erik Hatcher [mailto:erik@ehatchersolutions.com] 

> > My problem is, however, that some words needs to have alternatives 
> > where the word is decomposed / decompounded into two or more words:
> >
> > "FooBar Corp" or "cybercafe"
> >
> > should be found when searching for
> >
> > "Foo Ba*" or "cyber cafe"
 
> You'll need some kind of lookup to know how to split a token like 
> "cybercafe" into two words - once you've done that it will be easy to 
> set the position increment of them to zero so that they overlay the 
> original term.

What about putting all synonyms into index? 
Foo Bar Corp, FooBar Corp, FooBarCorp, cyber cafe, cybercafe etc?
In this case we do no need analyze input query.

Pasha Bizhan
 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message