lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chong, Herb" <HCho...@bloomberg.com>
Subject RE: Can use Lucene be used for this
Date Thu, 13 Nov 2003 13:35:56 GMT
i suggest that you use a special tokenizer that breaks chemical names into their constituent
parts and index them as if they were words.

Herb....

-----Original Message-----
From: Hackl, Rene [mailto:Rene.Hackl@FIZ-Karlsruhe.DE]
Sent: Thursday, November 13, 2003 7:17 AM
To: 'Lucene Users List'
Subject: Re: Can use Lucene be used for this

Yes and yes. Users range from Information Professionals to "naive" end
users.
If there's a string like "N-(t-Butyl)-N-(3,5-dinitrobenzoyl)-nitroxyl" users
can be expected to search for "dinitro", "3,5-dinitro", "nitrobenz" etc.

There are also sequences of amino acids or DNA that users might want to
match
partially. 

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message