lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jonathan Franzone" <jonat...@franzone.com>
Subject RE: Using the term "C/C++" in queries
Date Sun, 10 Mar 2002 16:56:44 GMT
*This message was transferred with a trial version of CommuniGate(tm) Pro*

I'm actually using a WhitespaceAnalyzer and WhitespaceTokenizer that I
wrote. It parses terms on, of course, whitespace. Is there a way to get a
list of the tokens that are generated after indexing?

-----Original Message-----
From: Brian Goetz [mailto:brian@quiotix.com]
Sent: Sunday, March 10, 2002 7:57 AM
To: Lucene Users List
Subject: Re: Using the term "C/C++" in queries


*This message was transferred with a trial version of CommuniGate(tm) Pro*

>Has anyone had any experience/problems with using "C/C++" in Lucene
queries?
>When I do a query using that as a term I get back a bunch of records that
do
>not contain the term. The resulting query ends up being like: +(+c/c++)
when
>parsed by the QueryParser. I've also tried just adding Term objects to a
>query myself and it has the same affect. Thanks for any help.

If you're using any of the basic analyzers, the / and + terms will be
dropped in tokenization.  If you want to search for terms like C++, you'll
need a tokenizer that treats + as a letter.


--
Brian Goetz
Quiotix Corporation
brian@quiotix.com           Tel: 650-843-1300            Fax: 650-324-8032

http://www.quiotix.com


--
To unsubscribe, e-mail:
<mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail:
<mailto:lucene-user-help@jakarta.apache.org>



--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message