lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris Salem" <ch...@mainsequence.net>
Subject Re: searching for c++, c#, etc...
Date Thu, 16 Jul 2009 14:23:02 GMT
I'm using the StandardAnalyzer for both searching and indexing.
Here's the code to parse the query:
Searcher searcher = new IndexSearcher(reader);
Analyzer analyzer = new StandardAnalyzer(stopwords);
System.out.println(queryString);
QueryParser qp = new QueryParser(searchField,analyzer);
Query query = qp.parse(queryString);
queryString = query.toString();
System.out.println(queryString);
And here's the output from the println's:
r2_resume_text:c\+\+ AND r2_resume_text: c\#
+r2_resume_text:c +r2_resume_text:c
Also the documentation doesn't say anything about # having to be escaped.
Do I have to escape during indexing too?
Sincerely,
Chris Salem 



----- Original Message ----- 
To: java-user@lucene.apache.org, Chris Salem <chris@mainsequence.net>
From: Ian Lea <ian.lea@gmail.com>
Sent: 7/16/2009 5:12:53 AM
Subject: Re: searching for c++, c#, etc...


Hi


Escaping should work. See
http://lucene.apache.org/java/2_4_1/queryparsersyntax.html and
QueryParser.escape(). And you need to be sure that your analyzer
isn't removing the plus signs and that you use the same analyzer for
indexing and searching.

Googling for something like "lucene escape" will find you more info.

Luke will tell you what is actually in your index.


--
Ian.


On Wed, Jul 15, 2009 at 5:19 PM, Chris Salem<chris@mainsequence.net> wrote:
> Hello,
> I'm trying to search for the terms like c++ but the parser is stripping off the ++. 
I tried escaping the ++ with slashes but it's still stripping it off.  I could replace + with
"plus", is that the best way to do it?  How come escaping isn't working?
> thanks
> Sincerely,
> Chris Salem
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message