Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: java-user@lucene.apache.org
Received-SPF: pass (herse.apache.org: local policy)
Message-ID: <461619F7.2030206@buongiorno.com>
Date: Fri, 06 Apr 2007 11:59:19 +0200
From: Roberto Fonti <roberto.fonti@buongiorno.com>
User-Agent: Mozilla Thunderbird 1.5.0.10 (Windows/20070221)
MIME-Version: 1.0
To: java-user@lucene.apache.org
Subject: UN_TOKENIZED and StandardAnalyzer
Content-Type: multipart/alternative;
 boundary="------------070202090601030002010801"

--------------070202090601030002010801
Content-Type: text/plain; charset=ISO-8859-15; format=flowed
Content-Transfer-Encoding: 7bit

Hi All,
I'm indexing categories with this code:

for (Category category : item.getCategories()) {            
    lucene_doc.add(new Field(
        "CATEGORY",
        category.getName(),
        Field.Store.NO,
        Field.Index.UN_TOKENIZED));               
}

And searching using the query:

String query = "CATEGORY:("+category.getName()+")";

I've configured to use the StandardAnalyzer both in the IndexWriter for 
the QueryParser.

Everything goes fine BUT with categories that contains whitespaces (or 
other chars that get tokenized).

* If category is "sport" - ok, I get the result from the search
* If category is "winter sport" - I get no result from search

I've tried with a number of search syntax:
+CATEGORY:"winter sport"
+CATEGORY:winter +CATEGORY:sport
+CATEGORY:(winter sport)
and other...
but none of them work.

What's wrong with that?
By the way, using the KeywordAnalyzer it works, but it is not the 
correct analyzer for my application.
Shouldn't the Analyzer be ignored for a Field.Index.UN_TOKENIZED field?

Thanks,
Roberto
 

--------------070202090601030002010801--