lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Hatcher <e...@ehatchersolutions.com>
Subject Re: Unexpected analyzer behaviour with special characters
Date Thu, 13 Oct 2005 12:42:17 GMT
StandardAnalyzer does not have any escaped-character handling  
capability.  So the backslash and dash are treated without any  
special care in "session\-one".

You will need to build your own Analyzer with a custom tokenizer to  
handle this type of escaping.

     Erik

On Oct 13, 2005, at 8:29 AM, Xavier Orri wrote:

> Hi,
>
> I don't understand why my third test fails. If I scape my "-  
> delimiter"
>  what I expect is a character behaviour.
>
> Could you clarify me what I'm doing wrong?
>
>
> public void testStandardAnalyzer() throws Exception {
>     Analyzer a = new StandardAnalyzer();
>     assertAnalyzesTo(a, "ses1-one3", new String[]{"ses1-one3"});
>     assertAnalyzesTo(a, "session-one", new String[]{"session","one"});
>     assertAnalyzesTo(a, "session\\-one", new String[]{"session\\- 
> one"});
> }
>
> Thanking in advance
> Xavier Orri
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message