lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-4176) Can not produce proper collation key for ICUCollatedTermAttributeImp
Date Thu, 28 Jun 2012 11:57:43 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-4176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403038#comment-13403038
] 

Robert Muir commented on LUCENE-4176:
-------------------------------------

Thanks for reporting this: the bug is actually AnalyzingQueryParser. it should not consume
with CharTermAttribute.toString(), instead it should just consume the bytes.
                
> Can not produce proper collation key for ICUCollatedTermAttributeImp
> --------------------------------------------------------------------
>
>                 Key: LUCENE-4176
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4176
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: modules/queryparser
>    Affects Versions: 5.0
>            Reporter: Nattapong Sirilappanich
>
> org.apache.lucene.collation.tokenattributes.ICUCollatedTermAttributeImpl return a hash
of collation key's byte.
> The given hash value produce incorrect comparison result.
> The source code below return 1 for Lucene 3.6.
> The code here return 0.
> Code to reproduce:
> IndexWriter writer = new IndexWriter(ramDir, conf);
> Document doc = new Document();
> FieldType fieldType = new FieldType();
> fieldType.setIndexed(true);
> fieldType.setStored(true);
> Field field = new Field("content","เข", fieldType);
> doc.add(field);
> writer.addDocument(doc);
> writer.close();
> IndexSearcher is = new IndexSearcher(DirectoryReader.open(ramDir));
> QueryParser qp = new AnalyzingQueryParser(Version.LUCENE_50,"content", analyzer);
> ScoreDoc[] result = is.search(qp.parse("[\u0e01 TO \u0e03]"), null,1000).scoreDocs;
> System.out.println(result.length);

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message