lucene-dev mailing list archives

From bugzi...@apache.org
Subject DO NOT REPLY [Bug 23505] New: - Russian Analyzer assumes default encoding is iso-8859-1
Date Tue, 30 Sep 2003 01:08:13 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23505>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23505

Russian Analyzer assumes default encoding is iso-8859-1

           Summary: Russian Analyzer assumes default encoding is iso-8859-1
           Product: Lucene
           Version: CVS Nightly - Specify date in submission
          Platform: Macintosh
        OS/Version: MacOS X
            Status: NEW
          Severity: Normal
          Priority: Other
         Component: Analysis
        AssignedTo: lucene-dev@jakarta.apache.org
        ReportedBy: hani@formicary.net


On OS X, the default encoding is MacRoman, so TestRussianAnalyzer fails because the test file
is not read in correctly.

The correct solution is to explicitly specify that the test file should be read using the
iso-8859-1 encoding. I've attached a patch.
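
For readers without the attachment, here is a minimal sketch of the general technique, not the patch itself. It assumes the fix amounts to passing the charset name to InputStreamReader instead of relying on the platform default. The class name ExplicitEncodingExample and the helper readAll are hypothetical and do not come from the Lucene sources.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;

// Hypothetical illustration; not taken from the attached patch or from Lucene.
public class ExplicitEncodingExample {

    // Decodes every byte of the stream using the named charset,
    // so the result does not depend on the platform default encoding.
    static String readAll(InputStream in, String charsetName) throws IOException {
        Reader reader = new InputStreamReader(in, charsetName);
        try {
            StringBuilder sb = new StringBuilder();
            char[] buf = new char[1024];
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
            return sb.toString();
        } finally {
            reader.close();
        }
    }

    public static void main(String[] args) throws IOException {
        // A single byte outside the ASCII range; how it decodes depends on the charset.
        byte[] data = { (byte) 0xE9 };
        String s = readAll(new ByteArrayInputStream(data), "ISO-8859-1");
        // In ISO-8859-1, byte 0xE9 maps to U+00E9, so this prints 233.
        System.out.println((int) s.charAt(0));
    }
}
```

A reader built with the one-argument InputStreamReader constructor would decode the same byte according to whatever the platform default happens to be, which is why a test can pass on one operating system and fail on another.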
