lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-5505) hunspell SET/FLAG whitespace/BOM handling
Date Sat, 08 Mar 2014 17:10:42 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-5505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13924931#comment-13924931
] 

ASF subversion and git services commented on LUCENE-5505:
---------------------------------------------------------

Commit 1575569 from [~rcmuir] in branch 'dev/branches/branch_4x'
[ https://svn.apache.org/r1575569 ]

LUCENE-5505: hunspell SET/FLAG whitespace and BOM handling

> hunspell SET/FLAG whitespace/BOM handling
> -----------------------------------------
>
>                 Key: LUCENE-5505
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5505
>             Project: Lucene - Core
>          Issue Type: Bug
>            Reporter: Robert Muir
>             Fix For: 4.8, 5.0
>
>         Attachments: LUCENE-5505.patch
>
>
> Several dictionaries cannot be loaded today (Armenian, Papiamento, Macedonian, Russian,
Urdu) because they have stuff like SET<tab>UTF-8, FLAG<space><space>UTF-8
or have a BOM marker on the first line (or even combinations of these).
> Also because SET need not be the first line in the file, we should ignore BOM markers
on the first line in general (e.g. it might be something else like FLAG).



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message