lucy-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "David E. Wheeler" <da...@kineticode.com>
Subject Re: [lucy-dev] utf8proc, control chars and non-character code points
Date Wed, 14 Dec 2011 16:34:25 GMT
On Dec 14, 2011, at 2:18 AM, Peter Karman wrote:

> Swish3 uses \003 control character as an internal field delimiter so passing
> that through is pretty vital. Are you saying that utf8proc chokes on that valid
> UTF-8 sequence?

I do the same thing to index lists of things on Lucy in PGXN:

  https://github.com/pgxn/pgxn-api/blob/master/lib/PGXN/API/Indexer.pm#L77

Best,

David


Mime
View raw message