incubator-couchdb-user mailing list archives

From Paul Davis <paul.joseph.da...@gmail.com>
Subject Re: Problem with czech alphabet in View_Collation
Date Tue, 31 May 2011 17:42:22 GMT
2011/5/31 Robert Newson <robert.newson@gmail.com>:
> We already use ICU for collation and the keys you mentioned are
> correctly ordered in UCA order. What's missing is support for custom
> tailoring rules, I think.
>
> B.
>
> 2011/5/31 Ladislav Thon <ladicek@gmail.com>:
>> IIRC, this isn't supported right now, but will be (might be? :-) ) in the
>> future. See this thread:
>> http://www.mail-archive.com/user@couchdb.apache.org/msg10606.html for
>> previous discussion.
>>
>> LT
>>
>> 2011/5/31 Ladislav Prskavec <ladislav@prskavec.net>
>>
>>> I have a simple view:
>>>
>>> function(doc) {
>>>   if (doc.parentId == "0") {
>>>     emit(doc.seoname, [doc.id, doc.seourl, doc.rank]);
>>>   }
>>> }
>>>
>>> and in results:
>>>
>>> {"id":"S0001","key":"Brno Business School","value":["S0001","brno-business-school",5]},
>>> {"id":"41000","key":"\u010desk\u00e1 zem\u011bd\u011blsk\u00e1 univerzita v Praze","value":["41000","ceska-zemedelska-univerzita-v-praze",5]},
>>> {"id":"21000","key":"\u010cesk\u00e9 vysok\u00e9 u\u010den\u00ed technick\u00e9 v Praze","value":["21000","ceske-vysoke-uceni-technicke-v-praze",5]},
>>> {"id":"7D000","key":"CEVRO Institut, o. p. s.","value":["7D000","cevro-institut-o-p-s",5]},
>>> {"id":"S0003","key":"CMC Graduate School of Business o.p.s.","value":["S0003","cmc-graduate-school-of-business-o-p-s",5]},
>>>
>>> My problem is that \u010c (Č) sorts before C, but in the Czech alphabet
>>> the order is C, Č.
>>> Is this a bug? ICU can solve this.
>>>
>>> Thanks for help.
>>>
>>>
>>>
>>
>

Now that we've moved to using NIFs, I had been contemplating rewriting
the ICU driver as a NIF to see if there were any performance
differences. As part of that, I would investigate the ability to pass
in these tailoring bits to allow people to do the fancier ICU collation
that has been requested a couple of times.
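
To illustrate the difference between untailored UCA ordering and Czech
tailoring discussed in this thread, here is a minimal client-side sketch.
It is not a CouchDB feature and does not change how view keys are
collated on the server; it only re-sorts keys after they have been
fetched. It assumes a JavaScript runtime whose standard Intl.Collator
has Czech ("cs") locale data available (for example, a Node.js build
with full ICU). The variable names are made up for the example, and the
keys are the ones from the original message.

```javascript
// Keys taken from the view results quoted in this thread.
const keys = [
  "Brno Business School",
  "\u010desk\u00e1 zem\u011bd\u011blsk\u00e1 univerzita v Praze",
  "\u010cesk\u00e9 vysok\u00e9 u\u010den\u00ed technick\u00e9 v Praze",
  "CEVRO Institut, o. p. s.",
  "CMC Graduate School of Business o.p.s.",
];

// Untailored ordering: "en" adds no rules for C/Č, so the caron is only
// a secondary (accent-level) difference and "České" sorts among the "Ce" words.
const untailored = [...keys].sort(new Intl.Collator("en").compare);

// Czech tailoring: Č is treated as a separate letter that follows C.
// Assumption: the runtime actually ships "cs" collation data.
const hasCzech = Intl.Collator.supportedLocalesOf(["cs"]).length > 0;
const czechOrder = [...keys].sort(new Intl.Collator("cs").compare);

console.log("untailored:", untailored);
console.log("czech (locale data present: " + hasCzech + "):", czechOrder);
```

If the runtime has the Czech data, the second list should put the CEVRO
and CMC entries before both Č entries, which is the C, Č order asked for
in the original message. If it does not, the collator falls back to
another locale and the two lists will likely match.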
