lucene-dev mailing list archives

From "Uwe Schindler (JIRA)" <j...@apache.org>
Subject [jira] Updated: (SOLR-2381) The included jetty server does not support UTF-8
Date Wed, 09 Mar 2011 15:03:01 GMT

     [ https://issues.apache.org/jira/browse/SOLR-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Uwe Schindler updated SOLR-2381:
--------------------------------

    Attachment: SOLR-ServletOutputWriter.patch

Robert and I discussed the Jetty OutputWriter and found:

- It's much more broken than expected: it does not even support writing half surrogates in
write(char[], offset, length), which may also fail for other ResponseWriters!
- Jetty's implementation is SLOOOOOOOOOOOW!

The attached patch no longer uses a Writer supplied by Jetty or any other servlet container
at all. It handles HTTP as what it is: a binary protocol using byte streams. As with UpdateReqHandler,
it uses its own mapper inside Solr (on the input side, ContentStream is used for that).

As most output in Solr is written in UTF-8 (the default), it uses a pre-looked-up NIO Charset
for that.
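
To illustrate the idea (this is not the attached patch, whose contents are not reproduced
here), the following is a minimal sketch that assumes a plain java.io.OutputStreamWriter as
the stand-in for Solr's own mapper and a ByteArrayOutputStream as the stand-in for the servlet
container's output stream. The class and method names are hypothetical. It writes one supplementary
character as two separate write(char[], offset, length) calls, one half surrogate each,
which is the case described above.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.Charset;

// Hypothetical sketch, not code from the SOLR-2381 patch.
public class Utf8ResponseSketch {

    // Looked up once and reused, instead of resolving the charset by name per response.
    static final Charset UTF8 = Charset.forName("UTF-8");

    // Wraps the raw byte stream so that char-to-byte encoding happens in the
    // application, not in a Writer supplied by the servlet container.
    static Writer utf8Writer(OutputStream rawOut) {
        return new OutputStreamWriter(rawOut, UTF8);
    }

    // Writes U+1D11E (a supplementary character) with its surrogate pair
    // split across two write(char[], offset, length) calls.
    static byte[] writeSplitSurrogates() throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream(); // stand-in for the response stream
        Writer writer = utf8Writer(out);
        char[] pair = Character.toChars(0x1D11E); // two UTF-16 chars: high + low surrogate
        writer.write(pair, 0, 1); // high surrogate only
        writer.write(pair, 1, 1); // low surrogate arrives in the next call
        writer.flush();
        return out.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        StringBuilder hex = new StringBuilder();
        for (byte b : writeSplitSurrogates()) {
            hex.append(String.format("%02X ", b & 0xFF));
        }
        System.out.println(hex.toString().trim());
    }
}
```

In a servlet, the stream passed to utf8Writer would presumably be the one returned by
ServletResponse.getOutputStream(), with the charset declared in the Content-Type header.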

> The included jetty server does not support UTF-8
> ------------------------------------------------
>
>                 Key: SOLR-2381
>                 URL: https://issues.apache.org/jira/browse/SOLR-2381
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>            Priority: Blocker
>             Fix For: 3.1, 4.0
>
>         Attachments: SOLR-2381.patch, SOLR-2381_xmltest.patch, SOLR-ServletOutputWriter.patch,
> SOLR-ServletOutputWriter.patch, jetty-6.1.26-patched-JETTY-1340.jar, jetty-6.1.26-patched-SOLR-2381.jar,
> jetty-util-6.1.26-patched-JETTY-1340.jar, jetty-util-6.1.26-patched-SOLR-2381.jar, post_utf8enhanced.sh,
> utf8enhanced.xml
>
>
> Some background here: http://www.lucidimagination.com/search/document/6babe83bd4a98b64/which_unicode_version_is_supported_with_lucene
> Some possible solutions:
> * wait and see if we get resolution on http://jira.codehaus.org/browse/JETTY-1340. To
> be honest, I am not even sure where jetty is being maintained (there is a separate jetty project
> at eclipse.org with another bugtracker, but the older releases are at codehaus).
> * include a patched version of jetty with correct utf-8, using that patch.
> * remove jetty and include a different container instead.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

