Return-Path: Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: (qmail 40160 invoked from network); 9 Mar 2011 12:53:25 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 9 Mar 2011 12:53:25 -0000 Received: (qmail 84478 invoked by uid 500); 9 Mar 2011 12:53:23 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 84364 invoked by uid 500); 9 Mar 2011 12:53:23 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 84350 invoked by uid 99); 9 Mar 2011 12:53:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Mar 2011 12:53:23 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Mar 2011 12:53:21 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 7DA883A063C for ; Wed, 9 Mar 2011 12:52:59 +0000 (UTC) Date: Wed, 9 Mar 2011 12:52:59 +0000 (UTC) From: "Bernd Fehling (JIRA)" To: dev@lucene.apache.org Message-ID: <1259993853.7800.1299675179511.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <625872369.15.1298659341832.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] Commented: (SOLR-2381) The included jetty server does not support UTF-8 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/SOLR-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13004519#comment-13004519 ] Bernd Fehling commented on SOLR-2381: ------------------------------------- I just debugged jetty-6.1.26-patched-JETTY-1340 and located the bug. As I already said above, it is due to buffer size of 512 bytes for output and the surrogates. If the buffer is filled up to 510 bytes and the next one is a UTF-8 above BMP (greater than 2 bytes) then jetty is in trouble. Quickest fix would be to use the working code snippet from jetty-7.3.1 and replace the buggy jetty-6.1.26-patched-JETTY-1340. Unfortunately, as we are switching from FAST System to Solr, all our Interfaces are using XML. We never had any problems with FAST, XML and UTF-8. It would be a mess reworking everything to JSON just for Solr. > The included jetty server does not support UTF-8 > ------------------------------------------------ > > Key: SOLR-2381 > URL: https://issues.apache.org/jira/browse/SOLR-2381 > Project: Solr > Issue Type: Bug > Reporter: Robert Muir > Assignee: Robert Muir > Priority: Blocker > Fix For: 3.1, 4.0 > > Attachments: SOLR-2381.patch, SOLR-2381_xmltest.patch, SOLR-ServletOutputWriter.patch, jetty-6.1.26-patched-JETTY-1340.jar, jetty-util-6.1.26-patched-JETTY-1340.jar, post_utf8enhanced.sh, utf8enhanced.xml > > > Some background here: http://www.lucidimagination.com/search/document/6babe83bd4a98b64/which_unicode_version_is_supported_with_lucene > Some possible solutions: > * wait and see if we get resolution on http://jira.codehaus.org/browse/JETTY-1340. To be honest, I am not even sure where jetty is being maintained (there is a separate jetty project at eclipse.org with another bugtracker, but the older releases are at codehaus). > * include a patched version of jetty with correct utf-8, using that patch. > * remove jetty and include a different container instead. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org