Return-Path: Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: (qmail 16393 invoked from network); 9 Mar 2011 13:21:22 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 9 Mar 2011 13:21:22 -0000 Received: (qmail 34748 invoked by uid 500); 9 Mar 2011 13:21:21 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 34699 invoked by uid 500); 9 Mar 2011 13:21:21 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 34692 invoked by uid 99); 9 Mar 2011 13:21:21 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Mar 2011 13:21:21 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Mar 2011 13:21:20 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 7E59F3A03B5 for ; Wed, 9 Mar 2011 13:20:59 +0000 (UTC) Date: Wed, 9 Mar 2011 13:20:59 +0000 (UTC) From: "Robert Muir (JIRA)" To: dev@lucene.apache.org Message-ID: <1973386726.7864.1299676859513.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <625872369.15.1298659341832.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] Commented: (SOLR-2381) The included jetty server does not support UTF-8 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/SOLR-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13004534#comment-13004534 ] Robert Muir commented on SOLR-2381: ----------------------------------- bq. Quickest fix would be to use the working code snippet from jetty-7.3.1 and replace the buggy jetty-6.1.26-patched-JETTY-1340. There's nothing quick about fixing bugs in jetty at all: for example the 6.1 branch's unit test suite does not even reliably pass out of box, making it difficult to test changes. I'm certainly going to fix it, but its going to take probably a day of my time to ensure that its done safely (just like it took a day for me to fix the previous jetty bug on this issue). {quote} Unfortunately, as we are switching from FAST System to Solr, all our Interfaces are using XML. We never had any problems with FAST, XML and UTF-8. It would be a mess reworking everything to JSON just for Solr. {quote} I'm not really concerned at all with what FAST does or doesn't do. I still stand by my statement that I strongly recommend against the use of XML (in general, nothing to do with Jetty) if you need correct unicode support and are using java-based components. This is just my practical advice based on building applications that have to work with all of unicode. You won't be reworking just for Solr, its pretty likely as your system grows you will run into other unicode bugs in java-based XML libraries, too. > The included jetty server does not support UTF-8 > ------------------------------------------------ > > Key: SOLR-2381 > URL: https://issues.apache.org/jira/browse/SOLR-2381 > Project: Solr > Issue Type: Bug > Reporter: Robert Muir > Assignee: Robert Muir > Priority: Blocker > Fix For: 3.1, 4.0 > > Attachments: SOLR-2381.patch, SOLR-2381_xmltest.patch, SOLR-ServletOutputWriter.patch, jetty-6.1.26-patched-JETTY-1340.jar, jetty-util-6.1.26-patched-JETTY-1340.jar, post_utf8enhanced.sh, utf8enhanced.xml > > > Some background here: http://www.lucidimagination.com/search/document/6babe83bd4a98b64/which_unicode_version_is_supported_with_lucene > Some possible solutions: > * wait and see if we get resolution on http://jira.codehaus.org/browse/JETTY-1340. To be honest, I am not even sure where jetty is being maintained (there is a separate jetty project at eclipse.org with another bugtracker, but the older releases are at codehaus). > * include a patched version of jetty with correct utf-8, using that patch. > * remove jetty and include a different container instead. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org