Message-ID:
<931ca11b0606200835q6d05e91dx599f215d9b8c690d@mail.gmail.com>
Date: Tue, 20 Jun 2006 11:35:05 -0400
From: "Mike Richmond"
To: solr-user@lucene.apache.org
Subject: Re: Invalid XML returned from Solr
References: <931ca11b0606200804u7b6d9906u23068fe4db0e2745@mail.gmail.com>

Hi Yonik,

Thanks for the quick reply. I am willing to give you access to my
index, config files, or any other pieces that you may need if it would
help. I am basically running the example application (which uses
Jetty), but with a modified schema.xml and a couple of other small
changes.

I'll look into giving Tomcat a try over Jetty.

--Mike

On 6/20/06, Yonik Seeley wrote:
> On 6/20/06, Mike Richmond wrote:
> > I have an application that I recently ported to Solr and am running
> > into a few problems with the XML responses from Solr. An XML response
> > which came from a Solr query, returned XML data that was not properly
> > escaped (no CDATA tag, or entity substitution). In particular the
> > "summary" field contains '<' characters. An example of such a response
> > can be found here: http://www.willetts.com/mike/response.xml
>
> Hmmm, that is interesting... I haven't seen that before.
> I'll try and duplicate it with your example "summary" field.
>
> > On another note:
> > I also noticed that I get non-utf8 characters in the response even
> > though the encoding line at the top of the XML document specifies utf8
> > encoding.
>
> Are you using the bundled version of Jetty? People have been having
> problems with international chars with that. You might try using
> Tomcat.
>
> > I did not see anywhere in the XMLWriter code that checked
> > the encoding of the output. Is this by design, or am I missing
> > something?
>
> By design... XMLWriter writes Java characters and strings, and the
> servlet container handles encoding to UTF-8.
>
> -Yonik
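
The two points raised in this thread are entity substitution for characters such as '<' in field values, and leaving the character-to-byte encoding to a separate output layer. They can be sketched roughly as follows. This is a minimal Python illustration under stated assumptions, not Solr's XMLWriter code: the function names `render_field` and `encode_response` and the sample `summary` value are hypothetical.

```python
from xml.sax.saxutils import escape, quoteattr
import xml.etree.ElementTree as ET


def render_field(name: str, value: str) -> str:
    """Build one XML element as text (hypothetical helper, not Solr's API)."""
    # escape() replaces & with &amp;, < with &lt;, and > with &gt;
    # quoteattr() escapes the attribute value and wraps it in quotes
    return "<str name={}>{}</str>".format(quoteattr(name), escape(value))


def encode_response(body: str) -> bytes:
    """Turn the character-level document into UTF-8 bytes as the last step."""
    declaration = '<?xml version="1.0" encoding="UTF-8"?>\n'
    return (declaration + body).encode("utf-8")


# Sample value containing markup characters and a non-ASCII character.
summary = "price < 10 & caf\u00e9"
xml_bytes = encode_response(render_field("summary", summary))

# If escaping and encoding are both correct, the bytes parse as XML
# and the field text comes back unchanged.
parsed = ET.fromstring(xml_bytes)
```

The writer step in this sketch deals only in characters, and the bytes are produced in a single place. If that final step used an encoding other than the one named in the XML declaration, the output would contain the kind of non-UTF-8 bytes described above even though the writer itself was correct.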