Message-ID: <4AF970AD.2030305@btelligent.de>
Date: Tue, 10 Nov 2009 14:54:53 +0100
From: Chantal Ackermann
Organization: b.telligent GmbH
To: "solr-dev@lucene.apache.org", "yonik@lucidimagination.com"
Subject: Re: MoreLikeThis > interestingTerms : SortableIntField breaks XML
References: <4AF96440.60304@btelligent.de>

Hi Yonik,

I'll do that.
Is this a general requirement: that the terms have to be externalized? Because the TermVectorComponent doesn't externalize either. Shall the ticket mention that?

Thanks,
Chantal

Response using TermVectorComponent:

#8;#0;#0;ϐ㭍 1 93962 1 1165 ...

Yonik Seeley wrote:
> On Tue, Nov 10, 2009 at 8:01 AM, Chantal Ackermann wrote:
>> I've just realised that this line in the result for an MLT query is broken:
>>
>> 0.2517573
>> (this is the last child of element "interestingTerms" see more output below)
>
> Looks like MLT doesn't "externalize" terms... can you open a JIRA issue?
>
> -Yonik
> http://www.lucidimagination.com
>
>
>> decade should contain the int "2000" from what I can see in the results for
>> that query. The representation of "2000" as SortableInt seems to contain
>> characters that break the XML output?
>>
>> Cheers,
>> Chantal
>>
>>
>> .../solr/mlt?mlt.mindf=50&sort=score+desc,start_date+asc,sid+asc
>> &fl=id,year,decade,title,start_date,score,sid
>> &mlt.fl=cat,participant,decade,country&start=0
>> &q=id:32883780&mlt.mintf=1&mlt.match.include=true&rows=50
>> &version=1&mlt.interestingTerms=details&mlt.boost=true
>>
>>
>> ...
>>
>> 3.9147425
>> 2000
>> 32725355
>> 1518610
>> 2009-11-14T06:00:00Z
>> Smoking
>> 2002
>>
>> ...
>>
>> 1.0
>> 0.9949831
>> 0.93337345
>> 0.5553121
>> 0.5469994
>> 0.38832104
>> 0.31487963
>> 0.2517573
>>
>>
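
The thread's point is that an internal, sortable encoding of an int may contain characters that XML 1.0 does not allow, while the externalized (readable) form does not. Here is a minimal, self-contained Java sketch of that idea. The class XmlCharCheck and the methods packInt/unpackInt are hypothetical names for illustration only. The packing scheme is an assumption and is not claimed to be the exact encoding SortableIntField uses. Only the character ranges in isXmlChar are taken from the XML 1.0 specification.

```java
// Illustrative sketch only: names and packing scheme are assumptions,
// not Solr's API.
final class XmlCharCheck {

    // XML 1.0 "Char" production:
    // #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    static boolean isXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
            || (cp >= 0x20 && cp <= 0xD7FF)
            || (cp >= 0xE000 && cp <= 0xFFFD)
            || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // True if every code point in s may appear in an XML 1.0 document.
    static boolean isXmlSafe(String s) {
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            if (!isXmlChar(cp)) {
                return false;
            }
            i += Character.charCount(cp);
        }
        return true;
    }

    // Hypothetical "indexed" form: shift the int so that signed order
    // matches unsigned order, then split the 32 bits into three chars
    // (8 + 12 + 12 bits). Low values easily produce U+0000 and other
    // control characters.
    static String packInt(int val) {
        int v = val + Integer.MIN_VALUE;
        return new String(new char[] {
            (char) (v >>> 24),
            (char) ((v >>> 12) & 0x0fff),
            (char) (v & 0x0fff)
        });
    }

    // Inverse of packInt: the "externalizing" step that recovers the int.
    static int unpackInt(String s) {
        int v = (s.charAt(0) << 24) | (s.charAt(1) << 12) | s.charAt(2);
        return v - Integer.MIN_VALUE;
    }

    public static void main(String[] args) {
        String indexed = packInt(2000);
        String readable = Integer.toString(unpackInt(indexed));
        System.out.println("indexed form XML-safe:  " + isXmlSafe(indexed));
        System.out.println("readable form XML-safe: " + isXmlSafe(readable));
    }
}
```

Under this assumed packing, the middle character for 2000 is U+0000, which no XML 1.0 document may contain, so a response writer has to emit the readable form rather than the raw indexed term.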