lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chantal Ackermann <chantal.ackerm...@btelligent.de>
Subject Re: MoreLikeThis > interestingTerms : SortableIntField breaks XML
Date Tue, 10 Nov 2009 13:54:53 GMT
Hi Yonik,

I'll do that. Is this a general requirement: that the terms have to be 
externalized? Because the TermVectorComponent doesn't externalize 
either. Shall the ticket mention that?

Thanks,
Chantal


Response using TermVectorComponent:

<lst name="termVectors">
−
<lst name="doc-0">
<str name="uniqueKey">#8;#0;#0;ϐ㭍</str>
<lst name="cat">
−
<lst name="€#0;Ȁ">
<int name="tf">1</int>
<int name="df">93962</int>
</lst>
−
<lst name="€#0;ࠂ">
<int name="tf">1</int>
<int name="df">1165</int>
</lst>
</lst>
...

Yonik Seeley schrieb:
> On Tue, Nov 10, 2009 at 8:01 AM, Chantal Ackermann
> <chantal.ackermann@btelligent.de> wrote:
>> I've just realised that this line in the result for an MLT query is broken:
>>
>> <float name="decade:€#0;ߐ">0.2517573</float>
>> (this is the last child of element "interestingTerms" see more output below)
> 
> Looks like MLT doesn't "externalize" terms... can you open a JIRA issue?
> 
> -Yonik
> http://www.lucidimagination.com
> 
> 
>> decade should contain the int "2000" from what I can see in the results for
>> that query. The representation of "2000" as SortableInt seems to contain
>> characters that break the XML output?
>>
>> Cheers,
>> Chantal
>>
>>
>> .../solr/mlt?mlt.mindf=50&sort=score+desc,start_date+asc,sid+asc
>> &fl=id,year,decade,title,start_date,score,sid
>> &mlt.fl=cat,participant,decade,country&start=0
>> &q=id:32883780&mlt.mintf=1&mlt.match.include=true&rows=50
>> &version=1&mlt.interestingTerms=details&mlt.boost=true
>>
>>
>> ...
>> <doc>
>> <float name="score">3.9147425</float>
>> <int name="decade">2000</int>
>> <long name="id">32725355</long>
>> <long name="sid">1518610</long>
>> <date name="start_date">2009-11-14T06:00:00Z</date>
>> <str name="title">Smoking</str>
>> <int name="year">2002</int>
>> </doc>
>> ...
>> <lst name="interestingTerms">
>> <float name="participant:Nichelle Nichols">1.0</float>
>> <float name="participant:James Coburn">0.9949831</float>
>> <float name="participant:John Debney">0.93337345</float>
>> <float name="cat:€#0;Ă">0.5553121</float>
>> <float name="country:CDN">0.5469994</float>
>> <float name="cat:€#0;Ā">0.38832104</float>
>> <float name="country:USA">0.31487963</float>
>> <float name="decade:€#0;ߐ">0.2517573</float>
>> </lst>
>>

Mime
View raw message