Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@lucene.apache.org
Received-SPF: pass (athena.apache.org: domain of jimmoefoe@gmail.com
 designates 74.125.82.48 as permitted sender)
DomainKey-Signature: a=rsa-sha1; c=nofws;
        d=gmail.com; s=gamma;
        h=mime-version:in-reply-to:references:date:message-id:subject:from:to
         :content-type;
        b=t00BL7J5TDSpr7mdLuV9j01osKfXwvKWhRWFrgRauZ3LNEPjsgRB+BGjXf9tlrs34y
         PJZq2wC0KyOFjbibuNEbQFKInDpcugdMRBcS1fdAbZZCvHLSIdvsay/gsn4VqWTdGWBF
         kJLzzJvwtPrG9vXVDJXakMQc+3UBnLPhiajWg=
MIME-Version: 1.0
In-Reply-To: <AANLkTikQmZUbpQQ32=JBq3ioREQ6LurPEMb1GbJGWurP@mail.gmail.com>
References: <AANLkTikdw3nmun0KJU7aL8Taf=iSG9CM2Z5JwRF0hR_C@mail.gmail.com>
	<AANLkTinpr=orgYh7O=yya5tJoxeLPSKr6ctsGJezN_7g@mail.gmail.com>
	<AANLkTimE20KEuVZve31Q9bE4HkZHLPEttxbbbk0cnxet@mail.gmail.com>
	<AANLkTinBxZe1pohynOjyphF5LcbR47YzeA989zPj7r=9@mail.gmail.com>
	<AANLkTin=vh+L=abN+NX9Z80_WEpCQZQJvB3UN_J_C56w@mail.gmail.com>
	<AANLkTikeN12TwxbCkah6E+9CL+75fWnSeKodb6riNFEz@mail.gmail.com>
	<AANLkTinMtWdrgyshu8LUOs5rAqU16KOuefcap22t0aJp@mail.gmail.com>
	<AANLkTim9f4HZ2pLyzAVFYnYLBPTXE9m2jJSrRqz_QonV@mail.gmail.com>
	<AANLkTikiqDj--8dw9KrDmnaQF92FAr91+s-KPPutCZwt@mail.gmail.com>
	<AANLkTikQmZUbpQQ32=JBq3ioREQ6LurPEMb1GbJGWurP@mail.gmail.com>
Date: Mon, 6 Dec 2010 16:29:49 -0800
Message-ID: <AANLkTimUUDpHRfQu76x16NqkBPrg+SnvYzN1qP-YG_t9@mail.gmail.com>
Subject: Re: FieldCache usage for custom field collapse in solr 1.4
From: "Adam H." <jimmoefoe@gmail.com>
To: dev@lucene.apache.org, yonik@lucidimagination.com
Content-Type: multipart/alternative; boundary=0016367f9dc43d4dee0496c71a3c

--0016367f9dc43d4dee0496c71a3c
Content-Type: text/plain; charset=ISO-8859-1

One more comment/question:
Having looked at the Solr stats panel, I do not see detailed memory usage
for the field I'm collapsing on in the Lucene FieldCache entry listings.

As I understand it (after reading through this ticket:
https://issues.apache.org/jira/browse/SOLR-1292), this means that it's not
an 'insanity' instance,
and so I am not actually using double the memory; rather, this field is
held in the FieldCache only at the whole-index level.

This got me thinking: if I'm not using any segment-level field caching for
this field, there's no reason not to use an index-wide one,
as long as I can guarantee that's the only use of this field in the
FieldCache. Is this correct?
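
To make the distinction concrete, here is a minimal sketch of what a
per-segment lookup would involve, as opposed to one index-wide array. It is
plain Java with no Lucene or Solr calls: SegmentFieldLookup, its
constructor and valueFor are hypothetical names I made up, and each inner
String[] only stands in for the kind of per-reader array I would expect
FieldCache.DEFAULT.getStrings(reader, fieldName) to hand back for a single
segment.

```java
/** Hypothetical stand-in for per-segment field caches; not Lucene or Solr API. */
final class SegmentFieldLookup {
    // docBases[i] is the first top-level doc id that belongs to segment i.
    private final int[] docBases;
    // segmentValues[i][localDoc] is the field value for that segment-local doc.
    private final String[][] segmentValues;

    SegmentFieldLookup(String[][] segmentValues) {
        this.segmentValues = segmentValues;
        this.docBases = new int[segmentValues.length];
        int base = 0;
        for (int i = 0; i < segmentValues.length; i++) {
            docBases[i] = base;
            base += segmentValues[i].length;
        }
    }

    /** Resolve a top-level doc id to its segment, then read the segment-local array. */
    String valueFor(int topLevelDocId) {
        // Linear scan from the last segment; segment counts are assumed small.
        for (int i = segmentValues.length - 1; i >= 0; i--) {
            int local = topLevelDocId - docBases[i];
            if (local >= 0 && local < segmentValues[i].length) {
                return segmentValues[i][local];
            }
        }
        throw new IllegalArgumentException("doc id out of range: " + topLevelDocId);
    }

    public static void main(String[] args) {
        // Three made-up segments holding 2, 1 and 3 documents.
        SegmentFieldLookup lookup = new SegmentFieldLookup(new String[][] {
            {"a", "b"}, {"c"}, {"d", "e", "f"}
        });
        for (int doc = 0; doc < 6; doc++) {
            System.out.println(doc + " -> " + lookup.valueFor(doc));
        }
    }
}
```

With the index-wide entry, the same lookup is a single array read by
top-level doc id, which is why it is tempting when nothing else touches
the field per segment.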

Thanks again for helping me out with this delicate subject :)

Adam

On Mon, Dec 6, 2010 at 3:21 PM, Adam H. <jimmoefoe@gmail.com> wrote:

> Ah! Just so I can get cracking on this: can you be a little more
> specific? E.g.,
>
> in my component implementation, which runs during request handling after the
> normal QueryComponent,
> how would I access the specific field value for the documents that were
> retrieved?
>
> I.e., how would it fit into code like this, if at all:
>
> // docList holds the matching documents for the given offset/rows/query
> DocIterator it = docList.iterator();
>
> while (it.hasNext()) {
>     int docId = it.nextDoc();   // primitive doc id, avoids boxing via next()
>     float score = it.score();   // score of the doc just returned
>
>     // this would have worked if this were a stored field:
>     // reader.document(docId).get(fieldName)
>     ??
> }
>
>
>
> On Mon, Dec 6, 2010 at 2:57 PM, Yonik Seeley <yonik@lucidimagination.com> wrote:
>
>> On Mon, Dec 6, 2010 at 5:48 PM, Adam H. <jimmoefoe@gmail.com> wrote:
>> > In other words, using a per-segment fieldcache collection as a
>> > post-processing step (e.g. after QueryComponent did its collection) does not
>> > seem at all trivial, if at all possible (is it possible?)
>>
>> Sure, it's possible, and not too hard (as long as no sort field involves
>> score).
>> Just instruct the QueryComponent to retrieve the set of all matching
>> documents, then you can use that to run them through whatever
>> collectors you want again.  I've been meaning to implement this
>> optimization to field collapsing...
>>
>> Depending on the details, either replacing the QueryComponent with
>> your custom one, or inserting an additional component after the query
>> component could make sense.
>>
>> -Yonik
>> http://www.lucidimagination.com
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: dev-help@lucene.apache.org
>>
>>
>
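
For what it's worth, this is roughly how I picture the second pass
described in the quoted reply: take all matching doc ids with their scores,
then run them through a collapsing step keyed on the field value. It is
only a sketch in plain Java; CollapseSketch and collapse are hypothetical
names, the arrays are made-up data standing in for the doc set and a
FieldCache-style array, and "keep the highest-scoring doc per value" is my
assumption about the collapse rule.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical collapsing pass over already-collected matches; not Solr API. */
final class CollapseSketch {
    /**
     * For each distinct field value, keep the doc id with the highest score.
     * fieldValues is indexed by doc id, as a FieldCache-style array would be.
     */
    static Map<String, Integer> collapse(int[] docIds, float[] scores, String[] fieldValues) {
        Map<String, Integer> bestDoc = new LinkedHashMap<>(); // keeps first-seen group order
        Map<String, Float> bestScore = new HashMap<>();
        for (int i = 0; i < docIds.length; i++) {
            String key = fieldValues[docIds[i]];
            Float previous = bestScore.get(key);
            if (previous == null || scores[i] > previous) {
                bestScore.put(key, scores[i]);
                bestDoc.put(key, docIds[i]);
            }
        }
        return bestDoc;
    }

    public static void main(String[] args) {
        int[] docIds = {0, 1, 2, 3};
        float[] scores = {1.0f, 3.0f, 2.0f, 0.5f};
        String[] fieldValues = {"x", "y", "x", "y"}; // value per doc id
        System.out.println(collapse(docIds, scores, fieldValues));
    }
}
```

Since the scores are carried along with the doc ids here, this only covers
the simple case; it says nothing about sorts that involve score, which the
reply calls out as the harder one.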

--0016367f9dc43d4dee0496c71a3c--