Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: java-dev@lucene.apache.org
Received-SPF: neutral (athena.apache.org: local policy)
From: "Uwe Schindler" <uwe@thetaphi.de>
To: <java-dev@lucene.apache.org>
References: <786fde50903300050x576cb814g550df2128b45fb60@mail.gmail.com>
 <9ac0c6aa0903300107t35b31f80u5df09d618692178c@mail.gmail.com>
 <786fde50903300120s411c91cch5df579c651c104fb@mail.gmail.com>
Subject: RE: Bug in TopFieldCollector?
Date: Mon, 30 Mar 2009 10:39:32 +0200
Message-ID: <2139BAF9FE46466BB404DDA4962A8AB9@VEGA>
MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----=_NextPart_000_0003_01C9B123.D00EC390"
In-Reply-To: <786fde50903300120s411c91cch5df579c651c104fb@mail.gmail.com>
Thread-Index: AcmxEHZaTq9qdWSaQfq+xu64Dwt5UAAAdasA

------=_NextPart_000_0003_01C9B123.D00EC390
Content-Type: text/plain;
	charset="us-ascii"
Content-Transfer-Encoding: 7bit

Why not call IndexSearcher.getIndexReader().getSequentialSubReaders()? See
http://hudson.zones.apache.org/hudson/job/Lucene-trunk/javadoc/all/org/apache/lucene/index/IndexReader.html
It's public and documented like this:

 
public IndexReader[] getSequentialSubReaders()

 
Expert: returns the sequential sub readers that this reader is logically
composed of. For example, IndexSearcher uses this API to drive searching by
one sub reader at a time. If this reader is not composed of sequential child
readers, it should return null. If this method returns an empty array, that
means this reader is a null reader (for example a MultiReader that has no
sub readers). 

NOTE: for a MultiSegmentReader, which is obtained by open(java.lang.String)
<http://hudson.zones.apache.org/hudson/job/Lucene-trunk/javadoc/all/org/apache/lucene/index/IndexReader.html#open(java.lang.String)>
when the index has more than one segment, you should
not use the sub-readers returned by this method to make any changes
(setNorm, deleteDocument, etc.). Doing so will likely lead to index
corruption. Use the parent reader instead. 

 
The only catch is that you have to replicate the code that recursively
gathers the sub-readers of the sub-readers yourself.
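
A minimal sketch of that recursive gathering, assuming only the contract
quoted above (null means the reader has no sequential children). To keep it
runnable without Lucene on the classpath, SubReaderSource is a hypothetical
stand-in for IndexReader, and SubReaderGatherer and Node are illustrative
names, not Lucene classes:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for org.apache.lucene.index.IndexReader. It models only
// the documented contract of getSequentialSubReaders(): null means "not composed
// of sequential child readers", an empty array means "no sub readers".
interface SubReaderSource {
    SubReaderSource[] getSequentialSubReaders();
}

final class SubReaderGatherer {

    // Depth-first walk that appends every leaf reader to 'out', in order.
    static void gather(List<SubReaderSource> out, SubReaderSource reader) {
        SubReaderSource[] subs = reader.getSequentialSubReaders();
        if (subs == null) {
            out.add(reader);      // leaf: no sequential children
            return;
        }
        for (SubReaderSource sub : subs) {
            gather(out, sub);     // composite: recurse into each child
        }
    }

    // Minimal test double (an assumption of this sketch, not Lucene API).
    static final class Node implements SubReaderSource {
        final String name;
        private final SubReaderSource[] subs;   // null marks a leaf

        private Node(String name, SubReaderSource[] subs) {
            this.name = name;
            this.subs = subs;
        }

        static Node leaf(String name) {
            return new Node(name, null);
        }

        static Node composite(String name, SubReaderSource... subs) {
            return new Node(name, subs);
        }

        @Override
        public SubReaderSource[] getSequentialSubReaders() {
            return subs;
        }
    }

    public static void main(String[] args) {
        Node root = Node.composite("root",
            Node.leaf("seg0"),
            Node.composite("multi", Node.leaf("seg1"), Node.leaf("seg2")));
        List<SubReaderSource> leaves = new ArrayList<>();
        gather(leaves, root);
        for (SubReaderSource leaf : leaves) {
            System.out.println(((Node) leaf).name);
        }
    }
}
```

With the real API the same walk would start from
IndexSearcher.getIndexReader(); whether the resulting list also needs to be
sorted the way IndexSearcher does it internally is not covered here.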

 
Uwe

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: uwe@thetaphi.de

  _____  

From: Shai Erera [mailto:serera@gmail.com] 
Sent: Monday, March 30, 2009 10:20 AM
To: java-dev@lucene.apache.org
Subject: Re: Bug in TopFieldCollector?

 
Already did!

Another question - I think we somehow broke TopFieldCollector ...
Previously, TopFieldDocCollector accepted a single IndexReader as a
parameter; TopFieldCollector now requires an IndexReader[], called subReaders.
The 'fast' search methods with Sort have no problem obtaining that
IndexReader[] (and sorting it), but how would someone use TopFieldCollector w/o
calling the appropriate Searcher methods?

For example, since all the Searcher methods pass in fillFields = true, I
wanted to use the method Searcher.search(Query, TopFieldCollector) in the
test case I wrote, which BTW looks like this:

  public void testSortWithoutFillFields() throws Exception {
    
    // There was previously a bug in TopFieldCollector when fillFields was set
    // to false - the same doc and score was set in ScoreDoc[] array. This test
    // asserts that if fillFields is false, the documents are set properly. It
    // does not use Searcher's default search methods (with Sort) since all set
    // fillFields to true.
    Sort sort = new Sort();
    int nDocs = 10;
    
    TopDocsCollector tdc = new TopFieldCollector(sort, nDocs,
        new IndexReader[] { ((IndexSearcher) full).getIndexReader() }, false);
    
    full.search(new MatchAllDocsQuery(), tdc);

    ScoreDoc[] sd = tdc.topDocs().scoreDocs;
    for (int i = 1; i < sd.length; i++) {
      assertTrue(sd[i].doc != sd[i - 1].doc);
    }
  }

You'll notice that creating a TopFieldCollector now is much more complicated
and *ugly*. As a user of IndexSearcher, I can only call getIndexReader()
which returns a single IndexReader. I don't have access to gatherSubReaders
and sortSubReaders. I don't see why I should have access to them. So it
forces me to create a dummy array with a single IndexReader.

There are two ways I see to solve it:
1. Introduce a getIndexReaders() method on IndexSearcher, which will return
an array of (sorted?) IndexReader.
2. Introduce a new constructor in TopFieldCollector which accepts a single
IndexReader and make the other one package-private (for use by IndexSearcher
only). That constructor can internally create a dummy array of readers, but
at least it's private to the constructor and not exposed to the rest of the
world.

Otherwise, I think it ruins TopFieldCollector and makes it a lot less
intuitive to use. At the very least, people who want to move from
TopFieldDocCollector to TopFieldCollector will find it very inconvenient
and strange.

What do you think? I can do (2) as part of 1575. If (1) is better, then
I think a different issue should be opened, because we might want to return
such an array sorted, which makes it less trivial.

Shai

On Mon, Mar 30, 2009 at 11:07 AM, Michael McCandless
<lucene@mikemccandless.com> wrote:

Looks like quite a bug, Shai!  Thanks.  It came in with LUCENE-1483.
I would say add test case & fix it under 1575.

Mike


On Mon, Mar 30, 2009 at 3:50 AM, Shai Erera <serera@gmail.com> wrote:
> Hi
>
> As I prepared the patch for 1575, I noticed a strange implementation in
> TopFieldCollector's topDocs():
>
>     ScoreDoc[] scoreDocs = new ScoreDoc[queue.size()];
>     if (fillFields) {
>       for (int i = queue.size() - 1; i >= 0; i--) {
>         scoreDocs[i] = queue.fillFields((FieldValueHitQueue.Entry) queue.pop());
>       }
>     } else {
>       Entry entry = (FieldValueHitQueue.Entry) queue.pop();
>       for (int i = queue.size() - 1; i >= 0; i--) {
>         scoreDocs[i] = new FieldDoc(entry.docID,
>                                     entry.score);
>       }
>     }
>
>     return new TopFieldDocs(totalHits, scoreDocs, queue.getFields(), maxScore);
>
>
> Notice that if fillFields is true, then documents are popped from the queue.
> However if it's false, then the first document is popped out of the queue
> and used to populate the entire ScoreDoc[]? I believe that's wrong, right?
> Otherwise, the returned TopFieldDocs.scoreDocs array will include the same
> document and score?
>
> I noticed there's no test case for that ... TopFieldCollector's constructor
> is called only from IndexSearcher.search(Weight, Filter, int, Sort, boolean
> /* fillFields */) which is called from IndexSearcher.search(Weight, Filter,
> int, Sort) with fillFields always set to true. So perhaps that's why this
> hasn't shown up?
>
> BTW, the now deprecated TopFieldDocCollector's topDocs() implementation
> looks like this:
>
>     FieldSortedHitQueue fshq = (FieldSortedHitQueue)hq;
>     ScoreDoc[] scoreDocs = new ScoreDoc[fshq.size()];
>     for (int i = fshq.size()-1; i >= 0; i--)      // put docs in array
>       scoreDocs[i] = fshq.fillFields ((FieldDoc) fshq.pop());
>
>     return new TopFieldDocs(totalHits, scoreDocs,
>                             fshq.getFields(), fshq.getMaxScore());
>
> It assumes fillFields is always true and always pops elements out of the
> queue.
>
> If this is a bug, I can fix it as part of 1575, as I'm touching that class
> anyway. I can also add a test case ... The fix is very simple BTW, just move
> the line "Entry entry = (FieldValueHitQueue.Entry) queue.pop();" inside the
> for loop.
>
> Shai
>
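
For illustration, the difference between popping once before the loop and
popping inside it can be reproduced without Lucene. The sketch below is only
an approximation: java.util.PriorityQueue stands in for FieldValueHitQueue,
and Entry, drainBuggy, drainFixed and sampleQueue are hypothetical names that
exist only in this example. As in the quoted code, the buggy variant sizes
the array before the pop and computes the loop bound after it, so in this
sketch its last slot is never filled.

```java
import java.util.PriorityQueue;

final class PopInsideLoopSketch {

    // Hypothetical stand-in for FieldValueHitQueue.Entry.
    static final class Entry {
        final int docID;
        final float score;

        Entry(int docID, float score) {
            this.docID = docID;
            this.score = score;
        }
    }

    // Mirrors the quoted else-branch: a single pop, reused for every slot.
    static Entry[] drainBuggy(PriorityQueue<Entry> queue) {
        Entry[] out = new Entry[queue.size()];
        Entry entry = queue.poll();
        for (int i = queue.size() - 1; i >= 0; i--) {
            out[i] = entry;
        }
        return out;
    }

    // The proposed fix: pop inside the loop, so each slot gets its own entry.
    static Entry[] drainFixed(PriorityQueue<Entry> queue) {
        Entry[] out = new Entry[queue.size()];
        for (int i = queue.size() - 1; i >= 0; i--) {
            out[i] = queue.poll();
        }
        return out;
    }

    // Lowest score sits at the head, so draining from the back of the array
    // yields best-first order.
    static PriorityQueue<Entry> sampleQueue() {
        PriorityQueue<Entry> queue =
            new PriorityQueue<Entry>((a, b) -> Float.compare(a.score, b.score));
        queue.add(new Entry(7, 0.9f));
        queue.add(new Entry(3, 0.5f));
        queue.add(new Entry(5, 0.7f));
        return queue;
    }

    public static void main(String[] args) {
        for (Entry e : drainFixed(sampleQueue())) {
            System.out.println("doc=" + e.docID + " score=" + e.score);
        }
    }
}
```

A check like the one in testSortWithoutFillFields (adjacent entries must not
share a doc) passes on the drainFixed result and fails on the drainBuggy one.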

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org

 
------=_NextPart_000_0003_01C9B123.D00EC390--