lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jason Hellman <jhell...@innoventsolutions.com>
Subject Re: multiple cache for same field
Date Mon, 20 May 2013 17:25:15 GMT
Most definitely not the number of unique elements in each segment.  My 32 document sample index
(built from the default example docs data) has the following:

entry#0:
'StandardDirectoryReader(​segments_b:29 _8(​4.2.1):C32)'=>'manu_exact',class org.apache.lucene.index.SortedDocValues,0.5=>org.apache.lucene.search.FieldCacheImpl$SortedDocValuesImpl#1778857102

There is no chance for there to be 1.8 billion unique elements in that index.

On May 20, 2013, at 1:20 PM, Erick Erickson <erickerickson@gmail.com> wrote:

> Not sure, never had to worry about what they are......
> 
> On Mon, May 20, 2013 at 12:28 PM, J Mohamed Zahoor <zahoor@indix.com> wrote:
>> 
>> What is the number at the end?
>> is it the no of unique elements in each segment?
>> 
>> ./zahoor
>> 
>> 
>> On 20-May-2013, at 7:37 PM, Erick Erickson <erickerickson@gmail.com> wrote:
>> 
>>> Because the same field is split amongst a number of segments. If you
>>> look in the index directory, you should see files like _3fgm.* and
>>> _3ffm.*. Each such group represents one segment. The number of
>>> segments changes with merging etc.
>>> 
>>> Best
>>> Erick
>>> 
>>> On Mon, May 20, 2013 at 6:43 AM, J Mohamed Zahoor <zahoor@indix.com> wrote:
>>>> Hi
>>>> 
>>>> Why is that lucene field cache has multiple entries for the same field S_24.
>>>> It is a dynamic field.
>>>> 
>>>> 
>>>> 'SegmentCoreReader(owner=_3fgm(4.2.1):C7681)'=>'S_24',double,org.apache.lucene.search.FieldCache.NUMERIC_UTILS_DOUBLE_PARSER=>org.apache.lucene.search.FieldCacheImpl$DoublesFromArray#1174240382
>>>> 
>>>> 'SegmentCoreReader(owner=_3ffm(4.2.1):C1596758)'=>'S_24',double,org.apache.lucene.search.FieldCache.NUMERIC_UTILS_DOUBLE_PARSER=>org.apache.lucene.search.FieldCacheImpl$DoublesFromArray#83384344
>>>> 
>>>> 'SegmentCoreReader(owner=_3fgh(4.2.1):C2301)'=>'S_24',double,org.apache.lucene.search.FieldCache.NUMERIC_UTILS_DOUBLE_PARSER=>org.apache.lucene.search.FieldCacheImpl$DoublesFromArray#1281331764
>>>> 
>>>> 
>>>> Also, the number at the end.. does it specified the no of entries in that
cache bucket?
>>>> 
>>>> ./zahoor
>> 


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message