lucene-solr-user mailing list archives

From Tom Burton-West <tburt...@umich.edu>
Subject Re: Solr 4.0 Beta, termIndexInterval vs termIndexDivisor vs termInfosIndexDivisor
Date Fri, 07 Sep 2012 20:54:37 GMT
Thanks Robert,

>>if not, just customize blocktree's params with a CodecFactory in solr,
>>or even pick another implementation (FixedGap, VariableGap, whatever).

Still trying to get my head around 4.0 and flexible indexing.  I'll take
another look at Mike's and your presentations.  I'm trying to figure out
how to get from the Lucene JavaDocs you pointed out to how to specify
things in Solr and its config files.

Is there an example CodecFactory somewhere I could look at?  Also, is
there an example somewhere of how to specify a CodecFactory/Codec in
Solr using the schema.xml or solrconfig.xml?

Is there some simple way to specify minBlockSize and maxBlockSize in
schema.xml?

Once I get this all working and understand it, I'll be happy to draft some
documentation.

I'm really looking forward to experimenting with 4.0!

Tom



On Fri, Sep 7, 2012 at 2:58 PM, Robert Muir <rcmuir@gmail.com> wrote:

> On Fri, Sep 7, 2012 at 2:19 PM, Tom Burton-West <tburtonw@umich.edu>
> wrote:
> > Thanks Robert,
> >
> > I'll have to spend some time understanding the default codec for Solr 4.0.
> > Did I miss something in the changes file?
>
> http://lucene.apache.org/core/4_0_0-BETA/
>
> see the file formats section, especially
>
> http://lucene.apache.org/core/4_0_0-BETA/core/org/apache/lucene/codecs/lucene40/Lucene40PostingsFormat.html#Termdictionary
>
> (since blocktree "covers" term dictionary and terms index)
>
> >
> > I'll be digging into the default codec docs and testing sometime in the
> > next week or two (with a 2 billion term index).  If I understand it well
> > enough, I'll be happy to draft some changes for either the wiki or the
> > Solr example solrconfig.xml file.
>
> right i think we should remove these parameters.
>
> >
> > Does this mean that the default codec will reduce memory use for the
> > terms index enough so I don't need to use either of these settings to
> > deal with my > 2 billion term indexes?
>
> probably. i dont know enough about your terms or how much RAM you have
> to say for sure.
>
> if not, just customize blocktree's params with a CodecFactory in solr,
> or even pick another implementation (FixedGap, VariableGap, whatever).
>
> the interval/divisor stuff is mostly only useful if you are not
> reindexing from scratch: e.g. if you are gonna plop your 3.x index
> into 4.x then you should set those to whatever you were using before,
> since it will be using PreflexCodec to read those.
>
> --
> lucidworks.com
>
