lucene-solr-user mailing list archives

From Robert Muir <rcm...@gmail.com>
Subject Re: codec="Pulsing" per field broken?
Date Sun, 11 Dec 2011 17:37:33 GMT
On Sun, Dec 11, 2011 at 11:34 AM, eks dev <eksdev@yahoo.co.uk> wrote:
> on the latest trunk, my schema.xml with field type declaration
> containing //codec="Pulsing"// does not work any more (throws
> exception from FieldType). It used to work with a roughly month-old
> trunk version.
>
> I didn't dig deeper; it may be that the old schema.xml was broken and
> worked by accident.
>

Hi,

The short answer is that you should change this to //postingsFormat="Pulsing40"//.
See http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/core/src/test-files/solr/conf/schema_codec.xml

The longer answer is that the Codec API in Lucene trunk was extended recently:
https://issues.apache.org/jira/browse/LUCENE-3490

Previously, "Codec" only allowed you to customize the format of the
postings lists. We are working to have it cover the entire index
segment (at the moment nearly everything except deletes and encoding
of compound files can be customized).

For example, look at SimpleText now:
http://svn.apache.org/repos/asf/lucene/dev/trunk/lucene/src/java/org/apache/lucene/index/codecs/simpletext/
As you see, it now implements plain-text stored fields, term vectors,
norms, segments file, fieldinfos, etc.
See Codec.java (http://svn.apache.org/repos/asf/lucene/dev/trunk/lucene/src/java/org/apache/lucene/index/codecs/Codec.java)
or LUCENE-3490 for more details.

Because of this, what you had before is now just a "PostingsFormat":
Pulsing is only a wrapper around a postings implementation that
inlines low-frequency terms.
Lucene's default Codec uses a per-field postings setup, so you can
still configure the postings per field, just with the postingsFormat
attribute instead of codec.
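
As a rough sketch of what the changed declaration might look like in
schema.xml: the postingsFormat="Pulsing40" attribute is the one named
above, while the type name "string_pulsing", the field name "id", and
the choice of solr.StrField are only illustrative assumptions. Check
the schema_codec.xml linked above for what your trunk revision
actually accepts.

```xml
<!-- Sketch only: the type and field names here are hypothetical. -->
<types>
  <!-- Before (no longer accepted on trunk):
       <fieldType name="string_pulsing" class="solr.StrField" codec="Pulsing"/>
  -->
  <!-- After: the per-field setting is now a postings format, not a codec. -->
  <fieldType name="string_pulsing" class="solr.StrField" postingsFormat="Pulsing40"/>
</types>

<fields>
  <!-- Any field using this type gets the Pulsing postings format. -->
  <field name="id" type="string_pulsing" indexed="true" stored="true"/>
</fields>
```

Fields whose type does not set postingsFormat should keep using the
default postings format of the default Codec.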

-- 
lucidimagination.com
