lucene-dev mailing list archives

From "Uwe Schindler" <...@thetaphi.de>
Subject RE: [jira] [Updated] (LUCENE-3065) NumericField should be stored in binary format in index (matching Solr's format)
Date Thu, 05 May 2011 16:16:25 GMT
Sorry, I did not want to delete this one; my super-duper browser got totally confused and
disturbed...

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: uwe@thetaphi.de

> -----Original Message-----
> From: Uwe Schindler (JIRA) [mailto:jira@apache.org]
> Sent: Thursday, May 05, 2011 6:13 PM
> To: dev@lucene.apache.org
> Subject: [jira] [Updated] (LUCENE-3065) NumericField should be stored in
> binary format in index (matching Solr's format)
> 
> 
>      [ https://issues.apache.org/jira/browse/LUCENE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
> 
> Uwe Schindler updated LUCENE-3065:
> ----------------------------------
> 
>     Comment: was deleted
> 
> (was: Ideally this could be done with the schema-like approach of one of the
> GSoC projects?
> 
> We already discussed that: we can use the FieldsReader/FieldsWriter
> type flag (which currently marks binary/text and compressed (now unused))
> in the index file format to mark a field as NumericField. In that case,
> Document.getField() would return the NumericField instance.
> 
> For Lucene backwards compatibility, we should still support creating "text-only" fields.
> 
> The new binary format would also be compatible with Solr, as on getField,
> Solr would get a NumericField and can decide using instanceof what to do.
> Old Solr indexes without the NumericField marker flag would be returned as
> byte[], in which case Solr would do the decoding.
> 
> For storing on the index side, Solr could move to NumericField completely (I don't
> like the current approach using NumericTokenStream and to/fromInternal
> wrappers around conventional Field).)
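
The instanceof decision described in the comment above could look roughly
like the sketch below. It is only an illustration under stated assumptions:
NumericStandIn is a hypothetical stand-in for a NumericField instance, and
the 8-byte big-endian layout used for the legacy byte[] case is assumed
here, not taken from Solr's actual encoding.

```java
import java.nio.ByteBuffer;

public class StoredValueDispatchSketch {

    /** Hypothetical stand-in for a NumericField returned by Document.getField(). */
    static final class NumericStandIn {
        final long value;

        NumericStandIn(long value) {
            this.value = value;
        }
    }

    /**
     * Decides via instanceof how to obtain the number, mirroring the idea
     * that new indexes hand back a numeric field while old indexes hand
     * back raw bytes that the caller must decode itself.
     */
    static long toLong(Object stored) {
        if (stored instanceof NumericStandIn) {
            // New-style index: the value is already numeric.
            return ((NumericStandIn) stored).value;
        }
        if (stored instanceof byte[]) {
            // Old-style index: decode the bytes (assumed 8-byte big-endian long).
            return ByteBuffer.wrap((byte[]) stored).getLong();
        }
        throw new IllegalArgumentException("Unsupported stored value: " + stored);
    }

    public static void main(String[] args) {
        byte[] legacy = ByteBuffer.allocate(8).putLong(1234L).array();
        System.out.println(toLong(new NumericStandIn(1234L)));
        System.out.println(toLong(legacy));
    }
}
```

Both calls in main yield the same number, which is the point of the
proposal: callers get one value regardless of which index generation
produced it.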
> 
> > NumericField should be stored in binary format in index (matching Solr's format)
> > --------------------------------------------------------------------------------
> >
> >                 Key: LUCENE-3065
> >                 URL: https://issues.apache.org/jira/browse/LUCENE-3065
> >             Project: Lucene - Java
> >          Issue Type: Improvement
> >          Components: Index
> >            Reporter: Michael McCandless
> >            Assignee: Uwe Schindler
> >            Priority: Minor
> >             Fix For: 3.2, 4.0
> >
> >         Attachments: LUCENE-3065.patch, LUCENE-3065.patch,
> > LUCENE-3065.patch, LUCENE-3065.patch, LUCENE-3065.patch,
> > LUCENE-3065.patch, LUCENE-3065.patch
> >
> >
> > (Spinoff of LUCENE-3001)
> > Today when writing stored fields we don't record that the field was a
> > NumericField, and so at IndexReader time you get back an "ordinary"
> > Field and your number has turned into a string.  See
> > https://issues.apache.org/jira/browse/LUCENE-1701?focusedCommentId=12721972&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-12721972
> > We have spare bits already in stored fields, so, we should use one to
> > record that the field is numeric, and then encode the numeric field in
> > Solr's more-compact binary format.
> > A nice side-effect is we fix the long standing issue that you don't get a
> > NumericField back when loading your document.
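
As a rough illustration of the "spare bit" idea in the issue description,
the sketch below writes a one-byte flag header in front of each stored
value and uses one bit to mark the value as numeric. The constant name and
bit position (FLAG_NUMERIC_LONG = 0x08) and the use of
DataOutputStream are assumptions made for this example; they are not the
actual FieldsWriter format.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;

public class StoredFieldFlagsSketch {

    // Hypothetical flag bit marking the stored value as a binary-encoded long.
    static final int FLAG_NUMERIC_LONG = 0x08;

    /** Stores a value the old way: no numeric flag, value kept as text. */
    static byte[] writeText(String value) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(buf);
        out.writeByte(0);      // flags: plain text field
        out.writeUTF(value);   // 2-byte length prefix + encoded characters
        return buf.toByteArray();
    }

    /** Stores a long with the numeric flag set and a fixed 8-byte payload. */
    static byte[] writeLong(long value) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(buf);
        out.writeByte(FLAG_NUMERIC_LONG);
        out.writeLong(value);
        return buf.toByteArray();
    }

    /** Reads a stored value back; the flag decides which Java type is returned. */
    static Object read(byte[] stored) throws IOException {
        DataInputStream in = new DataInputStream(new ByteArrayInputStream(stored));
        int flags = in.readByte();
        if ((flags & FLAG_NUMERIC_LONG) != 0) {
            return in.readLong();  // boxed to Long: the number stays a number
        }
        return in.readUTF();       // unflagged values come back as text
    }

    public static void main(String[] args) throws IOException {
        System.out.println(read(writeLong(42L)) instanceof Long);
        System.out.println(read(writeText("42")) instanceof String);
    }
}
```

Because the reader branches on the flag, values written without it still
load as text, which is how older "text-only" fields would keep working
under this scheme.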
> 
> --
> This message is automatically generated by JIRA.
> For more information on JIRA, see: http://www.atlassian.com/software/jira
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional
> commands, e-mail: dev-help@lucene.apache.org




