Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B3C212B36 for ; Thu, 5 May 2011 16:16:57 +0000 (UTC) Received: (qmail 60807 invoked by uid 500); 5 May 2011 16:16:56 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 60767 invoked by uid 500); 5 May 2011 16:16:56 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 60760 invoked by uid 99); 5 May 2011 16:16:56 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 05 May 2011 16:16:56 +0000 X-ASF-Spam-Status: No, hits=0.7 required=5.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [85.25.71.29] (HELO mail.troja.net) (85.25.71.29) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 05 May 2011 16:16:49 +0000 Received: from localhost (localhost.localdomain [127.0.0.1]) by mail.troja.net (Postfix) with ESMTP id 6F3E0D36004 for ; Thu, 5 May 2011 18:16:29 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at mail.troja.net Received: from mail.troja.net ([127.0.0.1]) by localhost (megaira.troja.net [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id xI2l8idgeKmi for ; Thu, 5 May 2011 18:16:24 +0200 (CEST) Received: from VEGA (port-92-196-39-101.dynamic.qsc.de [92.196.39.101]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) by mail.troja.net (Postfix) with ESMTPSA id 455E3D36001 for ; Thu, 5 May 2011 18:16:24 +0200 (CEST) From: "Uwe Schindler" To: References: <1939176526.19277.1304444463325.JavaMail.tomcat@hel.zones.apache.org> <1466949571.25077.1304611983243.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1466949571.25077.1304611983243.JavaMail.tomcat@hel.zones.apache.org> Subject: RE: [jira] [Updated] (LUCENE-3065) NumericField should be stored in binary format in index (matching Solr's format) Date: Thu, 5 May 2011 18:16:25 +0200 Message-ID: <002901cc0b3f$c8688540$59398fc0$@thetaphi.de> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable X-Mailer: Microsoft Outlook 14.0 Thread-Index: AQGxrx4TNMdFBb7YFYRjS2XS0879lJSzIQIQ Content-Language: de X-Virus-Checked: Checked by ClamAV on apache.org Sorry, I did not want to delete this one, my huper duper browser gots = totally confused and disturbed... ----- Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: uwe@thetaphi.de > -----Original Message----- > From: Uwe Schindler (JIRA) [mailto:jira@apache.org] > Sent: Thursday, May 05, 2011 6:13 PM > To: dev@lucene.apache.org > Subject: [jira] [Updated] (LUCENE-3065) NumericField should be stored = in > binary format in index (matching Solr's format) >=20 >=20 > [ https://issues.apache.org/jira/browse/LUCENE- > = 3065?page=3Dcom.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel = ] >=20 > Uwe Schindler updated LUCENE-3065: > ---------------------------------- >=20 > Comment: was deleted >=20 > (was: Ideally this could be done with the schema-like approach of one = of the > GSoC projects? >=20 > We already discussed about that: We can use the = FieldsReader/FieldsWriter > type flag (which currently says, binary/text and compressed (unused = now)) > in the index file format to mark a field as NumericField. In that = case, > Document.getField() would return the NumericField instance. >=20 > For Lucene backwards we should still support creating "text-only" = fields. >=20 > The new binary format would also be compatible with solr, as on = getField, > Solr would get a NumericField and can decide using instanceof what to = do. > Old Solr indexes without the NumericField marker flag would return as > byte[], in which case, solr would do the decoding. >=20 > For storing on index side, Solr could move to NumericField completely = (I dont > like the current approach using NumericTokenStream and to/fromInternal > wrappers around conventional Field).) >=20 > > NumericField should be stored in binary format in index (matching > > Solr's format) > > = ---------------------------------------------------------------------- > > ---------- > > > > Key: LUCENE-3065 > > URL: = https://issues.apache.org/jira/browse/LUCENE-3065 > > Project: Lucene - Java > > Issue Type: Improvement > > Components: Index > > Reporter: Michael McCandless > > Assignee: Uwe Schindler > > Priority: Minor > > Fix For: 3.2, 4.0 > > > > Attachments: LUCENE-3065.patch, LUCENE-3065.patch, > > LUCENE-3065.patch, LUCENE-3065.patch, LUCENE-3065.patch, > > LUCENE-3065.patch, LUCENE-3065.patch > > > > > > (Spinoff of LUCENE-3001) > > Today when writing stored fields we don't record that the field was = a > > NumericField, and so at IndexReader time you get back an "ordinary" > > Field and your number has turned into a string. See > > https://issues.apache.org/jira/browse/LUCENE- > 1701?focusedCommentId=3D127 > > = 21972&page=3Dcom.atlassian.jira.plugin.system.issuetabpanels:comment- > tab > > panel#comment-12721972 We have spare bits already in stored fields, > > so, we should use one to record that the field is numeric, and then = encode > the numeric field in Solr's more-compact binary format. > > A nice side-effect is we fix the long standing issue that you don't = get a > NumericField back when loading your document. >=20 > -- > This message is automatically generated by JIRA. > For more information on JIRA, see: = http://www.atlassian.com/software/jira >=20 > --------------------------------------------------------------------- > To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For = additional > commands, e-mail: dev-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org