lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ganesh" <emailg...@yahoo.co.in>
Subject Re: Best way to do date sort
Date Thu, 18 Jun 2009 05:30:07 GMT
I am having a million of documents per day to index. DateTime field is with minute resolution.


How much memory i save by splitting this to multiple fields (one field containintg YYYYMMDD,
one field with HH and one with MM). 

Could anyone provide me some calculation of calculating memory for sorting in case of multiple
fields. I think fieldcache will load all unique terms and it will have a pointer to the all
respective documents. If it so then splitting the date field will consume more memory.   

Regards
Ganesh

----- Original Message ----- 
From: "Erick Erickson" <erickerickson@gmail.com>
To: <java-user@lucene.apache.org>
Sent: Tuesday, January 13, 2009 7:17 PM
Subject: Re: Best way to do date sort


> This question, along with many possible answers has been
> discussed many times, so there's a wealth of information
> in the searchable archive.
> 
> The short form is "it depends". Do you want to sort? In
> that case storing a single field will cost you when sorting.
> Store the coarsest granularity you can. Consider breaking
> up the date field (i.e. one field containintg YYYYMMDD,
> perhaps one field containing HHMM or even one field
> with HH and one with MM).
> 
> This kind of strategy will save you far more space than worrying
> about strings vs number. And will sort faster. etc.
> 
> Best
> Erick
> 
> On Tue, Jan 13, 2009 at 4:07 AM, Ganesh <emailgane@yahoo.co.in> wrote:
> 
>> I am indexing and storing date time with minute resolution. I need to do
>> date range query and also do sorting on this field. I am having almost 30
>> million records spread across 20 database.
>>
>> option1:
>> To index the date time as string
>>
>> option2:
>> To index date, hour and minute separately as number.
>>
>> Which option will consume lesser memory?
>>
>> Whether date range query will also load all data using FieldImpl?
>>
>> Regards
>> Ganesh
>> Send instant messages to your online friends http://in.messenger.yahoo.com
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>
>>
>
Mime
View raw message