lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "David Smiley (@MITRE.org)" <DSMI...@mitre.org>
Subject Re: suggestion howto handle highly repetitive valued field
Date Tue, 11 Dec 2012 19:29:04 GMT
The indexed="true" side is quite efficient.  The stored="true" side -- not so
much, but the strings you have here are pretty small and I wouldn't worry
about it.  Solr 4.1 (unreleased) does a great job here and compresses all
the stored field data across documents.

~ David


Jie Sun wrote
> Hi -
> our indexed documents currently store solr fields like 'digest' or 'type',
> which most of our documents will end up with same value (such as 'sha1'
> for field 'digest', or 'message' for field 'type' etc).
> 
> on each solr server, we usually have 100 of millions of documents indexed
> and with the same value on these fields (fields are stored and indexed).
> 
> any suggestion what is the  best approach if we suspect this will be very
> inefficient on disk space usage, or is it?
> 
> thanks!
> Jie





-----
 Author: http://www.packtpub.com/apache-solr-3-enterprise-search-server/book
--
View this message in context: http://lucene.472066.n3.nabble.com/suggestion-howto-handle-highly-repetitive-valued-field-tp4026104p4026131.html
Sent from the Solr - User mailing list archive at Nabble.com.

Mime
View raw message