lucene-solr-user mailing list archives

From David Smiley
Subject Re: suggestion howto handle highly repetitive valued field
Date Tue, 11 Dec 2012 19:29:04 GMT
The indexed="true" side is quite efficient.  The stored="true" side -- not so
much, but the strings you have here are pretty small and I wouldn't worry
about it.  Solr 4.1 (unreleased) does a great job here and compresses all
the stored field data across documents.

~ David
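
To put a rough number on the stored="true" side, here is a back-of-the-envelope sketch. It assumes 100 million documents as an illustrative figure and counts only the raw UTF-8 bytes of the two example values from the question ('sha1' and 'message'). It ignores any per-field overhead in the on-disk format and any compression, so treat it as an order-of-magnitude estimate only. The function name `raw_stored_bytes` is made up for this example.

```python
def raw_stored_bytes(num_docs, field_values):
    """Raw UTF-8 payload size if every document stores the same field values.

    Ignores per-field overhead and compression; this is only an estimate.
    """
    per_doc = sum(len(value.encode("utf-8")) for value in field_values.values())
    return num_docs * per_doc


# Assumption: 100 million documents, each storing the same two small strings.
num_docs = 100_000_000
values = {"digest": "sha1", "type": "message"}

total = raw_stored_bytes(num_docs, values)
print(f"{total / 1e9:.1f} GB of raw stored values")  # prints "1.1 GB of raw stored values"
```

At 11 bytes per document, the repeated values come to roughly a gigabyte of raw payload per 100 million documents before any compression, which is small relative to an index of that size.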

Jie Sun wrote
> Hi -
> our indexed documents currently store Solr fields like 'digest' or 'type',
> for which most of our documents end up with the same value (such as 'sha1'
> for field 'digest', or 'message' for field 'type', etc.).
> On each Solr server we usually have 100s of millions of documents indexed
> with the same value in these fields (the fields are stored and indexed).
> Any suggestion on the best approach if we suspect this will be very
> inefficient in disk space usage? Or is it?
> thanks!
> Jie
