lucene-solr-user mailing list archives

From Will Milspec
Subject overhead of empty, unused fields
Date Thu, 18 Aug 2011 21:22:35 GMT
Hi all,

What is the cost of unused fields and field types?

Our application supports multiple languages. We envision separate
Lucene/Solr fields (and field types) per language (content_en, content_fr,
etc.).

We thought of a few options:
a) auto-generating the 'multilingual' portion of the schema based on the
application's languages,
b) including fields-and-types for all languages.

In A, if an implementation only used French and Chinese, the schema would
only have content_fr and content_zh_CN fields-and-types.

In B, the implementation would have all field types, but a given document
would only have two fields.

A seems "more efficient", but more work. The downside: if a user wants to
add a language, they would need to regenerate the schema (i.e. add
fields-and-types for "ja").
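
For option A, the generation step could be sketched roughly as below. This is
only a minimal Python sketch under assumptions, not something we have built:
the language_fields helper and the text_<lang> field type names are
hypothetical, and the matching fieldType definitions (one analyzer chain per
language) would still have to be written separately.

```python
from xml.sax.saxutils import quoteattr


def language_fields(languages, prefix="content"):
    """Return one schema.xml <field> line per language code.

    The per-language type name (text_<lang>) is a hypothetical naming
    convention; the schema must define a fieldType with that name.
    """
    lines = []
    for lang in languages:
        name = f"{prefix}_{lang}"
        field_type = f"text_{lang}"  # assumed fieldType name, not a Solr default
        lines.append(
            f"<field name={quoteattr(name)} type={quoteattr(field_type)} "
            'indexed="true" stored="true"/>'
        )
    return lines


if __name__ == "__main__":
    # An implementation that only uses French and Chinese.
    for line in language_fields(["fr", "zh_CN"]):
        print(line)
```

Adding "ja" later would mean rerunning this with the extra language code and
reloading the schema, which is the regeneration cost mentioned above.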

How much do empty field types and fields cost? Do a dozen or so unused field
types hurt the scalability of indexing or search?


