lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ravikumar Govindarajan <ravikumar.govindara...@gmail.com>
Subject Block-Join and number of child-documents
Date Thu, 31 Jul 2014 07:29:29 GMT
We are planning to use block-indexing and ToChildBlockJoin queries...

Each parent-doc can contain anywhere between 1-2000 children-docs and is
highly variable.

A sample user-stats is as follows

1. No.of. parent-docs = 500K
2. Children -per parent = 50
3. Total-docs = 25 Million
4. Size occupied = 7 GB

A given index contains many such users but we are planning to limit the
size to 32GB per-index. When exceeded, addDocuments() call moves to newer
index.

The number of docs in one 32GB index sounds very scary. Are queries
affected by such high number of documents? Filters, AcceptDocs etc...
 could also become memory-heavy no?

Is Block-Join the correct fit for the above scenario?

--
Ravi

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message