incubator-blur-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron McCurry (JIRA)" <j...@apache.org>
Subject [jira] [Created] (BLUR-220) Support for humongous Rows
Date Thu, 29 Aug 2013 10:20:51 GMT
Aaron McCurry created BLUR-220:
----------------------------------

             Summary: Support for humongous Rows
                 Key: BLUR-220
                 URL: https://issues.apache.org/jira/browse/BLUR-220
             Project: Apache Blur
          Issue Type: Improvement
          Components: Blur
    Affects Versions: 0.3.0
            Reporter: Aaron McCurry
             Fix For: 0.3.0


One of the limitations of Blur is size of Rows stored, specifically the number of Records.
 The current updates are performed on Lucene is by deleting the document and re-adding to
the index.  Unfortunately when any update is perform on a Row in Blur, the entire Row has
to be re-read (if the RowMutationType is UPDATE_ROW) and then whatever modification needs
are made then it is reindexed in it's entirety.

Due to all of this overhead, there is a realistic limit on the size of a given Row.  It may
vary based the kind of hardware that is being used, as the Row grows in size the indexing
(mutations) against that Row will slow.

This issue is being created to discuss techniques on how to deal with this problem.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message