jackrabbit-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alex Parvulescu (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (JCR-3282) Optimize usage of norms
Date Wed, 28 Mar 2012 14:41:28 GMT

     [ https://issues.apache.org/jira/browse/JCR-3282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Alex Parvulescu updated JCR-3282:

    Attachment: JCR-3282.patch

attaching proposed patch.

Based on the indexing config we know if a field has its boost changed or not, so if a field
has no boost setting we can safely disable norms.
> Optimize usage of norms
> -----------------------
>                 Key: JCR-3282
>                 URL: https://issues.apache.org/jira/browse/JCR-3282
>             Project: Jackrabbit Content Repository
>          Issue Type: Improvement
>          Components: indexing, jackrabbit-core
>            Reporter: Alex Parvulescu
>            Assignee: Alex Parvulescu
>         Attachments: JCR-3282.patch
> There is a very significant potential for optimizing the size of the search index.
> We have seen a case where there were multiple segments with about the same number of
nodes (roughly 10 million), but the size on disk was very different.
> One segment was 19 GB while all others where around 3 GB. The major difference was the
number of fields indexed. The large segment had significantly more fields, which resulted
in a large norms file.
> We should go through our implementation and see where norms are really necessary and
disable tracking of norms wherever possible.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message