lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-3855) DocValues support
Date Thu, 08 Nov 2012 11:47:12 GMT

    [ https://issues.apache.org/jira/browse/SOLR-3855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13493128#comment-13493128
] 

Yonik Seeley commented on SOLR-3855:
------------------------------------

Regarding performance - it seems like for most users, the number of docvalue fields should
be relatively small.
One of the big advantages to DocValues is the better caching by the OS - so "seeks" should
often never hit the disk.
For those users where performance is a concern, they should set "fl" to retrieve only those
fields they absolutely need.
Also consider existing working clients where the solr server changes the storage type of the
field for better performance - that shouldn't be visible to the client (just as changing the
precisionStep of a trie type should not be visible).

Naming:
At some time in the past I was considering storeSep=true/false (until there were multiple
ways to store separately), but I was never crazy about the name.  But since we're just trying
to say how the field should be stored, perhaps just overload that parameter?

stored=true  // same as today
stored=[docValues method]  // store separately using the given method

I'm not sold on it or anything... just throwing out ideas.

I like the separate param for "disk"/"memory" or "direct"/"memory" - the default access method
for the field really is different from how it's stored.
But it seems like that should just be a default and one should be able to access the field
via direct or memory depending on the situation?
For simply adding additional return fields, direct seems the right approach, unless it's already
been loaded into memory, in which case it would be a nice optimization to use that.




                
> DocValues support
> -----------------
>
>                 Key: SOLR-3855
>                 URL: https://issues.apache.org/jira/browse/SOLR-3855
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Assignee: Adrien Grand
>            Priority: Minor
>             Fix For: 4.1, 5.0
>
>         Attachments: SOLR-3855.patch, SOLR-3855.patch
>
>
> It would be nice if Solr supported DocValues:
>  - for ID fields (fewer disk seeks when running distributed search),
>  - for sorting/faceting/function queries (faster warmup time than fieldcache),
>  - better on-disk and in-memory efficiency (you can use packed impls).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message