lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "peter chang (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-3638) IndexReader.document always return a doc with all the stored fields loaded. And this can be slow for the indexed document contain huge fields
Date Mon, 12 Dec 2011 03:38:31 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-3638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167334#comment-13167334
] 

peter chang commented on LUCENE-3638:
-------------------------------------

i upload this patch just for convenience
{code:title=IndexSearcher.java|borderStyle=solid}
  /* Sugar for <code>.getIndexReader().document(docID)</code> */
  /** see {@link IndexReader#document(int, Set, Set)} for detail*/
  public Document doc(int docID, Set<String> fieldsToAdd, Set<String> fieldsToFilter)
throws CorruptIndexException, IOException {
	return reader.document(docID, fieldsToAdd, fieldsToFilter);
  }
{code}
here, you see the IS also has the access to document fetch. so in this case, IS will look
like powerless if IR can not supply such method or interface to the external.

                
> IndexReader.document always return a doc with all the stored fields loaded. And this
can be slow for the indexed document contain huge fields
> ---------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-3638
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3638
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: core/index, core/search
>    Affects Versions: 4.0
>         Environment: 64bit linux java 1.6
>            Reporter: peter chang
>            Priority: Minor
>              Labels: patch
>             Fix For: 4.0
>
>         Attachments: doc.fields.patch
>
>
> when generating digest for some documents with huge fields, it should be unnecessary
to load the field but just interesting part of the field with the offset information. but
indexreader always return the whole field content. afterward, the customized storedfieldsreader
will got a repeated loading

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message