lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-4484) NRTCachingDir can't handle large files
Date Tue, 16 Oct 2012 12:49:03 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-4484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13476964#comment-13476964
] 

Robert Muir commented on LUCENE-4484:
-------------------------------------

{quote}
... it seems hairy.

We could also just leave it be, ie advertise this limitation. NRTCachingDir is already hairy
enough... The purpose of this directory is to be used in an NRT setting where you have relatively
frequent reopens compared to the indexing rate, and this naturally keeps files plenty small.
{quote}

This seems fine to me. I think lets just do javadocs?

Because in general there are lots of other combinations of stupid parameters that can cause
OOM/Out of Open Files/etc and we can't prevent all of them.
                
> NRTCachingDir can't handle large files
> --------------------------------------
>
>                 Key: LUCENE-4484
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4484
>             Project: Lucene - Core
>          Issue Type: Bug
>            Reporter: Michael McCandless
>
> I dug into this OOME, which easily repros for me on rev 1398268:
> {noformat}
> ant test  -Dtestcase=Test4GBStoredFields -Dtests.method=test -Dtests.seed=2D89DD229CD304F5
-Dtests.multiplier=3 -Dtests.nightly=true -Dtests.slow=true -Dtests.linedocsfile=/home/hudson/lucene-data/enwiki.random.lines.txt
-Dtests.locale=ru -Dtests.timezone=Asia/Vladivostok -Dtests.file.encoding=UTF-8 -Dtests.verbose=true
> {noformat}
> The problem is the test got NRTCachingDir ... which cannot handle large files because
it decides up front (when createOutput is called) whether the file will be in RAMDir vs wrapped
dir ... so if that file turns out to be immense (which this test does since stored fields
files can grow arbitrarily huge w/o any flush happening) then it takes unbounded RAM.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message