Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8491CD8BC for ; Tue, 16 Oct 2012 22:55:05 +0000 (UTC) Received: (qmail 48971 invoked by uid 500); 16 Oct 2012 22:55:04 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 48898 invoked by uid 500); 16 Oct 2012 22:55:03 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 48652 invoked by uid 99); 16 Oct 2012 22:55:03 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Oct 2012 22:55:03 +0000 Date: Tue, 16 Oct 2012 22:55:03 +0000 (UTC) From: "Michael McCandless (JIRA)" To: dev@lucene.apache.org Message-ID: <2145138737.54497.1350428103742.JavaMail.jiratomcat@arcas> In-Reply-To: <1725772694.48441.1350343266661.JavaMail.jiratomcat@arcas> Subject: [jira] [Commented] (LUCENE-4484) NRTCachingDir can't handle large files MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/LUCENE-4484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13477434#comment-13477434 ] Michael McCandless commented on LUCENE-4484: -------------------------------------------- bq. (Solr defaults to NRTCachingDir) Maybe it shouldn't? Or ... does it also default to NRT searching, like ElasticSearch (I think), i.e. frequently opening a new searcher? In which case it's a good default I think... > NRTCachingDir can't handle large files > -------------------------------------- > > Key: LUCENE-4484 > URL: https://issues.apache.org/jira/browse/LUCENE-4484 > Project: Lucene - Core > Issue Type: Bug > Reporter: Michael McCandless > > I dug into this OOME, which easily repros for me on rev 1398268: > {noformat} > ant test -Dtestcase=Test4GBStoredFields -Dtests.method=test -Dtests.seed=2D89DD229CD304F5 -Dtests.multiplier=3 -Dtests.nightly=true -Dtests.slow=true -Dtests.linedocsfile=/home/hudson/lucene-data/enwiki.random.lines.txt -Dtests.locale=ru -Dtests.timezone=Asia/Vladivostok -Dtests.file.encoding=UTF-8 -Dtests.verbose=true > {noformat} > The problem is the test got NRTCachingDir ... which cannot handle large files because it decides up front (when createOutput is called) whether the file will be in RAMDir vs wrapped dir ... so if that file turns out to be immense (which this test does since stored fields files can grow arbitrarily huge w/o any flush happening) then it takes unbounded RAM. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org