Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2005710A09 for ; Thu, 9 Jan 2014 09:35:06 +0000 (UTC) Received: (qmail 57970 invoked by uid 500); 9 Jan 2014 09:34:56 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 57894 invoked by uid 500); 9 Jan 2014 09:34:47 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 57877 invoked by uid 99); 9 Jan 2014 09:34:45 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 09 Jan 2014 09:34:45 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of seb2lille@gmail.com designates 209.85.217.176 as permitted sender) Received: from [209.85.217.176] (HELO mail-lb0-f176.google.com) (209.85.217.176) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 09 Jan 2014 09:34:40 +0000 Received: by mail-lb0-f176.google.com with SMTP id l4so2093635lbv.21 for ; Thu, 09 Jan 2014 01:34:19 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:content-type; bh=VedW80Tbtkubgu97IclbZ7sFWHkJ1qdU1xPi7NNR0+I=; b=G1YgfThNlh7MEQat/mp8cnd8XL9RTRuJvCPBOA9pfPJFEaTlqJkLGmdtAQWY1BIvrL /OH4d9I4l/0w8A/WcPgPYDKl+ELaMZ1LARi6gBplgZrpg3bm47EGG9cF8APq/HUyzCxr RFosHo0KlTc9kuwwx6wmWupbOe4oC4iAGMhSakDMl05GwqX/1hacsZZINXcCk+aqdj4p i5uOXxy+p9K4rcUBLJu7geuSXcondrs9ppe+BDnXHd9nvxc2tJ3x76Q5cHS5yYt7rVqx OifnBH+GXWS6YWOU8cpeetqzG0eligLTyfNKdEJcxYreiwAUYw81P43QQAh4aDF83y1T gCSw== MIME-Version: 1.0 X-Received: by 10.112.150.164 with SMTP id uj4mr769774lbb.46.1389260059208; Thu, 09 Jan 2014 01:34:19 -0800 (PST) Sender: seb2lille@gmail.com Received: by 10.112.37.233 with HTTP; Thu, 9 Jan 2014 01:34:19 -0800 (PST) In-Reply-To: References: Date: Thu, 9 Jan 2014 10:34:19 +0100 X-Google-Sender-Auth: 8EXTgu4HGcl9E3BO_esaF4OWgdY Message-ID: Subject: Re: solr OOM Crash From: =?ISO-8859-1?Q?S=E9bastien_Michel?= To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary=047d7b3434bc546f5f04ef8650f8 X-Virus-Checked: Checked by ClamAV on apache.org --047d7b3434bc546f5f04ef8650f8 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi Sandra, Excuse me for the late reply. We use lotsofcores (http://wiki.apache.org/solr/LotsOfCores) Solr feature, around 100 simultaneous loaded cores. But the issue is reproducible with few less cores. We also have a high rate of indexing, and also reindexing (atomic update). We are indexing media files metadata, but also metadata and contents of PDF, the content is stored in a "text" field (stored=3D"true"). Until release 4.3, Solr uses a growing buffer to uncompress stored fields (I assume one buffer per Solr Core or per Shard). The issue comes when Solr read some big docs, the buffer of CompressedStoredFieldReader grows but never shrinks. The more such big docs are read in different threads, the more the Heap usage is growing, until the heap has no more free memory available and GC runs continuously. Analyzing the dump: Class byte[] takes 3gb out of 4gb resourced to the JVM, mainly referenced by CompressingStoredFieldsReader I hope it can help you. S=E9bastien 2013/12/30 Sandra Scott > Hello S=E9bastien, > > Can you give some information about your environment so I can make sure w= e > are having the same problem you had? > Also, did you find out what caused the GC to go crazy or what caused the > increased commit rate? > > Thanks, > Sandra > > > On Thu, Dec 19, 2013 at 12:34 PM, S=E9bastien Michel < > sebastien.michel@atos.net> wrote: > > > Hi Sandra, > > > > I'm not sure if your problem is same as ours, but we encountered the sa= me > > issue on our Solr 4.2, the major memory usage was due to > > CompressingStoredFieldsReader and GC became crazy. > > In our context, we have some stored fields and for some documents the > > content of the text field could be huge. > > > > We resolved our issue with the backport of this fix : > > https://issues.apache.org/jira/browse/LUCENE-4995 > > > > You should also upgrade to Solr 4.4 or more > > > > Regards, > > S=E9bastien > > > > > > 2013/12/12 Sandra Scott > > > > > Helllo, > > > > > > We are experiencing unexplained OOM crashes. We have already seen it = a > > few > > > times, over our different solr instances. The crash happens only at a > > > single shard of the collection. > > > > > > Environment details: > > > 1. Solr 4.3, running on tomcat. > > > 2. 24 Shards. > > > 3. Indexing rate of ~800 docs per minute. > > > > > > Solrconfig.xml: > > > 1. Merge factor 4 > > > 2. Sofrcommit every 10 min > > > 3. Hardcommit every 30 min > > > > > > Main findings: > > > 1. Solr logs: No query failures prior to the OOM, but DOUBLE the amou= nt > > of > > > soft and hard commits in comparison to other shards. > > > 2. Analyzing the dump (VisualVM): Class byte[] takes 4gb out of 5gb > > > resourced to the JVM, mainly referenced by > CompressingStoredFieldsReader > > GC > > > root (which by looking at the code, we suspect they were created due = to > > > CompressingSortedFieldsWriter.merge). > > > > > > Sub findings: > > > 1. GC logs: Showed 108 GC fails prior to the crash. > > > 2. CPI: Overall usage seems fine, but the % of CPU time for the GC > stays > > > high 6 min before the OOM. > > > 3. Memory: Half an hour before OOM the usage slowly rises, until it > gets > > to > > > 5.4gb. > > > > > > Has anyone encountered higher than normal commit rate that seem to > > increase > > > merge rate and cause what I described? > > > > > > --047d7b3434bc546f5f04ef8650f8--