From: Uwe Schindler
To: java-user@lucene.apache.org, Sadaf
Subject: Re: OOM error
Date: Fri, 23 Dec 2016 18:26:30 +0000
Message-ID: <917D6CDB-C1BC-4C48-8244-7C03E77070FF@thetaphi.de>
In-Reply-To: <1482513290841-4311073.post@n3.nabble.com>

Hi,

For sorting you should add the fields as DocValues fields, in addition to the indexed fields (if you also query on them). That is the problem. But for that you should update your Lucene version.

For that reason Lucene 6 no longer has FieldCache, which emulates DocValues by loading everything onto the heap; that is horrible for StringFields.

Uwe

On 23 December 2016 18:14:50 CET, Sadaf wrote:
>Hi,
>
>This is the index we are using:
>Number of fields: 355.
>Number of documents: 225 thousand.
>Number of terms: 5522 thousand.
>The index size is around 800MB.
>TermInfos index divisor: 1
>Index format: Lucene 4.0.
>We are getting Java OutOfMemory error with searches. We are using a heap
>size of 1GB. We are not able to increase our heap size.
>Looking at the HeapDump, we have two suspects: FieldCacheImpl and
>FieldCacheImpl$BinaryDocValuesImpl.
>The searches we are doing have a lot of sorts. The sorting is being
>done on string fields. I don’t have much experience with OutOfMemory errors or
>with Lucene. What should I try to do?
>- Should I try to play with the swappiness on the server. Currently set to 60?
>- We are creating a new IndexSearcher for each search? Should I just have
>one? If I do that, will my results be updated as new documents are being
>added to the index?
>- If you think that sorting on StringFields is the problem can you give me
>some pointers on what the usual suspects are?
>- Should I insist that we need more Heap?
>Thanks, (I will not be checking my email until new year. Wishing everyone
>here Happy Holidays)
>Sadaf

-- 
Uwe Schindler
Achterdiek 19, 28357 Bremen
https://www.thetaphi.de
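
The DocValues advice in the reply above could look roughly like the sketch below. It is not code from the thread: it assumes a recent Lucene release (5.x or later) on the classpath, and the class name, method names, and the field name "title" are hypothetical. Each sortable string is added twice under the same name, once as a StringField for querying and once as a SortedDocValuesField for sorting.

```java
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.document.SortedDocValuesField;
import org.apache.lucene.document.StringField;
import org.apache.lucene.search.Sort;
import org.apache.lucene.search.SortField;
import org.apache.lucene.util.BytesRef;

public class SortableDocExample {

    // Hypothetical field name; the original index's field names are not given.
    static final String TITLE = "title";

    // Builds a document whose title can be both queried and sorted on.
    static Document buildDoc(String title) {
        Document doc = new Document();
        // Indexed copy: only needed if the field is also queried.
        doc.add(new StringField(TITLE, title, Field.Store.NO));
        // DocValues copy: written at index time and used for sorting, so the
        // sort does not have to un-invert the field onto the heap.
        // A SortedDocValuesField holds one value per document per field.
        doc.add(new SortedDocValuesField(TITLE, new BytesRef(title)));
        return doc;
    }

    // Sort specification that reads the DocValues written above.
    static Sort titleSort() {
        return new Sort(new SortField(TITLE, SortField.Type.STRING));
    }
}
```

The Sort returned by titleSort() would then be passed to IndexSearcher.search(query, n, sort). Documents already in a Lucene 4.0 index carry no DocValues for these fields, so they would need to be reindexed before such a sort can work.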