Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 36C4210949 for ; Thu, 24 Oct 2013 09:09:59 +0000 (UTC) Received: (qmail 90145 invoked by uid 500); 24 Oct 2013 09:09:44 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 90095 invoked by uid 500); 24 Oct 2013 09:09:37 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 90012 invoked by uid 99); 24 Oct 2013 09:09:36 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Oct 2013 09:09:36 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jpountz@gmail.com designates 209.85.214.52 as permitted sender) Received: from [209.85.214.52] (HELO mail-bk0-f52.google.com) (209.85.214.52) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Oct 2013 09:09:30 +0000 Received: by mail-bk0-f52.google.com with SMTP id e11so705829bkh.25 for ; Thu, 24 Oct 2013 02:09:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=plFlIbnSXAzrxMtQugEqPkdQz5wFZ1AbXVUCva4+QTU=; b=0ZuXHv/q6zz/X37467bWn4rWvZmJPs63j12c4Fw6zdhhwEnE9KMhYv/IASgBSovun+ d+SMa17bSHIW5wDtSxPKkG+NOJCLapQgTxFhhgcxFfCVal7yUu1F85xZwaIwbo0p2Tlp OMKOUdfQkzTcRt+15AkRem0DUFuPl33W2er+EU/My+vewwGwd8VxPb2ks/z3hcoG1vL0 89knBwwnauY6mW8lwr9N5gup6sJp1D1Rifh3m581v6KqHfGWot40GXOYckiPerXmcfdd 3r3EWBWMpaSoxGH6qmR2gF0vEuaL/Ts1lNevlNETZayQvKWwSsOF5RsRg1/j91nRWB8W /UJQ== X-Received: by 10.205.35.15 with SMTP id su15mr979963bkb.21.1382605749530; Thu, 24 Oct 2013 02:09:09 -0700 (PDT) MIME-Version: 1.0 Received: by 10.204.241.129 with HTTP; Thu, 24 Oct 2013 02:08:49 -0700 (PDT) In-Reply-To: References: From: Adrien Grand Date: Thu, 24 Oct 2013 11:08:49 +0200 Message-ID: Subject: Re: Merging ordered segments without re-sorting. To: java-user@lucene.apache.org Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org Hi, On Thu, Oct 24, 2013 at 12:20 AM, Arvind Kalyan wrote: > I will benchmark the available approach itself then, in that case. Will > revert back if the performance in unacceptable. For the record, last time I checked, indexing was 2x slower on average on a 10M document collection (see https://issues.apache.org/jira/browse/LUCENE-4752?focusedCommentId=13605896&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13605896) and most of the time was not actually spent in sorting the doc IDs but merging stored fields (because we by-pass the specialized sequential merging impl which is usually used when merging segment without sorting). -- Adrien --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org