From: Kudrettin Güleryüz
Date: Tue, 08 Mar 2016 16:16:06 +0000
Subject: debugging IndexSearcher.search performance
To: "java-user@lucene.apache.org"

Hi,

The code I am working on is spending a long time in this function:

Searcher.java:221 org.apache.lucene.search.IndexSearcher.search(Query, int, Sort) 94400ms 95%

The query fed to the function
looks ugly at first, and I initially thought it could be the culprit:

+body:/.*foo/ +((+dir1:foo +dir2:bar) (+dir1:baz) (+dir1:bin) ...(the list goes on for all top-level directories indexed, for a total of ~60 directories))

However, the same query completes much faster in Luke. It shouldn't be because of the Sort either, because it takes a long time even when there are no matches.

Any suggestions while I debug this issue? I am currently using Lucene 5.

Thanks,
Kudret
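
One way to check whether the time goes into the search call itself or into something around it is to time the call in isolation over several runs. The following is only a minimal sketch using the JDK alone: `SearchTiming`, `timeRuns`, and the `Callable` wrapper are hypothetical names, not part of Lucene, and the task passed in is assumed to wrap the `IndexSearcher.search(Query, int, Sort)` call from the profile above.

```java
import java.util.concurrent.Callable;

// Hypothetical helper, not part of Lucene: times repeated executions of a task.
final class SearchTiming {

    // Runs the task `runs` times and returns the elapsed wall-clock time
    // of each run in milliseconds, in execution order.
    static long[] timeRuns(Callable<?> task, int runs) throws Exception {
        long[] elapsedMs = new long[runs];
        for (int i = 0; i < runs; i++) {
            long start = System.nanoTime();
            task.call(); // assumed to wrap the search call under investigation
            elapsedMs[i] = (System.nanoTime() - start) / 1_000_000L;
        }
        return elapsedMs;
    }
}
```

Assuming `searcher`, `query`, and `sort` are the objects already used in the application, a call might look like `SearchTiming.timeRuns(() -> searcher.search(query, 10, sort), 5)`. Comparing the first run with the later ones would show whether the cost is a one-time warm-up or recurs on every call.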