Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 5F5DB200CC6 for ; Tue, 18 Jul 2017 22:48:15 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 5DFA716795D; Tue, 18 Jul 2017 20:48:15 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id A401716795C for ; Tue, 18 Jul 2017 22:48:14 +0200 (CEST) Received: (qmail 89475 invoked by uid 500); 18 Jul 2017 20:48:13 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Delivered-To: moderator for java-user@lucene.apache.org Received: (qmail 35452 invoked by uid 99); 18 Jul 2017 19:52:20 -0000 X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.498 X-Spam-Level: ** X-Spam-Status: No, score=2.498 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=2, KAM_NUMSUBJECT=0.5, RCVD_IN_DNSWL_NONE=-0.0001, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001] autolearn=disabled From: Rilpa Jain To: "java-user@lucene.apache.org" Subject: Migration to Lucene 6.5 Thread-Topic: Migration to Lucene 6.5 Thread-Index: AdL//NI1NOwajoWZQh6J2ruPNAibow== Date: Tue, 18 Jul 2017 19:52:10 +0000 Message-ID: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.170.230.60] Content-Type: multipart/alternative; boundary="_000_A597E46153D8A042ACBFAB96DC848FD07ED93FFAUSOPWYMBX01nyof_" MIME-Version: 1.0 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2017-07-18_10:,, signatures=0 archived-at: Tue, 18 Jul 2017 20:48:15 -0000 --_000_A597E46153D8A042ACBFAB96DC848FD07ED93FFAUSOPWYMBX01nyof_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Hi, We plan to migrate from lucene 5.5 to 6.5. We have been using DocValuesTerm= sFilter extensively which was deprecated in Lucene 5.5 and removed in Lucen= e 6.0. The Javadoc specifies to use DocValuesTermsQuery and BoolenaClause.Occur.Fi= lter instead. However, as per our local tests, the time taken to search doc= uments has increased with this change. Below is one of the scenarios in our application - We do a search within a search. (Before migration to Lucene 5.5) 1. The first search is on a text field with discrete values. (There is= no pattern to the value of this text field. Here the terms[] ranges from 1= to 200k in size.) - We use DocValuesTermsFilter and pass it is as Filter = parameter to search method. 2. The second search is on result of step 1- This could be either a Te= rmQuery or NumericRangeQuery, evaluated to query and added as query paramet= er to search method. (After migration to Lucene 6.5) 1. The first search is on a text field with discrete values. (There is= no pattern to the value of this text field. Here the terms[] ranges from 1= to 200k in size.) - We use DocValuesTermsQuery and add it to BooleanQuery= with Occur.Filter. 2. The second search is on result of step 1- This could be either a Te= rmQuery or NumericRangeQuery added to BooleanQuery with Occur.MUST. 3. The booleanQuery is build and passed to search method. This query execution after migration takes 5x-10x times more as compared to= using DocValuesTermsFilter. Is there a better class to generate query in our scenario than the one used= above? Or is there anything that I am missing? Any insights would help! Thanks. ________________________________________________________________________ The information in this email is confidential and may be legally privileged= . It is intended solely for the addressee. Access to this email by anyone e= lse is unauthorized. If you are not the intended recipient, any disclosure,= copying, distribution or any action taken or omitted to be taken in relian= ce on it, is prohibited and may be unlawful. Tradeweb reserves the right to monitor and review the content of all messag= es sent to or from this e-mail address. Messages sent to or from this e-mai= l address may be stored on the Tradeweb e-mail system. --_000_A597E46153D8A042ACBFAB96DC848FD07ED93FFAUSOPWYMBX01nyof_--