From solr-user-return-144554-archive-asf-public=cust-asf.ponee.io@lucene.apache.org Tue Oct 23 17:18:41 2018 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id 52B9818066B for ; Tue, 23 Oct 2018 17:18:41 +0200 (CEST) Received: (qmail 6917 invoked by uid 500); 23 Oct 2018 15:18:39 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Delivered-To: moderator for solr-user@lucene.apache.org Received: (qmail 13076 invoked by uid 99); 23 Oct 2018 14:20:42 -0000 X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.898 X-Spam-Level: * X-Spam-Status: No, score=1.898 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (1024-bit key) header.d=sheroes.in X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=d+khaVV7DeMqspQPPwz6wSURRz8TYVS+kwjaUL/FyDM=; b=G2STkLjVr+hSDNehijOqy+Rrjivbq8kVkM6o2WlhpNhLqDb56PqnYcu34pkuUPLTMB septaR5CajR5Gb0a+0R9JoCJn1NgLfeGYnDqOKcOpxHxBBlQrpIRICvtwIA4aXsOYejN qNgO0xOV+MqUArW4mTh/6t9iHW58s2mGT21p89F4Pe3f8WOqfyW1wd81OgkHtznLyQ6J wg95jq9QhUVSLBs65VTM6zfgfpL/h9a4t0+2p4ZtcSjaPffDt7rTu9gDKR0C3ZOWE57U Qmc99S0FKcVwVjL9qchjNIY51Pxvq+Z9wuwYJH9rPOoo42UMmB35QWXDkQ2rYgO+TW0h 5LyQ== X-Gm-Message-State: ABuFfogwr4MWH4iIPmeN+wQlPeBIfQFxHqk3ZpTqFH8MYA+392yTUITv SDIVblfU2G1aO6t9MZcRAevhWxqRLBoX88idABoKEBeFT0Y= X-Google-Smtp-Source: ACcGV61cqMQ3ULlQev0FlkI+cB1dKGft2XCTOXfHqN4YdNHC6aEFxmRadp93U27p6sqGKlE4tiYU0HJ2Fhxp4UUkSkE= X-Received: by 2002:a25:9d11:: with SMTP id i17-v6mr34871847ybp.456.1540304432269; Tue, 23 Oct 2018 07:20:32 -0700 (PDT) MIME-Version: 1.0 From: Gauri Dhawan Date: Tue, 23 Oct 2018 19:50:21 +0530 Message-ID: Subject: Regarding multi keyword search To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary="000000000000881f470578e612ec" --000000000000881f470578e612ec Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Hi! I have been facing an issue for quite some time and haven't been able to come to a solution as of yet. We are trying to implement search on our platform and all our data is stored in Solr. I have a field `description` which is the field where I have to search. It is of the field type `text_edit_suggest` and it looks something like thi= s > > > > > pattern=3D"([\.,;:-_])" replacement=3D" " replace=3D"all"/> > minGramSize=3D"1"/> > pattern=3D"([^\w\d\*=C3=A6=C3=B8=C3=A5=C3=86=C3=98=C3=85 ])" replacement= =3D"" replace=3D"all"/> > ignoreCase=3D"true" expand=3D"false"/> > > > > > > > > pattern=3D"([\.,;:-_])" replacement=3D" " replace=3D"all"/> > pattern=3D"([^\w\d\*=C3=A6=C3=B8=C3=A5=C3=86=C3=98=C3=85 ])" replacement= =3D"" replace=3D"all"/> > pattern=3D"^(.{30})(.*)?" replacement=3D"$1" replace=3D"all"/> > ignoreCase=3D"true" expand=3D"false"/> > When I search for multiple keywords, the results are unexpected. For example : I want to search for the words `first` and `post` and both these words should be present in the description field of the document else it shouldn't return the document. I've tried some 50+ queries for this. Used `edismax` parser as well but in vain. Tried boosting as well. But most queries result in weight given to either one of the keywords and results in documents that have that keyword but not the other. Can you guys help? Thanks in advance! Gauri Dhawan Associate Software Engineer SHEROES --000000000000881f470578e612ec--