Return-Path: Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: (qmail 90814 invoked from network); 18 Oct 2009 19:26:23 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 18 Oct 2009 19:26:23 -0000 Received: (qmail 19898 invoked by uid 500); 18 Oct 2009 19:26:21 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 19824 invoked by uid 500); 18 Oct 2009 19:26:20 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 19814 invoked by uid 99); 18 Oct 2009 19:26:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 18 Oct 2009 19:26:20 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of germanbiozzoli@gmail.com designates 209.85.220.216 as permitted sender) Received: from [209.85.220.216] (HELO mail-fx0-f216.google.com) (209.85.220.216) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 18 Oct 2009 19:26:12 +0000 Received: by fxm12 with SMTP id 12so4440196fxm.5 for ; Sun, 18 Oct 2009 12:25:52 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=y/94yum6L0zI4TlB42r9B3Oont1I0bNyPVO32+DeOpU=; b=lYvPGKAtV34V7O9Y+8KISGJOz8PVglVcWPFZ8PnNphcx2QyqKQKQZ6DNxMhayTuxqU mEhfE0fQlp+Avl2WiZiuNz6jYHE/SYqbMwT+rVSsZ08ACUVzu9mLNmp4C00rrnhe+X4l p6mMd56LJ+ls7L7oXG/dyN49BWKz+KC2pjduI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=wHb1KIBu4LgpC18Czm1kGfqI/rveq2+bikHeFParkBkojxaOymqzg14ci/lP5EaaFm KyiDC23gv/S8uh7LnG84uI5TGOphuoJzbTuzIdD9ZNxo8Wyq6a1eDTCiJHiLlx5o1rbS 6b8DDMHJXINhjVGjp0gh136fN7TTeczhJ+CmQ= MIME-Version: 1.0 Received: by 10.223.63.202 with SMTP id c10mr707474fai.73.1255893952163; Sun, 18 Oct 2009 12:25:52 -0700 (PDT) In-Reply-To: <164816.57438.qm@web52903.mail.re2.yahoo.com> References: <2133efdb0910171832o3724c668k5898e785a6aa420e@mail.gmail.com> <164816.57438.qm@web52903.mail.re2.yahoo.com> Date: Sun, 18 Oct 2009 16:25:52 -0300 Message-ID: <2133efdb0910181225s3bf55a11p1330a5bf42351ca4@mail.gmail.com> Subject: Re: Problem with Query Parser From: =?ISO-8859-1?Q?Germ=E1n_Biozzoli?= To: solr-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Thanks Ahmet. Definitely using analyzer appears the english porter as the killer ;) Regards German On Sun, Oct 18, 2009 at 7:30 AM, AHMET ARSLAN wrote: > >> Hi everybody >> >> I have a simple but (for me) annoying problem. I'm happy >> user of Solr >> 1.4 with a small collection of documents. Today one of the >> users has >> reported that a query returns documents that are >> non-pertinent to the >> expression. I have spanish, portuguese and english text >> inside the >> collection. Using the Solr administration interface I've >> found that >> she was right, if I search for the spanish term >> "represion", I found >> just only the word root, I mean it returns every document >> with the >> term "repres". Using the admin-debug search I found this: >> >> >> >> > name=3D"rawquerystring">description:represion >> > name=3D"querystring">description:represion >> > name=3D"parsedquery">description:repres >> > name=3D"parsedquery_toString">description:repres >> >> the "ion" part of the term was deleted by the query parser. >> The first >> question is: I don=B4t know now where should I see to >> correct this, at >> the schema.xml or at the solrconfig.xml. > >> The only thing that is suspicious to me is the >> EnglishPorter. > > Yes you are right. "ion" part of the term was deleted by it. You can veri= fy this using /admin/analysis.jsp page. It will tell you which TokenFilterF= actory removes it. > >> I've deleted from the configuration but nothing changes. Should >> I reindex the collection to see the changes? > > Yes re-index is necessary. > >> Should I delete also from the index section? > > You should remove English porter from both query and index analyzer. > >> What I will loose deleting English porter? > > You will lose stemming functionality. But since you have spanish, portugu= ese and english documents using English porter for all the documents is not= meaningful. > > > > >