Message-ID: <194314128.1221478669080.JavaMail.jira@brutus>
Date: Mon, 15 Sep 2008 04:37:49 -0700 (PDT)
From: "Mck SembWever (JIRA)"
To: java-dev@lucene.apache.org
Reply-To: java-dev@lucene.apache.org
Subject: [jira] Updated: (LUCENE-1380) Patch for ShingleFilter.enablePositions
In-Reply-To: <1699917986.1221044324297.JavaMail.jira@brutus>

     [ https://issues.apache.org/jira/browse/LUCENE-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mck SembWever updated LUCENE-1380:
----------------------------------

    Description:
Make it possible for *all* words and
shingles to be placed at the same position, that is for _all_ shingles (and unigrams if included) to be treated as synonyms of each other.
Today the shingles generated are synonyms only to the first term in the shingle.
For example, the query "abcd efgh ijkl" results in:
   ("abcd" "abcd efgh" "abcd efgh ijkl") ("efgh" "efgh ijkl") ("ijkl")
where "abcd efgh" and "abcd efgh ijkl" are synonyms of "abcd", and "efgh ijkl" is a synonym of "efgh".
There exists no way today to alter which token a particular shingle is a synonym for.
This patch takes the first step in making it possible to make all shingles (and unigrams if included) synonyms of each other.
See http://comments.gmane.org/gmane.comp.jakarta.lucene.user/34746 for mailing list thread.

  was:
Make it possible for *all* words and shingles to be placed at the same position, that is to _all_ be treated as synonyms of each other.
Today the shingles generated are synonyms only to the first term in the shingle.
For example, the query "abcd efgh ijkl" results in:
   ("abcd" "abcd efgh" "abcd efgh ijkl") ("efgh" "efgh ijkl") ("ijkl")
where "abcd efgh" and "abcd efgh ijkl" are synonyms of "abcd", and "efgh ijkl" is a synonym of "efgh".
There exists no way today to alter which token a particular shingle is a synonym for.
This patch takes the first step in making it possible to make all shingles (and unigrams if included) synonyms of each other.
See http://comments.gmane.org/gmane.comp.jakarta.lucene.user/34746 for mailing list thread.

    Lucene Fields: [New, Patch Available]  (was: [Patch Available, New])

typo in editing description.

> Patch for ShingleFilter.enablePositions
> ---------------------------------------
>
>                 Key: LUCENE-1380
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1380
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/analyzers
>            Reporter: Mck SembWever
>            Assignee: Karl Wettin
>            Priority: Trivial
>         Attachments: LUCENE-1380.patch, LUCENE-1380.patch
>
>
> Make it possible for *all* words and shingles to be placed at the same position, that is for _all_ shingles (and unigrams if included) to be treated as synonyms of each other.
> Today the shingles generated are synonyms only to the first term in the shingle.
> For example, the query "abcd efgh ijkl" results in:
>    ("abcd" "abcd efgh" "abcd efgh ijkl") ("efgh" "efgh ijkl") ("ijkl")
> where "abcd efgh" and "abcd efgh ijkl" are synonyms of "abcd", and "efgh ijkl" is a synonym of "efgh".
> There exists no way today to alter which token a particular shingle is a synonym for.
> This patch takes the first step in making it possible to make all shingles (and unigrams if included) synonyms of each other.
> See http://comments.gmane.org/gmane.comp.jakarta.lucene.user/34746 for mailing list thread.
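
For readers who want to see the two groupings side by side, here is a rough, self-contained sketch of the positioning described in the issue. It is only a model: it does not use Lucene or the attached patch, and the class name ShinglePositionSketch, the method group, and the allAtSamePosition flag are hypothetical names made up for illustration. Each inner list stands for one token position, so terms in the same inner list would be treated as synonyms of each other.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative model only: it does not use or reproduce Lucene's ShingleFilter.
class ShinglePositionSketch {

    // Groups the unigrams and shingles built from `words` by token position.
    // allAtSamePosition == false: each shingle shares the position of its first word.
    // allAtSamePosition == true:  every unigram and shingle lands in one shared position.
    static List<List<String>> group(String[] words, int maxShingleSize, boolean allAtSamePosition) {
        List<List<String>> positions = new ArrayList<>();
        List<String> shared = new ArrayList<>();
        for (int start = 0; start < words.length; start++) {
            List<String> atThisPosition = allAtSamePosition ? shared : new ArrayList<>();
            StringBuilder shingle = new StringBuilder();
            // Grow the shingle one word at a time, starting from the unigram.
            for (int size = 1; size <= maxShingleSize && start + size <= words.length; size++) {
                if (size > 1) {
                    shingle.append(' ');
                }
                shingle.append(words[start + size - 1]);
                atThisPosition.add(shingle.toString());
            }
            if (!allAtSamePosition) {
                positions.add(atThisPosition);
            }
        }
        if (allAtSamePosition && !shared.isEmpty()) {
            positions.add(shared);
        }
        return positions;
    }

    public static void main(String[] args) {
        String[] words = {"abcd", "efgh", "ijkl"};
        // Prints [[abcd, abcd efgh, abcd efgh ijkl], [efgh, efgh ijkl], [ijkl]]
        System.out.println(group(words, 3, false));
        // Prints [[abcd, abcd efgh, abcd efgh ijkl, efgh, efgh ijkl, ijkl]]
        System.out.println(group(words, 3, true));
    }
}
```

The first call reproduces the grouping quoted in the description; the second shows the end state the issue is working toward, which the patch itself only takes a first step to enable.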