Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D75B01016B for ; Tue, 30 Dec 2014 16:56:57 +0000 (UTC) Received: (qmail 16403 invoked by uid 500); 30 Dec 2014 16:56:54 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 16337 invoked by uid 500); 30 Dec 2014 16:56:54 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 16305 invoked by uid 99); 30 Dec 2014 16:56:51 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 30 Dec 2014 16:56:51 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [162.129.8.130] (HELO smtpauth.johnshopkins.edu) (162.129.8.130) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 30 Dec 2014 16:56:46 +0000 X-IronPort-AV: E=Sophos;i="5.07,666,1413259200"; d="scan'208";a="63531896" Received: from msel-sysmac14.mse.jhu.edu ([10.161.51.103]) by IPEB1.johnshopkins.edu with ESMTP/TLS/DHE-RSA-AES128-SHA; 30 Dec 2014 11:53:25 -0500 Message-ID: <54A2D885.7080300@jhu.edu> Date: Tue, 30 Dec 2014 11:53:25 -0500 From: Jonathan Rochkind User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.8; rv:24.0) Gecko/20100101 Thunderbird/24.6.0 MIME-Version: 1.0 To: solr-user@lucene.apache.org Subject: Re: WordDelimiter filter, expanding to multiple words, unexpected results References: <5405F354.5070901@jhu.edu> <5405F958.9040208@jhu.edu> <54061DE7.3060801@jhu.edu> <54073842.9050109@jhu.edu> <54A1D1E1.1020209@jhu.edu> <54A1DE94.8030102@jhu.edu> <54A2CEEA.70805@jhu.edu> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org On 12/30/14 11:45 AM, Alexandre Rafalovitch wrote: > On 30 December 2014 at 11:12, Jonathan Rochkind wrote: >> I'm a bit confused about what splitOnCaseChange combined with catenateWords >> is meant to do at all. It _is_ generating both the split and single-word >> tokens at query time > > Have you tried only having WDF during indexing with both options set? > And same chain but without WDF at all during query? Without WDF at all in the query, then "mixedCase" in query would match "mixedCase" in index, but would no longer match "mixed Case" in index. I thought I was using WDF in such a way that "mixedCase" in query could match both/either "mixedCase" and/or "mixed Case" in the index. And I thought this was an intended use case of the WDF. But perhaps I was wrong, and the WDF simply can't do this? Is WDF intended mainly for use at index time and not query time? In general, I'm confused about the various things WDF can and can't do, and the various configurations to make it do that. Thanks for everyone's advice.