Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 91468 invoked from network); 6 Dec 2007 00:43:11 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 6 Dec 2007 00:43:11 -0000 Received: (qmail 30922 invoked by uid 500); 6 Dec 2007 00:42:55 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 29939 invoked by uid 500); 6 Dec 2007 00:42:53 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 29928 invoked by uid 99); 6 Dec 2007 00:42:53 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Dec 2007 16:42:53 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [203.217.22.128] (HELO file1.syd.nuix.com.au) (203.217.22.128) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 Dec 2007 00:42:33 +0000 Received: from host68.syd.nuix.com.au (host68.syd.nuix.com.au [192.168.222.68]) by file1.syd.nuix.com.au (Postfix) with ESMTP id 6557D4A823D for ; Thu, 6 Dec 2007 11:41:54 +1100 (EST) From: Daniel Noll Organization: Nuix Pty Ltd To: java-user@lucene.apache.org Subject: Tricky (maybe) query question Date: Thu, 6 Dec 2007 11:42:32 +1100 User-Agent: KMail/1.9.6 (enterprise 0.20070907.709405) MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200712061142.32839.daniel@nuix.com> X-Virus-Checked: Checked by ClamAV on apache.org Hi all. Suppose you have a text index with a field used for deduplication, and then you later add a second field with further information that might also be used for deduplication. We'll call them A and B for the sake of brevity. If I have only a current text index, then I can use (a:foo AND b:bar) to deduplicate. However, I still want to deduplicate between the older ones which don't have B and the new ones which do. Is there a way I can do a query which will: - Match a document if both a:foo and b:bar are matched - Match a document if a:foo matches and b is absent, or vice versa. - Not match a document if both a:foo and b:foo are absent - Not match a document if either a:foo or b:foo are present and do not match ? If not, I suppose I'll have to go with the lowest common denominator approach and find out which fields are present in every index. Daniel --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org