Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9A2884755 for ; Sun, 3 Jul 2011 15:26:13 +0000 (UTC) Received: (qmail 11719 invoked by uid 500); 3 Jul 2011 15:26:11 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 11680 invoked by uid 500); 3 Jul 2011 15:26:10 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 11673 invoked by uid 99); 3 Jul 2011 15:26:10 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 03 Jul 2011 15:26:10 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.212.48] (HELO mail-vw0-f48.google.com) (209.85.212.48) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 03 Jul 2011 15:26:05 +0000 Received: by vws7 with SMTP id 7so5475723vws.35 for ; Sun, 03 Jul 2011 08:25:44 -0700 (PDT) MIME-Version: 1.0 Received: by 10.52.178.165 with SMTP id cz5mr2214959vdc.164.1309706744500; Sun, 03 Jul 2011 08:25:44 -0700 (PDT) Received: by 10.52.157.226 with HTTP; Sun, 3 Jul 2011 08:25:44 -0700 (PDT) In-Reply-To: <1309526890.54413.YahooMailRC@web29020.mail.ird.yahoo.com> References: <1309526890.54413.YahooMailRC@web29020.mail.ird.yahoo.com> Date: Sun, 3 Jul 2011 11:25:44 -0400 Message-ID: Subject: Re: revisit naming for grouping/join? From: Michael McCandless To: dev@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable On Fri, Jul 1, 2011 at 9:28 AM, mark harwood wrot= e: >>> I think what would be best is a smallish but feature complete demo, > > For the nested stuff I had a reasonable demo on LUCENE-2454 that was base= d > around resumes - that use case has the one-to-many characteristics that l= ends > itself to nested e.g. a person has many different qualifications and reco= rds of > employment. > This scenario was illustrated > here: http://www.slideshare.net/MarkHarwood/proposal-for-nested-document-= support-in-lucene > > I also had the "book search" type scenario where a book has many sections= and > for the purposes of efficient highlighting/summarisation =A0these section= s were > treated as child docs which could be read quickly (rather than highlighti= ng a > whole book) I think both resumes and book search, and also others like the variants of a product SKU, would all make good examples for the nested docs use case. > I'm not sure what the "parent" was in your doctor and cities example, Mik= e. If a > doctor is in only one city then there is no point making city a child doc= as the > one city info can happily be combined with the doctor info into a single > document with no conflict (doctors have different properties to cities). > If the city is the parent with many child doctor docs that makes more sen= se but > feels like a less likely use case e.g. "find me a city with doctor x and = a > different doctor y" > Searching for a person with excellent java and prefrerably good lucene sk= ills > feels like a more real-world example. In my example the city was parent -- I raised this example to explain that index-time joining is more general than just nested docs (ie, I think we should keep the name "join" for this module... also because we should factor out more general search-time-only join capabilities into it). > It feels like documenting some of the trade-offs behind index design choi= ces is > useful too e.g. nesting is not too great for very volatile content with > constantly changing children while search-time join is more costly in RAM= and > 2-pass processing +1, especially once we've factored out generic joins. Mike --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org