Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 15542 invoked from network); 1 Apr 2004 19:56:05 -0000 Received: from daedalus.apache.org (HELO mail.apache.org) (208.185.179.12) by minotaur-2.apache.org with SMTP; 1 Apr 2004 19:56:05 -0000 Received: (qmail 30201 invoked by uid 500); 1 Apr 2004 19:55:50 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 30178 invoked by uid 500); 1 Apr 2004 19:55:50 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 30157 invoked from network); 1 Apr 2004 19:55:50 -0000 Received: from unknown (HELO avas6.telusquebec.local) (142.169.1.121) by daedalus.apache.org with SMTP; 1 Apr 2004 19:55:50 -0000 Received: from nodnsquery(192.168.250.8) by avas6.telusquebec.local via csmap id 78838b9a_8416_11d8_8f1b_0002b3e6f1b0_9744; Thu, 01 Apr 2004 14:55:05 -0500 (EST) Received: from avas6.telusquebec.local (foo.nstein.com [206.162.161.114]) by smtp1.globetrotter.net (iPlanet Messaging Server 5.2) with ESMTPA id <0HVI005KFCP5ST@"TELUS Quebec"> for lucene-user@jakarta.apache.org; Thu, 01 Apr 2004 14:55:54 -0500 (EST) Received: from foo.nstein.com(206.162.161.114) by avas6.telusquebec.local via csmap id 78057b4c_8416_11d8_8636_0002b3e6f1b0_9736; Thu, 01 Apr 2004 14:55:04 -0500 (EST) Date: Thu, 01 Apr 2004 14:51:15 -0500 From: Tate Avery Subject: RE: Nested category strategy In-reply-to: <92AB0CC9-8415-11D8-9BCB-000A95C8F324@apple.com> To: 'Lucene Users List' Reply-to: tate.avery@nstein.com Message-id: <000e01c41822$b1fae0e0$4c0012ac@nstein.com> MIME-version: 1.0 X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2800.1165 X-Mailer: Microsoft Outlook CWS, Build 9.0.6604 (9.0.2911.0) Content-type: text/plain; charset=us-ascii Content-transfer-encoding: 7BIT Importance: Normal X-Priority: 3 (Normal) X-MSMail-priority: Normal X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Could you put them all into a tab-delimited string and store that as a single field, then use a TabTokenizer on the field to search? And, if you need to, do a .split("\t") on the field value in order to break them back up into individual categories. -----Original Message----- From: David Black [mailto:black@apple.com] Sent: Thursday, April 01, 2004 2:49 PM To: lucene-user@jakarta.apache.org Subject: Nested category strategy Hey All, I'm trying to figure out the best approach to something. Each document I index has an array of categories which looks like the following example.... /Science/Medicine/Serology/blood gas /Biology/Fluids/Blood/ etc. Anyway, there's a couple things I'm trying to deal with. 1. The fact that we have an undefined array size. I can't just shove these into a single field. I could explode them into multiple fields on the fly like category_1, category_2. etc. etc 2. The fact that a search will need to be performed like " category: /Science/Medicine/*" would need to return all items within that category. Thanks in advance to anyone who can give me some help here. Thanks --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org