From: "Erick Erickson"
To: java-user@lucene.apache.org
Date: Wed, 16 May 2007 09:50:55 -0400
Subject: Re: Concept Search
Message-ID: <359a92830705160650j36688fd7g5d13d5e5b8296695@mail.gmail.com>
In-Reply-To: <464B040B.3020202@syr.edu>
References: <648393.53019.qm@web80203.mail.mud.yahoo.com> <464B040B.3020202@syr.edu>

That's interesting. I suppose you could add the "synonym" WildAnimals$
whenever you encounter any of the items in your list; then, when concept
searching is called for, search on WildAnimals$. Highlighting might be
tricky, but it is certainly doable, especially with the capabilities of a
MemoryIndex.

Erick

On 5/16/07, Steven Rowe wrote:
>
> Hi Charles,
>
> The need presented by your use case sounds very similar to that served
> by the SynonymAnalyzer given in Erik Hatcher's and Otis Gospodnetic's
> excellent book "Lucene in Action" - take a look:
>
> http://lucenebook.com/
>
> Steve
>
> Charles Patridge wrote:
> > I have looked around on the Lucene web site as well as some documentation
> > but have not found anything to do with Concept Search.
> >
> > My definition of Concept Search is as follows:
> >
> > 1. I would have a file (list) of various phrases / N-grams which I
> > would like Lucene to use as a search basis without having to type in
> > all these phrases manually, and have Lucene return the results as it
> > would normally if a single search query was entered.
> >
> > 2. An example would be - find !Wild_Animals! - where the "!" would
> > indicate that this is a search that would use a file (i.e.
> > Wild_Animals.txt) and read in the various phrases within this file and
> > perform the search in the corpus for these phrases.
> >
> > 3. The contents of Wild_Animals.txt could look like this:
> > BUFFALO
> > BEAR
> > MOOSE
> > COYOTE
> > WOLF
> > MOUNTAIN GOAT
> > MOUNTAIN SHEEP
> > DALL SHEEP
> > DEER
> > KODIAK BEAR
> > BROWN BEAR
> > BLACK BEAR
> > etc etc etc
> >
> > 4. Is my idea of a Concept Search feasible / doable? If so, can
> > you point me to any documentation that exists whereby this could be done
> > within Lucene?
> >
> > Please send any info you have on this to me -
> > Charles_S_Patridge@prodigy.net
> >
> > Thank you in advance for your time and efforts.
> >
> > Charles S Patridge - PDPC, Ltd.
> > 172 Monce Road - Burlington, CT 06013 USA
> > Email: Charles_S_Patridge@prodigy.net
> > Web: http://www.sconsig.com
> > Web: http://pages.prodigy.net/charles_s_patridge
> > Web: http://www.munic.state.ct.us/burlington
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
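
P.S. To make the WildAnimals$ idea a bit more concrete, here is a rough sketch in plain Java. It is not Lucene code and uses no Lucene API: the class name ConceptTagger and the method tag are made up for illustration, and the phrase list is a hard-coded subset of the Wild_Animals.txt example rather than something read from a file. In a real index I would expect the same logic to live in a custom TokenFilter (along the lines of the SynonymAnalyzer in "Lucene in Action") that emits the marker at the same position as the matched word, but treat that as an assumption to verify against the Lucene version you use.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Lucene: injects a concept marker token
// into a token list wherever a phrase from the concept list begins.
public class ConceptTagger {

    // Returns the lowercased words of text, with the concept token added
    // right after each word at which one of the phrases starts.
    public static List<String> tag(String text, List<String> phrases, String concept) {
        // Split the text into lowercase words, dropping empty fragments.
        List<String> words = new ArrayList<>();
        for (String w : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!w.isEmpty()) {
                words.add(w);
            }
        }

        // Pre-split each phrase into its lowercase words.
        List<String[]> split = new ArrayList<>();
        for (String p : phrases) {
            split.add(p.trim().toLowerCase(Locale.ROOT).split("\\s+"));
        }

        List<String> out = new ArrayList<>();
        for (int i = 0; i < words.size(); i++) {
            out.add(words.get(i));
            // Add at most one marker per position, even if several phrases start here.
            for (String[] phrase : split) {
                if (matchesAt(words, i, phrase)) {
                    out.add(concept);
                    break;
                }
            }
        }
        return out;
    }

    // True if every word of phrase appears consecutively in words, starting at start.
    private static boolean matchesAt(List<String> words, int start, String[] phrase) {
        if (start + phrase.length > words.size()) {
            return false;
        }
        for (int k = 0; k < phrase.length; k++) {
            if (!words.get(start + k).equals(phrase[k])) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Illustrative subset of the Wild_Animals.txt list from the thread.
        List<String> phrases = Arrays.asList("BEAR", "BROWN BEAR", "MOUNTAIN GOAT", "WOLF");
        System.out.println(tag("A brown bear crossed the road", phrases, "WildAnimals$"));
        // prints [a, brown, WildAnimals$, bear, WildAnimals$, crossed, the, road]
    }
}
```

Note that "brown bear" produces two markers here, one where BROWN BEAR starts and one where BEAR starts. That is harmless for a plain concept query on WildAnimals$, but it is the kind of overlap that would make highlighting fiddly.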