Return-Path: Delivered-To: apmail-xml-cocoon-users-archive@xml.apache.org Received: (qmail 14009 invoked by uid 500); 12 Jul 2002 15:48:11 -0000 Mailing-List: contact cocoon-users-help@xml.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: list-post: Reply-To: cocoon-users@xml.apache.org Delivered-To: mailing list cocoon-users@xml.apache.org Received: (qmail 13998 invoked from network); 12 Jul 2002 15:48:11 -0000 Message-ID: <20020712154814.95433.qmail@web20808.mail.yahoo.com> Date: Fri, 12 Jul 2002 08:48:14 -0700 (PDT) From: icewind Subject: Lucene, CocoonIndexer To: cocoon-users@xml.apache.org MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N I have created an index of some XML documents but I'm not thrilled with the way the index is built. Text appears to get indexed with the innermost XML tag it is found in. For example, if I had a fragment like the following: <person>Alice's</person> guide to the great novels of the <date>1800's</date> and I then used the following search term: "title:Alice" or "title:1800", I would not get a match. I would need to search for "person:Alice" or "date:1800" respectively. Since all the tags within the tag contain text that are clearly part of the title, I want a user who is searching through the collection to be able to do title specific searches that match any word within the title tag, regardless of whether it has other XML tags wrapped around it. Has anyone run into this issue? I'm not sure how to go about implementing what I want. Is this something I could do in Cocoon, or would I have to modify something in the LuceneXMLIndexer component? Suggestions appreciated. I imagine someone has run into this and has already come up with a workable solution. __________________________________________________ Do You Yahoo!? Sign up for SBC Yahoo! Dial - First Month Free http://sbc.yahoo.com --------------------------------------------------------------------- Please check that your question has not already been answered in the FAQ before posting. <http://xml.apache.org/cocoon/faq/index.html> To unsubscribe, e-mail: <cocoon-users-unsubscribe@xml.apache.org> For additional commands, e-mail: <cocoon-users-help@xml.apache.org>