Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 89902 invoked from network); 19 Apr 2005 19:55:49 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 19 Apr 2005 19:55:49 -0000 Received: (qmail 7209 invoked by uid 500); 19 Apr 2005 19:55:44 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 7167 invoked by uid 500); 19 Apr 2005 19:55:43 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 7150 invoked by uid 99); 19 Apr 2005 19:55:43 -0000 X-ASF-Spam-Status: No, hits=0.1 required=10.0 tests=FORGED_RCVD_HELO X-Spam-Check-By: apache.org Received-SPF: neutral (hermes.apache.org: local policy) Received: from corp-206.dfki.uni-sb.de (HELO mail.dfki.de) (134.96.188.26) by apache.org (qpsmtpd/0.28) with ESMTP; Tue, 19 Apr 2005 12:55:42 -0700 Received: from mail.dfki.de (localhost.dfki.uni-sb.de [127.0.0.1]) by localhost (Postfix) with ESMTP id 49954E47F3 for ; Tue, 19 Apr 2005 21:55:39 +0200 (CEST) Received: from [IPv6:::1] (dave.dfki.uni-sb.de [134.96.184.220]) by mail.dfki.de (Postfix) with ESMTP id 25098E47F0 for ; Tue, 19 Apr 2005 21:55:39 +0200 (CEST) Mime-Version: 1.0 (Apple Message framework v622) Content-Transfer-Encoding: 7bit Message-Id: <4eff27aa24b467385fd37f17e300a286@activemath.org> Content-Type: text/plain; charset=US-ASCII; format=flowed To: java-user@lucene.apache.org From: Paul Libbrecht Subject: Passing XML objects to the analyzer ? Date: Tue, 19 Apr 2005 21:55:36 +0200 X-Mailer: Apple Mail (2.622) X-Virus-Checked: Checked X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Hi, I am working on an index to search XML data in a fixed format that I master well... The idea is that the XML content (which I have as JDOM object) actually carries the semantic which would be best converted directly into tokens by something like an analyzer. However, adding fields is done not using the result of the analysis (or a stream thereof) but using readers or strings. I have two choices and would like to know what's the best: - make the text passed to the analyzer a simple "instruction" which will fetch the XML objects and do the analysis there - make a pre-analysis step which converts it into tokens of text which then my analyzer catches again. I'd be more inclined for the first solution but I fear there's a catch. Is there one ? paul --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org