Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 80833 invoked from network); 22 Apr 2005 10:19:30 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 22 Apr 2005 10:19:30 -0000 Received: (qmail 43995 invoked by uid 500); 22 Apr 2005 10:19:41 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 43872 invoked by uid 500); 22 Apr 2005 10:19:40 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 43855 invoked by uid 99); 22 Apr 2005 10:19:40 -0000 X-ASF-Spam-Status: No, hits=0.1 required=10.0 tests=FORGED_RCVD_HELO X-Spam-Check-By: apache.org Received-SPF: neutral (hermes.apache.org: local policy) Received: from corp-206.dfki.uni-sb.de (HELO mail.dfki.de) (134.96.188.26) by apache.org (qpsmtpd/0.28) with ESMTP; Fri, 22 Apr 2005 03:19:39 -0700 Received: from mail.dfki.de (localhost.dfki.uni-sb.de [127.0.0.1]) by localhost (Postfix) with ESMTP id D708BE47CB for ; Fri, 22 Apr 2005 12:19:14 +0200 (CEST) Received: from [IPv6:::1] (dave.dfki.uni-sb.de [134.96.184.220]) by mail.dfki.de (Postfix) with ESMTP id BE1BDE47C4 for ; Fri, 22 Apr 2005 12:19:14 +0200 (CEST) Mime-Version: 1.0 (Apple Message framework v622) In-Reply-To: <4268A980.3050807@culture.gouv.fr> References: <042220050249.13376.426866200006966E000034402200762194900A0B020E089B0A@comcast.net> <4268A980.3050807@culture.gouv.fr> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Message-Id: Content-Transfer-Encoding: quoted-printable From: Paul Libbrecht Subject: Re: token type question Date: Fri, 22 Apr 2005 12:19:13 +0200 To: java-user@lucene.apache.org X-Mailer: Apple Mail (2.622) X-Virus-Checked: Checked X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Le 22 avr. 05, =E0 09:36, Pierrick Brihaye a =E9crit : >> Are you say that I should construct Token in analyzer like >> new Token ("chem_H2O", 100, 103, "chem"); >> note that chem_ is added prefix to H2O, and 100 to 103 is length of=20= >> H2O rather than chem_H2O? > Well... 100 to 103 are offsets provided by the reader (an are thus=20 > usually offsets in the source file). These offsets may help you to=20 > make some computations but they are lost when the token is indexed. ?? not in all situations, or ?? You couldn't render it back otherwise=20 as is done search-result highlighting. One thing that tackles me is how much this parameter could, again, be=20 something different... In particular, I'd much prefer to have it a tree-path instead of a=20 plain number. I don't have reader plain numbers and they are, often,=20 lost in an XML content-base. hopeable ? thanks paul= --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org