From: "Mark Tucker"
To: "Lucene Developers List"
Subject: RE: Normalization
Date: Mon, 11 Mar 2002 14:28:58 -0700

You can't learn if you don't ask the question. Thanks for your response.

Mark

-----Original Message-----
From: Rodrigo Reyes [mailto:reyes@charabia.net]
Sent: Monday, March 11, 2002 2:26 PM
To: Lucene Developers List
Subject: Re: Normalization

Well, choosing XML for such a description language has the following drawbacks:

* hardly legible. Having one rule per line is really nice. I appreciated it when writing the French normalizer.
* it does not solve all the parsing problems.
  - either you have to specify everything as elements or attributes, and it's painful: er er
  - or you have to write a parser anyway to parse the content of the elements: [aeiou]r$
    and therefore write a parser for the content of the XML-parsed content.

Rodrigo

----- Original Message -----
From: "Mark Tucker"
To: "Lucene Developers List"
Sent: Monday, March 11, 2002 10:10 PM
Subject: RE: Normalization

> Why not use XML?
>
> There are some issues with the characters you use, but using XML might make it easier to extend.
>
> Mark