From: "David Smiley (@MITRE.org)"
To: solr-dev@lucene.apache.org
Date: Tue, 8 Feb 2011 06:12:05 -0800 (PST)
Message-ID: <1297174325750-2451504.post@n3.nabble.com>
In-Reply-To: <2D127F11DC79714E9B6A43AC9458147F9C1437FD@suex07-mbx-03.ad.syr.edu>
Subject: RE: Should ASCIIFoldingFilter be deprecated?

Chris Hostetter-3 wrote:
>
> CharFilters and TokenFilters have different purposes though...
>
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#When_To_use_a_CharFilter_vs_a_TokenFilter
>
> (ie: If you use MappingCharFilter, you can't then tokenize on some of the
> characters you filtered away)
>

Right, but it’s hard to imagine wanting to tokenize on an accent character or some other modification specified in these particular mapping files.

Steven A Rowe wrote:
>
> AFAIK, ISOLatin1AccentFilter was deprecated because ASCIIFoldingFilter
> provides a superset of it mappings.
>

*If* that is the case, then this file should also be removed:
solr/example/solr/conf/mapping-ISOLatin1Accent.txt

Steven A Rowe wrote:
>
> I haven't done any benchmarking, but I'm pretty sure that
> ASCIIFoldingFilter can achieve a significantly higher throughput rate than
> MappingCharFilter, and given that, it probably makes sense to keep both,
> to allow people to make the choice about the tradeoff between the
> flexibility provided by the human-readable (and editable) mapping file and
> the speed provided by ASCIIFoldingFilter.
>

I'm skeptical that the difference, whatever it turns out to be, matters in the scheme of things. The cost of keeping ASCIIFoldingFilter is confusion for users and more code to maintain.

~ David Smiley

-----
Author: https://www.packtpub.com/solr-1-4-enterprise-search-server/book
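
To make the ordering point from the quoted wiki note concrete, here is a toy sketch in plain Python. It is not Lucene or Solr code: the tokenizer, the mapping tables, and the function names are all hypothetical stand-ins, chosen only to show why mapping characters before tokenizing can differ from mapping them afterwards, and why the two agree when the mapping only touches accents.

```python
import re

# Hypothetical mapping tables for illustration only; these are NOT the
# contents of any real Solr mapping file.
ACCENT_MAP = {"é": "e", "ü": "u"}
ACCENT_AND_SLASH_MAP = {**ACCENT_MAP, "/": ""}


def tokenize(text):
    """Toy tokenizer: split on whitespace and '/'."""
    return [tok for tok in re.split(r"[\s/]+", text) if tok]


def apply_mapping(text, mapping):
    """Replace each single character found in `mapping` with its target."""
    return text.translate(str.maketrans(mapping))


def char_filter_pipeline(text, mapping):
    """Map characters first, then tokenize (CharFilter-style ordering)."""
    return tokenize(apply_mapping(text, mapping))


def token_filter_pipeline(text, mapping):
    """Tokenize first, then map within each token (TokenFilter-style ordering)."""
    return [apply_mapping(tok, mapping) for tok in tokenize(text)]


if __name__ == "__main__":
    text = "café/bar"
    # Accent-only mapping: both orderings produce the same tokens.
    print(char_filter_pipeline(text, ACCENT_MAP))
    print(token_filter_pipeline(text, ACCENT_MAP))
    # Mapping that also removes '/': the char-filter ordering loses the
    # split point, so the tokenizer never sees it.
    print(char_filter_pipeline(text, ACCENT_AND_SLASH_MAP))
    print(token_filter_pipeline(text, ACCENT_AND_SLASH_MAP))
```

With an accent-only table the two orderings agree, which is the situation argued for above; they diverge only once the mapping removes a character the tokenizer would otherwise split on.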