Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 30371 invoked from network); 23 Jun 2010 18:22:08 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 23 Jun 2010 18:22:08 -0000 Received: (qmail 22098 invoked by uid 500); 23 Jun 2010 18:22:06 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 22042 invoked by uid 500); 23 Jun 2010 18:22:05 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 22033 invoked by uid 99); 23 Jun 2010 18:22:05 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Jun 2010 18:22:05 +0000 X-ASF-Spam-Status: No, hits=-2.6 required=10.0 tests=AWL,RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [128.230.18.92] (HELO smtp2.syr.edu) (128.230.18.92) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Jun 2010 18:21:56 +0000 Received: from suex07-hub-01.ad.syr.edu (suex07-hub-01.ad.syr.edu [128.230.108.195]) by smtp2.syr.edu (8.14.3/8.14.3) with ESMTP id o5NILZmZ013208 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=FAIL) for ; Wed, 23 Jun 2010 14:21:35 -0400 Received: from suex07-mbx-03.ad.syr.edu ([128.230.108.133]) by suex07-hub-01.ad.syr.edu ([2002:80e6:6cc3::80e6:6cc3]) with mapi; Wed, 23 Jun 2010 14:21:35 -0400 From: Steven A Rowe To: "java-user@lucene.apache.org" Date: Wed, 23 Jun 2010 14:21:13 -0400 Subject: RE: URL Tokenization Thread-Topic: URL Tokenization Thread-Index: AcsS/uRJWVEI/dwKSDCdMazIeQBmUgAAT4Wg Message-ID: <2D127F11DC79714E9B6A43AC9458147F764DA5D5@suex07-mbx-03.ad.syr.edu> References: In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 MIME-Version: 1.0 X-Proofpoint-Virus-Version: vendor=fsecure engine=1.12.8161:2.4.5,1.2.40,4.0.166 definitions=2010-06-23_02:2010-02-06,2010-06-23,2010-06-23 signatures=0 X-Proofpoint-Spam-Reason: safe SGkgU3VkaGEsDQoNClRoZXJlIGlzIHN1Y2ggYSB0b2tlbml6ZXIsIG5hbWVkIE5ld1N0YW5kYXJk VG9rZW5pemVyLCBpbiB0aGUgbW9zdCByZWNlbnQgcGF0Y2ggb24gdGhlIGZvbGxvd2luZyBKSVJB IGlzc3VlOiANCg0KICAgaHR0cHM6Ly9pc3N1ZXMuYXBhY2hlLm9yZy9qaXJhL2Jyb3dzZS9MVUNF TkUtMjE2Nw0KDQpJdCBrZWVwcyAoSFRUUChTKSwgRlRQLCBhbmQgRklMRSkgVVJMcyB0b2dldGhl ciBhcyBzaW5nbGUgdG9rZW5zLCBhbmQgZS1tYWlscyB0b28sIGluIGFjY29yZGFuY2Ugd2l0aCB0 aGUgcmVsZXZhbnQgSUVURiBSRkNzLg0KDQpTdGV2ZQ0KDQo+IC0tLS0tT3JpZ2luYWwgTWVzc2Fn ZS0tLS0tDQo+IEZyb206IFN1ZGhhIFZlcm1hIFttYWlsdG86dmVybWEuc3VkaGFAZ21haWwuY29t XQ0KPiBTZW50OiBXZWRuZXNkYXksIEp1bmUgMjMsIDIwMTAgMjowNyBQTQ0KPiBUbzogamF2YS11 c2VyQGx1Y2VuZS5hcGFjaGUub3JnDQo+IFN1YmplY3Q6IFVSTCBUb2tlbml6YXRpb24NCj4gDQo+ IEhpLA0KPiANCj4gSSBhbSBuZXcgdG8gbHVjZW5lIGFuZCBJIGFtIHVzaW5nIEx1Y2VuZSAzLjAu Mi4NCj4gDQo+IEkgYW0gdXNpbmcgTHVjZW5lIHRvIHBhcnNlIHRleHQgd2hpY2ggbWF5IGNvbnRh aW4gVVJMcy4gSSBub3RpY2VkIHRoZQ0KPiBTdGFuZGFyZFRva2VuaXplciBrZWVwcyB0aGUgZW1h aWwgYWRkcmVzc2VzIGluIG9uZSB0b2tlbiwgYnV0IG5vdCB0aGUNCj4gVVJMcy4NCj4gSSBhbHNv IGxvb2tlZCBhdCBTb2xyIHdpa2kgcGFnZXMsIGFuZCBldmVuIHRob3VnaCB0aGUgd2lraSBwYWdl IGZvcg0KPiBzb2xyLlN0YW5kYXJkVG9rZW5pemVyRmFjdG9yeSBzYXlzIGl0IGtlZXBzIHRyYWNr IG9mIHRoZSBVUkwgdG9rZW4gdHlwZSAtDQo+IGl0IGRvZXMgbm90IHNlZW0gdG8gYmUgdGhlIGNh c2UuDQo+IA0KPiBJcyB0aGVyZSBhbiBBbmFseXplciBpbXBsZW1lbnRhdGlvbiB0aGF0IGNhbiBr ZWVwIHRoZSBVUkxzIGludGFjdCBpbnRvIG9uZQ0KPiB0b2tlbj8gb3IgZG9lcyBhbnlvbmUgaGF2 ZSBhbiBleGFtcGxlIG9mIHRoYXQgZm9yIFNvbHIgb3IgTHVjZW5lPw0KPiANCj4gVGhhbmtzIG11 Y2gsDQo+IFN1ZGhhDQo=