Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9B9DCCCB1 for ; Thu, 4 Dec 2014 14:25:52 +0000 (UTC) Received: (qmail 4916 invoked by uid 500); 4 Dec 2014 14:25:51 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 4853 invoked by uid 500); 4 Dec 2014 14:25:51 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 4841 invoked by uid 99); 4 Dec 2014 14:25:50 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 04 Dec 2014 14:25:50 +0000 X-ASF-Spam-Status: No, hits=-2.3 required=5.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of prvs=408dd1ec2=ochrist@ebsco.com designates 63.164.11.93 as permitted sender) Received: from [63.164.11.93] (HELO smtpout.ebsco.com) (63.164.11.93) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 04 Dec 2014 14:25:46 +0000 X-IronPort-AV: E=Sophos;i="5.07,515,1413262800"; d="scan'208";a="52156383" Received: from isshqmxcasht01p.ebsco.com ([10.45.37.50]) by smtpout.ebsco.com with ESMTP/TLS/AES128-SHA; 04 Dec 2014 08:24:25 -0600 Received: from ISSHQMXDAG03P.ebsco.com ([fe80::e840:705f:ca5e:29a3]) by ISSHQMXCASHT01P.ebsco.com ([fe80::fcdf:3839:7fc9:abe%16]) with mapi id 14.03.0174.001; Thu, 4 Dec 2014 08:24:25 -0600 From: Oliver Christ To: "java-user@lucene.apache.org" , "paul_t100@fastmail.fm" Subject: RE: How best to compare tow sentences Thread-Topic: How best to compare tow sentences Thread-Index: AQHQDhywDOK5COSemEOXINK+OiopY5x+YA+AgAAImgCAABG+AIABANCg Date: Thu, 4 Dec 2014 14:24:19 +0000 Message-ID: References: <547D96B7.80205@fastmail.fm> <547F3014.8010200@fastmail.fm> In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.45.37.1] Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org Q29uY2VwdHVhbGx5IHRoaXMgdXNlIGNhc2UgaXMgc2ltaWxhciB0byB3aGF0IHRyYW5zbGF0aW9u IG1lbW9yaWVzIGRvLiANCg0KRm9yIGFuIG9wZW4tc291cmNlIFRNIGVuZ2luZSwgaGF2ZSBhIGxv b2sgYXQgaHR0cDovL29rYXBpLm9wZW50YWcuY29tLywgYW5kIGl0cyBkZWZhdWx0IFRNIGVuZ2lu ZSAoUGVuc2lldmUgVE0pLiANCg0KQ2hlZXJzLCBPbGkNCg0KLS0tLS1PcmlnaW5hbCBNZXNzYWdl LS0tLS0NCkZyb206IEJhcnJ5IENvdWdobGFuIFttYWlsdG86Yi5jb3VnaGxhbjJAZ21haWwuY29t XSANClNlbnQ6IFdlZG5lc2RheSwgRGVjZW1iZXIgMDMsIDIwMTQgMTE6NDkgQU0NClRvOiBqYXZh LXVzZXJAbHVjZW5lLmFwYWNoZS5vcmc7IHBhdWxfdDEwMEBmYXN0bWFpbC5mbQ0KU3ViamVjdDog UmU6IEhvdyBiZXN0IHRvIGNvbXBhcmUgdG93IHNlbnRlbmNlcw0KDQpUaGVyZSBhcmUgdmFyaW91 cyBpbXBsZW1lbnRhdGlvbnMgb2YgRGFtZXJhdS1MZXZlbnNodGVpbiBvbmxpbmUuIEkgZG9uJ3Qg a25vdyBob3cgbXVjaCBpdCB3aWxsIGltcHJvdmUgeW91ciByZXN1bHRzIGhvd2V2ZXIuDQoNCldo eSBhcmUgeW91IG5vdCBpbmRleGluZyBhbGwgb2YgdGhlIHN0cmluZ3M/IElmIHlvdSBkb24ndCBo YXZlIHRvIGNvbXB1dGUgYWxsIHBvc3NpYmxlIHBhaXJzLCB0aGVuIHlvdSBhcmUgYmV0dGVyIG9m ZiB3aXRob3V0IEx1Y2VuZS4NCg0KTm90ZSB0aGF0IHRoZSBjb3NpbmUgc2ltaWxhcml0eSBjYWxj dWxhdGlvbiB0aGF0IEx1Y2VuZSBwZXJmb3JtcyBpcyBiYXNlZCBvbiBURi1JREYgdmFsdWVzIChk b2N1bWVudGVkIGhlcmU6DQpodHRwOi8vbHVjZW5lLmFwYWNoZS5vcmcvY29yZS80XzEwXzIvY29y ZS9vcmcvYXBhY2hlL2x1Y2VuZS9zZWFyY2gvc2ltaWxhcml0aWVzL1RGSURGU2ltaWxhcml0eS5o dG1sKS4NCkZvciB2ZXJ5IHNob3J0IHN0cmluZ3MgVEYtSURGIGlzIHZlcnkgbm9pc3kuIEl0IGRv ZXNuJ3QgbWFrZSBhIHdob2xlIGxvdCBvZiBzZW5zZSB3aGVuIHlvdSBvbmx5IGhhdmUgMiBkb2N1 bWVudHMuIEkgc3VzcGVjdCB0aGF0IHRoZSBzaW1tZXRyaWNzIENvc2luZVNpbWlsYXJpdHkgYW5k IEphY2NhcmRTaW1pbGFyaXR5IHdpbGwgZ2l2ZSB5b3UgYmV0dGVyIHJlc3VsdHMuLg0KDQpNeSBz dWdnZXN0aW9uIGlzIHRvIGZpcnN0IGNvcnJlY3QgZm9yIHR5cG9zIGJ5IGNhbGN1bGF0aW5nIHRo ZSBMZXZlbnNodGVpbiBEaXN0YW5jZSBmb3IgZWFjaCB3b3JkIGluIHNvbmd0aXRsZTEgYWdhaW5z dCBlYWNoIHdvcmQgaW4gc29uZ3RpdGxlMiwgYW5kIGFzc3VtZSB0aGF0IHdvcmRzIGFyZSB0aGUg c2FtZSBpZiB0aGUgZGlzdGFuY2UgaXMgbGVzcyB0aGFuIDIuIFRoZW4gdXNlIEphY2NhcmRTaW1p bGFyaXR5IG9yIENvc2luZVNpbWlsYXJpdHkgdG8gY2FsY3VsYXRlIHRoZSBzaW1pbGFyaXR5Lg0K DQpCYXJyeQ0KDQoNCg0KT24gV2VkLCBEZWMgMywgMjAxNCBhdCAzOjQ1IFBNLCBQYXVsIFRheWxv ciA8cGF1bF90MTAwQGZhc3RtYWlsLmZtPiB3cm90ZToNCg0KPiBPbiAwMy8xMi8yMDE0IDE1OjE0 LCBCYXJyeSBDb3VnaGxhbiB3cm90ZToNCj4NCj4+IEhpIFBhdWwsDQo+Pg0KPj4gSSBkb24ndCBo YXZlIG11Y2ggZXhwZXJ0aXNlIGluIHRoaXMgYXJlYSBzbyBob3BlZnVsbHkgb3RoZXJzIHdpbGwg DQo+PiBhbnN3ZXIsIGJ1dCBtYXliZSB0aGlzIGlzIGJldHRlciB0aGFuIG5vdGhpbmcuDQo+Pg0K Pj4gSSBkb24ndCBrbm93IG1hbnkgb3V0LW9mLXRoZS1ib3ggc29sdXRpb25zIGZvciB0aGlzIHBy b2JsZW0sIGJ1dCBJJ20gDQo+PiBzdXJlIHRoZXkgZXhpc3QuIE1haG91dCBhbmQgQ2Fycm90MiBt aWdodCBiZSB3b3J0aCBpbnZlc3RpZ2F0aW5nLg0KPj4NCj4+IFNpbWlsYXJpdHkgTWV0cmljczoN Cj4+IC0gSmFjY2FyZCBJbmRleC4gTWVhc3VyZXMgc2ltaWxhcml0eSBiZXR3ZWVuIHR3byBzZXRz LCBzbyB3b3JkIG9yZGVyIA0KPj4gaXMgbm90IGltcG9ydGFudC4NCj4+IC0gTGV2ZW5zaHRlaW4g ZGlzdGFuY2UuIElmIHlvdSBhcmUgZGVhbGluZyB3aXRoIHVzZXItaW5wdXR0ZWQgdHlwb3MsIA0K Pj4gdGhlIERhbWVyYXXigJNMZXZlbnNodGVpbiBkaXN0YW5jZSBjYW4gcGVyZm9ybSBhIGJpdCBi ZXR0ZXIgYmVjYXVzZSBpdCANCj4+IHRha2VzIGludG8gYWNjb3VudCBzd2FwcGluZyBhZGphY2Vu dCBsZXR0ZXJzIChlLmcuIHRlaCAtPiB0aGUpLg0KPj4NCj4+IEkgd29ya2VkIHdpdGggc29tZSBj b2RlIHRoYXQgZGlkIHRoaXMgZm9yIGF1dGhvciBuYW1lcyBlLmcuIG1lcmdlIA0KPj4gIkJhcmFj ayBPYmFtYSIsICJPYmFtYSBCLiIgYW5kICJCLiBILiBPYmFtYSIuIEl0IHVzZWQgYSBjb21iaW5h dGlvbiANCj4+IG9mIERhbWVyYXXigJNMZXZlbnNodGVpbiBkaXN0YW5jZSBhbmQgSmFjY2FyZCBp bmRleC4gSXQgd29ya2VkIHZlcnkgDQo+PiB3ZWxsIGZvciB0aGlzIHByb2JsZW0sIGJ1dCB1bmZv cnR1bmF0ZWx5IHRoZSBjb2RlIHdhcyBzcGFyc2Ugb24gDQo+PiBkb2N1bWVudGF0aW9uIGFuZCBm dWxsIG9mIG1hZ2ljIG51bWJlcnMgc28gSSBkb24ndCBrbm93IHRoZSBkZXRhaWxzLiANCj4+IFRo ZSBhcHByb2FjaCB3YXMgc2ltaWxhciB0byB0aGUgYXBwcm9hY2ggZGVzY3JpYmVkIGluIHRoaXMg YW5zd2VyOiANCj4+IGh0dHA6Ly9zdGFja292ZXJmbG93LmNvbS9hLw0KPj4gMTE5MjA4NjcvMjgx NDY5DQo+Pg0KPj4gVGhpcyBpcyBhbiBPKG5eMikgcGFpcndpc2UgY29tcGFyaXNvbiBwcm9ibGVt LiBBcyB5b3VyIGRhdGEgZ2V0cyANCj4+IGJpZ2dlciB5b3UgaGF2ZSB0byB3b3JrIGFyb3VuZCB0 aGlzIGxpbWl0YXRpb24uIFRoaXMgcHJvYmxlbSBpcyBrbm93biANCj4+IGluIHJlc2VhcmNoIGxp dGVyYXR1cmUgYXMgdGhlICJhbGwtcGFpcnMiIHNpbWlsYXJpdHkgcHJvYmxlbS4gVGhlIA0KPj4g cGFwZXIgbGlua2VkIGZyb20gdGhpcyByZXBvc2l0b3J5IGlzIGEgZ29vZCByZWFkIG9uIHRoZSBz dWJqZWN0OiANCj4+IGh0dHBzOi8vY29kZS5nb29nbGUuY29tL3AvIGdvb2dsZS1hbGwtcGFpcnMt c2ltaWxhcml0eS1zZWFyY2gvDQo+Pg0KPj4gT25lIG9mIHRoZSB3YXlzIHlvdSBjYW4gd29yayBh cm91bmQgdGhpcyBpcyBieSB1c2luZyBMdWNlbmUgdG8gbGltaXQgDQo+PiB0aGUgYW1vdW50IG9m IGNvbXBhcmlzb25zIHlvdSBuZWVkIHRvIGRvOg0KPj4gLSBJbmRleCB5b3VyIGRvY3VtZW50cy4N Cj4+IC0gRm9yIGVhY2ggc29uZyB0aXRsZSBkbyBhIGZ1enp5IHNlYXJjaCBvZiB0aGUgd29yZHMu DQo+PiAtIFRha2UgdGhlIHRvcCBOIHJlc3VsdHMsIGNhbGN1bGF0ZSB0aGVpciBzaW1pbGFyaXR5 IHdpdGggdGhlIHNvbmcgDQo+PiB0aXRsZSB1c2luZyB0aGUgbWV0cmljcyAob3IganVzdCB1c2Ug dGhlIEx1Y2VuZSBzY29yZSkuDQo+PiAtIENsdXN0ZXIgc2ltaWxhciB0aXRsZXMgYnkgc29uZyB0 aXRsZS4NCj4+DQo+PiBUaGlzIGlzIGJhc2ljYWxseSBjcmVhdGluZyBhIHNwYXJzZSBpbnZlcnRl ZCBpbmRleCBvZiB5b3VyIGRvY3VtZW50IA0KPj4gbWF0cml4LCBzbyB0aGF0IHlvdSBjYW4gZmlu ZCByZXN1bHRzIHRoYXQgd2lsbCBwcm9kdWNlIG5vbi16ZXJvIA0KPj4gc2ltaWxhcml0aWVzLiBU aGlzIGlzIHRoZSBtb3N0IGVmZmVjdGl2ZSB3YXkgb2Ygb3B0aW1pemluZyANCj4+IHBlcmZvcm1h bmNlIHRoYXQgSSBoYXZlIGVuY291bnRlcmVkLg0KPj4NCj4+IEFnYWluLCBJJ20gc3VyZSB0aGVy ZSBhcmUgb3V0LW9mLXRoZS1ib3ggc29sdXRpb25zIGZvciB0aGlzIHByb2JsZW0sIA0KPj4gYnV0 IEkgZG9uJ3Qga25vdyB3aGF0IHRoZXkgYXJlLg0KPj4NCj4+IEhvcGUgdGhhdCBoZWxwcywNCj4+ IEJhcnJ5DQo+Pg0KPj4gIFRoYW5reW91IGJhcnJ5IEkgd2lsIHNwZW5kIHNvbWUgdGltZSBnb2lu ZyB0aHJvdWdoIHlvdXIgc3VnZ2VzdGlvbnMsIA0KPj4gaW4NCj4gdGhlIGxpYnJhcnkgSW0gbG9v a2luZyBhdCBJIGRvbnQgc2VlbSB0byBoYXZlIERhbWVyYXXigJNMZXZlbnNodGVpbiBidXQgDQo+ IEkgZG8gaGF2ZSBqYWNjYXJkU2ltaWxhcml0eSBzbyBpZiB0aGF0IHVuZGVyc3RhbmRzIHdvcmRz IElsbCB3aWxsIGdpdmUgaXQgYSB0cnkuDQo+DQo+IHxCbG9ja0Rpc3RhbmNlDQo+IENoYXBtYW5M ZW5ndGhEZXZpYXRpb24NCj4gQ2hhcG1hbk1hdGNoaW5nU291bmRleA0KPiBDaGFwbWFuTWVhbkxl bmd0aA0KPiBDaGFwbWFuT3JkZXJlZE5hbWVDb21wb3VuZFNpbWlsYXJpdHkNCj4gQ29zaW5lU2lt aWxhcml0eQ0KPiBEaWNlU2ltaWxhcml0eQ0KPiBFdWNsaWRlYW5EaXN0YW5jZQ0KPiBJbnRlcmZh Y2VTdHJpbmdNZXRyaWMNCj4gSmFjY2FyZFNpbWlsYXJpdHkNCj4gSmFybw0KPiBKYXJvV2lua2xl cg0KPiBMZXZlbnNodGVpbg0KPiBNYXRjaGluZ0NvZWZmaWNpZW50DQo+IE1vbmdlRWxrYW4NCj4g TmVlZGxlbWFuV3VuY2gNCj4gT3ZlcmxhcENvZWZmaWNpZW50DQo+IFFHcmFtc0Rpc3RhbmNlDQo+ IFNtaXRoV2F0ZXJtYW4NCj4gU21pdGhXYXRlcm1hbkdvdG9oDQo+IFNtaXRoV2F0ZXJtYW5Hb3Rv aFdpbmRvd2VkQWZmaW5lDQo+IFNvdW5kZXgNCj4gfA0KPiBPbmUgdGhpbmdzLCByZWdhcmlkbmcg eW91ciBsdWNlbmUgYmFzZWQgc29sdXRpb24gSSB0aGluayB5b3UgaGF2ZSANCj4gbWlzc2VkIGFu IGltcG9ydGFudCBwb2ludC4gSSBhbSBvbmx5IGNvbXBhcmluZyBUV08gc3RyaW5ncyBhdCBhbnkg DQo+IHRpbWUsIEkgZG9udCBoYXZlIGEgZGF0YXNldCBvZiB0aG91c2FuZHMgb2Ygc2VudGVuY2Vz IHRoYXQgSSB3YW50IHRvIA0KPiBjb21wYXJlIG92ZXIgdGltZSBJIGxpdGVyYWxseSBoYXZlIHN0 cmluZyBhIGFuZCBzdHJpbmcgYiBhbmQgSSBqdXN0IA0KPiB3YW50IHRvIGNvbXBhcmUgdGhvc2Us IGF0IGEgbGF0ZXIgZGF0ZSBJbGwgaGF2ZSBzdHJpbmcgYyBhbmQgZCwgYnV0IGF0IA0KPiBubyBw b2ludCBkbyBJIGhhdmUgc3RyaW5ncyBhLGIsYyxkLiBJJ20gbm90IHRyeWluZyB0byBmaW5kIHRo ZSBiZXN0ICANCj4gbWF0Y2hpbmcgc3RyaW5nIGZvciBhIHNpbmdsZSB0aXRsZSBqdXN0IGlzIHRo aXMgU3RyaW5nIGEgZ29vZCBtYXRjaCBmb3IgdGhpcyBzb25nIHRpdGxlLg0KPg0KPiBQYXVsDQo+ DQo= DQotLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0t LS0tLS0tLS0tLS0tLS0tLS0NClRvIHVuc3Vic2NyaWJlLCBlLW1haWw6IGphdmEtdXNlci11 bnN1YnNjcmliZUBsdWNlbmUuYXBhY2hlLm9yZw0KRm9yIGFkZGl0aW9uYWwgY29tbWFuZHMs IGUtbWFpbDogamF2YS11c2VyLWhlbHBAbHVjZW5lLmFwYWNoZS5vcmcNCg0K