From: karl wettin
Subject: Re: near duplicates
Date: Tue, 17 Oct 2006 18:45:27 +0200
To: java-user@lucene.apache.org

On 17 Oct 2006, at 17:54, Find Me wrote:

> How to eliminate near duplicates from the index?

Oh, one more thing. You should probably look at the norms in order to
avoid comparing all documents to each other.
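
One way to read the norms hint, as a rough sketch rather than a tested recipe: Lucene stores one norm byte per document and field, combining length normalization with any boosts, so two near-duplicate documents will usually have the same or a very close norm. Grouping documents by that byte first means the expensive comparison only has to run inside each group. The class and method names below are hypothetical, and the sketch assumes the norms are already available as a byte[] indexed by document number (for example from IndexReader.norms(field)); it uses no Lucene classes itself.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not part of Lucene: buckets documents by norm byte
// so that only documents in the same bucket are compared to each other.
class NormBuckets {

    /** Groups document numbers by norm byte; norms[i] belongs to document i. */
    static Map<Byte, List<Integer>> groupByNorm(byte[] norms) {
        Map<Byte, List<Integer>> buckets = new TreeMap<>();
        for (int docId = 0; docId < norms.length; docId++) {
            buckets.computeIfAbsent(norms[docId], k -> new ArrayList<>()).add(docId);
        }
        return buckets;
    }

    /** Returns pairs {a, b} with a < b for documents that share a norm byte. */
    static List<int[]> candidatePairs(byte[] norms) {
        List<int[]> pairs = new ArrayList<>();
        for (List<Integer> bucket : groupByNorm(norms).values()) {
            for (int i = 0; i < bucket.size(); i++) {
                for (int j = i + 1; j < bucket.size(); j++) {
                    pairs.add(new int[] {bucket.get(i), bucket.get(j)});
                }
            }
        }
        return pairs;
    }
}
```

Each pair returned by candidatePairs would then go through whatever real similarity check you use. Documents whose lengths differ slightly can still end up with different norm bytes, so it may be worth comparing neighbouring buckets as well; that part is left out here.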