Subject: Re: Broder or other near-duplicate algorithms?
From: Mark Kerzner
To: core-user@hadoop.apache.org
Cc: "rafan@yahoo-inc.com"
Date: Tue, 24 Mar 2009 13:23:39 -0500
In-Reply-To: <49C8F38D.4020305@yahoo-inc.com>

Yi-Kai,

that's good to know - and I have read this article - but is your code
available?

Thank you,
Mark

On Tue, Mar 24, 2009 at 9:51 AM, Yi-Kai Tsai wrote:

> hi Mark
>
> we had done something on top of hadoop/hbase (mapreduce for evaluation,
> hbase for online serving)
> by reference http://www2007.org/papers/paper215.pdf
>
>> Hi,
>>
>> does anybody know of an open-source implementation of the Broder
>> algorithm in Hadoop?
>> Monika Henzinger reports having done so
>> in MapReduce, and I wonder if somebody has repeated her work in open
>> source?
>>
>> I am going to do this if there is no implementation yet, and then I will
>> ask what I can do with the code.
>>
>> Cheers,
>> Mark
>
> --
> Yi-Kai Tsai (cuma), Asia Search Engineering.