From: Ted Dunning
Date: Sat, 30 Mar 2013 07:31:23 +0100
Subject: Re: Bloom Filter analogy in SQL
To: "common-user@hadoop.apache.org", Sai Sai

This isn't really a Hadoop question.

A Bloom filter is a very low-level data structure that doesn't really have any correlate in SQL. It lets you check quickly and probabilistically whether an item has been seen before, which is how you find duplicates. In return for a small probability of a false positive, it uses much less memory than storing the items themselves.
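To make the trade-off concrete, here is a minimal sketch in Python. It is only an illustration, not how any particular library does it: the class name `BloomFilter`, the helper `find_probable_duplicates`, the default sizes, and the use of salted SHA-256 for the hash functions are all assumptions made for this example.

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter: a bit array plus k salted hashes."""

    def __init__(self, num_bits=1024, num_hashes=3):
        # Sizes are arbitrary defaults chosen for the example.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Derive k bit positions by salting one hash with the index i.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, item):
        # False: the item was definitely never added.
        # True: the item was probably added (false positives are possible).
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


def find_probable_duplicates(items, num_bits=1024, num_hashes=3):
    """Return items that the filter reports as already seen."""
    seen = BloomFilter(num_bits, num_hashes)
    duplicates = []
    for item in items:
        if seen.might_contain(item):
            duplicates.append(item)
        else:
            seen.add(item)
    return duplicates
```

A real duplicate is always reported, because every bit set by its first occurrence is still set when it shows up again. Occasionally an item that was never seen is reported too; that is the false positive you accept in exchange for storing only a fixed-size bit array instead of the items.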


On Fri, Mar 29, 2013 at 5:36 PM, Sai Sai <saigraph@yahoo.in> wrote:

> Can someone give a simple analogy of Bloom Filter in SQL.
> I am trying to understand and always get confused.
> Thanks
