Return-Path: Delivered-To: apmail-hadoop-hbase-user-archive@locus.apache.org Received: (qmail 74342 invoked from network); 18 Jun 2008 20:20:14 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 18 Jun 2008 20:20:14 -0000 Received: (qmail 82848 invoked by uid 500); 18 Jun 2008 20:20:15 -0000 Delivered-To: apmail-hadoop-hbase-user-archive@hadoop.apache.org Received: (qmail 82661 invoked by uid 500); 18 Jun 2008 20:20:15 -0000 Mailing-List: contact hbase-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-user@hadoop.apache.org Delivered-To: mailing list hbase-user@hadoop.apache.org Received: (qmail 82646 invoked by uid 99); 18 Jun 2008 20:20:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Jun 2008 13:20:14 -0700 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [65.99.197.50] (HELO s01.igfoo.com) (65.99.197.50) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Jun 2008 20:19:23 +0000 Received: from localhost (localhost [127.0.0.1]) by s01.igfoo.com (Postfix) with ESMTP id BD89AF3410E for ; Wed, 18 Jun 2008 15:19:10 -0500 (CDT) X-DKIM: Sendmail DKIM Filter v2.5.4 s01.igfoo.com BD89AF3410E DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=igfoo.com; s=mail; t=1213820350; bh=6Hfypi7sopxZgk7X7vI7Hptmw3ZVO4K3rFDcclOEmEw=; h=Message-ID:Date:From:MIME-Version:To:Subject:References: In-Reply-To:Content-Type:Content-Transfer-Encoding; b=iWBUs1iIrSWH 05khhQxVf2HzodJCQmjHnM3aCur6W8plYcnFGsjjOxo3TJrTt6agSCBW5cFFOW4nTCm kkbDSeG7bUZCBHlyWrMVgPWh8o4jIYh9/Psj6CZI3+6ilGxP6a2nvM/Y0hrU7Rqf+tQ 97DHnr3yRK7USXuUf8QeyyytE= X-Virus-Scanned: Debian amavisd-new at s01.igfoo.com Received: from s01.igfoo.com ([127.0.0.1]) by localhost (s01.igfoo.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id DMYZGROA75kc for ; Wed, 18 Jun 2008 15:19:08 -0500 (CDT) Received: from [192.168.1.199] (pool-71-164-181-4.dllstx.fios.verizon.net [71.164.181.4]) by s01.igfoo.com (Postfix) with ESMTPSA id 02E8EF340C7 for ; Wed, 18 Jun 2008 15:19:07 -0500 (CDT) X-DKIM: Sendmail DKIM Filter v2.5.4 s01.igfoo.com 02E8EF340C7 DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=igfoo.com; s=mail; t=1213820348; bh=6Hfypi7sopxZgk7X7vI7Hptmw3ZVO4K3rFDcclOEmEw=; h=Message-ID:Date:From:MIME-Version:To:Subject:References: In-Reply-To:Content-Type:Content-Transfer-Encoding; b=jxJ5KmnQ6TMX IsL5YUFPtsXjs7GJ4AqTTSAcDGWPbn1xRin91Q9LGcSZ4XbCVFt1eLDx5GMTX2wA6/6 uDXiUP5nfUcBOp/d9yIBrxNlBTsO89FK4ZnIl0Y3JnC5MnfNLdINavr85wYLGE9cKs1 J5xEArSZhK/SoGDRJcssKb98s= Message-ID: <48596DBB.3060300@apache.org> Date: Wed, 18 Jun 2008 15:19:07 -0500 From: Dennis Kubes User-Agent: Thunderbird 2.0.0.14 (X11/20080505) MIME-Version: 1.0 To: hbase-user@hadoop.apache.org Subject: Re: Nutch + HBase References: <7e536b1f0806171039n3f85fe9fpbc61dacf17fdb26c@mail.gmail.com> In-Reply-To: <7e536b1f0806171039n3f85fe9fpbc61dacf17fdb26c@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org We have discussed it but not implemented it. A previous step before implementing interfaces to use HBase for current Nutch databases was to may the Nutch architecture itself more flexible. This is what I have been terming Nutch 2 and what I have been currently working on. Dennis Marcus Herou wrote: > Hi. > > Anyone tried to implement HBase as storage for: > > * CrawlDB > * LinkDB > * Fetched and parsed url data > > It would certainly be cool I think to be able to search in all these three > db's. Currently it is a little bit hard to use the data crawled without > actually indexing it. > > Kindly > > //Marcus > > >