Return-Path: Delivered-To: apmail-jackrabbit-dev-archive@www.apache.org Received: (qmail 29871 invoked from network); 19 Feb 2009 11:48:52 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 19 Feb 2009 11:48:52 -0000 Received: (qmail 98753 invoked by uid 500); 19 Feb 2009 11:48:51 -0000 Delivered-To: apmail-jackrabbit-dev-archive@jackrabbit.apache.org Received: (qmail 98727 invoked by uid 500); 19 Feb 2009 11:48:50 -0000 Mailing-List: contact dev-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@jackrabbit.apache.org Delivered-To: mailing list dev@jackrabbit.apache.org Received: (qmail 98718 invoked by uid 99); 19 Feb 2009 11:48:50 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Feb 2009 03:48:50 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mreutegg@day.com designates 207.126.148.181 as permitted sender) Received: from [207.126.148.181] (HELO eu3sys201aog001.obsmtp.com) (207.126.148.181) by apache.org (qpsmtpd/0.29) with SMTP; Thu, 19 Feb 2009 11:48:41 +0000 Received: from source ([209.85.218.167]) by eu3sys201aob001.postini.com ([207.126.154.11]) with SMTP ID DSNKSZ1HBEbGH61XJsjr3Dj+HzD/oIXH33sa@postini.com; Thu, 19 Feb 2009 11:48:21 UTC Received: by bwz11 with SMTP id 11so971662bwz.22 for ; Thu, 19 Feb 2009 03:48:20 -0800 (PST) MIME-Version: 1.0 Sender: mreutegg@day.com Received: by 10.223.114.68 with SMTP id d4mr2670631faq.86.1235044100082; Thu, 19 Feb 2009 03:48:20 -0800 (PST) Date: Thu, 19 Feb 2009 12:48:20 +0100 X-Google-Sender-Auth: dcf30075c7dd36e5 Message-ID: Subject: Jackrabbit on Hadoop From: Marcel Reutegger To: dev@jackrabbit.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Hi all, I recently committed a PersistenceManager and a DataStore implementation based on Hadoop. It's in sandbox/jackrabbit-hadoop. The PersistenceManager uses Hadoop HBase, which is something similar to Googles BigTable. The data store implementation uses plain Hadoop HDFS. The performance is surprisingly good. On my machine the JCRAPITests take 40 seconds compared to 35 seconds with the default derby persistence manager. feedback welcome. regards marcel