Return-Path: Delivered-To: apmail-hadoop-core-user-archive@www.apache.org Received: (qmail 76001 invoked from network); 21 Oct 2008 02:43:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 21 Oct 2008 02:43:51 -0000 Received: (qmail 13680 invoked by uid 500); 21 Oct 2008 02:43:49 -0000 Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org Received: (qmail 13390 invoked by uid 500); 21 Oct 2008 02:43:48 -0000 Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-user@hadoop.apache.org Delivered-To: mailing list core-user@hadoop.apache.org Delivered-To: moderator for core-user@hadoop.apache.org Received: (qmail 80199 invoked by uid 99); 21 Oct 2008 01:58:22 -0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of hyunsikchoi@korea.ac.kr designates 163.152.6.123 as permitted sender) X-SmartFilter: UNIQ,A4426A8110EF06CA564860EA8D301651,m9L1wvN0025F X-SmartFilter: FROM,m9L1wvN0025F,hyunsikchoi@korea.ac.kr Subject: Re: A Scale-Out RDF Store for Distributed Processing on Map/Reduce From: Hyunsik Choi Reply-To: hyunsikchoi@korea.ac.kr To: core-user@hadoop.apache.org In-Reply-To: <48FD2EF5.4060403@metaweb.com> References: <48FD2EF5.4060403@metaweb.com> Content-Type: text/plain Organization: Database & Information Systems Lab. Date: Tue, 21 Oct 2008 10:57:40 +0900 Message-Id: <1224554260.7507.17.camel@code> Mime-Version: 1.0 X-Mailer: Evolution 2.24.0 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Hi Colin, I'm a member of RDF proposal. I have one question as to Metaweb. Do you intend to make Metaweb open source? Hyunsik Choi On Mon, 2008-10-20 at 18:23 -0700, Colin Evans wrote: > Hi Edward, > At Metaweb, we're experimenting with storing raw triples in HDFS flat > files, and have written a simple query language and planner that > executes the queries with chained map-reduce jobs. This approach works > well for warehousing triple data, and doesn't require HBase. Queries > may take a few minutes to execute, but the system scales for very large > datasets and result sets because it doesn't try to resolve queries in > memory. We're currently testing with more than 150MM triples and have > been happy with the results. > > -Colin > > > Edward J. Yoon wrote: > > Hi all, > > > > This RDF proposal is a good long time ago. Now we'd like to settle > > down to research again. I attached our proposal, We'd love to hear > > your feedback & stories!! > > > > Thanks. > > > -- ----------------------------------------------------------------- Hyunsik Choi (Ph.D Student) Laboratory of Prof. Yon Dohn Chung Database & Information Systems Group Dept. of Computer Science & Engineering, Korea University 1, 5-ga, Anam-dong, Seongbuk-gu, Seoul, 136-713, Republic of Korea TEL : +82-2-3290-3580 -----------------------------------------------------------------