Date: Wed, 28 Mar 2012 07:51:35 +0900
Subject: Re: multi-dimensional array storage
From: "Edward J. Yoon"
To: hama-user@incubator.apache.org

Hi,

I believe that HBase is the best way to store multi-dimensional arrays. HBase provides storage efficiencies as the number of dimensions grows, keeps the data ordered, and also allows you to record and access data corrections and updates directly via the HBase client library.

Another option is to use SequenceFile and MapFile.

Once the data has been loaded into the program, your math operations can run directly in memory and be synchronized using the standard BSP APIs.

Thanks.

On Wed, Mar 28, 2012 at 12:46 AM, Noah Watkins wrote:
> Hi Hama list,
>
> I'm interested in using Hama to process large multi-dimensional arrays (sparse and dense). What is the best way to store and represent this type of data for processing in Hama?
>
> Thanks,
> Noah

--
Best Regards, Edward J. Yoon
@eddieyoon
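
To make the HBase suggestion above a little more concrete, here is a minimal sketch of one way the coordinates of an array cell could be turned into a row key. It relies on the fact that HBase sorts rows by the unsigned byte order of their keys, so fixed-width big-endian indices keep cells in row-major order. The class ArrayCellKey and its methods are hypothetical names for illustration only; they are not part of HBase or Hama, and the key layout is an assumption rather than a recommended schema.

```java
import java.nio.ByteBuffer;

// Hypothetical helper (not an HBase or Hama class): maps array coordinates
// to a byte[] key whose unsigned byte order matches row-major cell order.
final class ArrayCellKey {
    private ArrayCellKey() {}

    // Encode each non-negative index as a 4-byte big-endian int.
    // ByteBuffer writes big-endian by default.
    static byte[] encode(int... coords) {
        ByteBuffer buf = ByteBuffer.allocate(coords.length * Integer.BYTES);
        for (int c : coords) {
            if (c < 0) {
                throw new IllegalArgumentException("negative index: " + c);
            }
            buf.putInt(c);
        }
        return buf.array();
    }

    // Recover the coordinates from a key produced by encode().
    static int[] decode(byte[] key) {
        if (key.length % Integer.BYTES != 0) {
            throw new IllegalArgumentException("key length must be a multiple of 4");
        }
        ByteBuffer buf = ByteBuffer.wrap(key);
        int[] coords = new int[key.length / Integer.BYTES];
        for (int i = 0; i < coords.length; i++) {
            coords[i] = buf.getInt();
        }
        return coords;
    }

    // Unsigned lexicographic comparison of two keys, byte by byte.
    static int compare(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int d = (a[i] & 0xff) - (b[i] & 0xff);
            if (d != 0) {
                return d;
            }
        }
        return a.length - b.length;
    }
}
```

With a layout like this, a sparse array would store only the cells that are present, and a dense sub-block with a common index prefix would sit in one contiguous key range. The encoded bytes would be used as the row key when writing the cell value through the HBase client library.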