Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CA34591DA for ; Tue, 8 May 2012 17:05:50 +0000 (UTC) Received: (qmail 58000 invoked by uid 500); 8 May 2012 17:05:48 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 57942 invoked by uid 500); 8 May 2012 17:05:48 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 57934 invoked by uid 99); 8 May 2012 17:05:48 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 May 2012 17:05:48 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of doug.meil@explorysmedical.com designates 213.199.154.144 as permitted sender) Received: from [213.199.154.144] (HELO db3outboundpool.messaging.microsoft.com) (213.199.154.144) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 May 2012 17:05:40 +0000 Received: from mail37-db3-R.bigfish.com (10.3.81.236) by DB3EHSOBE003.bigfish.com (10.3.84.23) with Microsoft SMTP Server id 14.1.225.23; Tue, 8 May 2012 17:05:03 +0000 Received: from mail37-db3 (localhost [127.0.0.1]) by mail37-db3-R.bigfish.com (Postfix) with ESMTP id 7AE3118041C for ; Tue, 8 May 2012 17:05:03 +0000 (UTC) X-SpamScore: -13 X-BigFish: PS-13(zzbb2dI9371I14ffI1432N98dK4015Izz1202hzz8275dhz2fh2a8h668h839h944he5bh) X-Forefront-Antispam-Report: CIP:157.56.244.181;KIP:(null);UIP:(null);IPV:NLI;H:CH1PRD0410HT005.namprd04.prod.outlook.com;RD:none;EFVD:NLI Received-SPF: pass (mail37-db3: domain of explorysmedical.com designates 157.56.244.181 as permitted sender) client-ip=157.56.244.181; envelope-from=doug.meil@explorysmedical.com; helo=CH1PRD0410HT005.namprd04.prod.outlook.com ;.outlook.com ; Received: from mail37-db3 (localhost.localdomain [127.0.0.1]) by mail37-db3 (MessageSwitch) id 1336496701479422_23484; Tue, 8 May 2012 17:05:01 +0000 (UTC) Received: from DB3EHSMHS019.bigfish.com (unknown [10.3.81.238]) by mail37-db3.bigfish.com (Postfix) with ESMTP id 7168A4A007B for ; Tue, 8 May 2012 17:05:01 +0000 (UTC) Received: from CH1PRD0410HT005.namprd04.prod.outlook.com (157.56.244.181) by DB3EHSMHS019.bigfish.com (10.3.87.119) with Microsoft SMTP Server (TLS) id 14.1.225.23; Tue, 8 May 2012 17:04:59 +0000 Received: from CH1PRD0410MB392.namprd04.prod.outlook.com ([169.254.11.13]) by CH1PRD0410HT005.namprd04.prod.outlook.com ([10.255.147.40]) with mapi id 14.16.0152.000; Tue, 8 May 2012 17:05:12 +0000 From: Doug Meil To: "user@hbase.apache.org" Subject: Re: Sequential read-update HFile Thread-Topic: Sequential read-update HFile Thread-Index: AQHNLTUJ6Gwb9+MM+0aqUmpMIwsm2pa/2/YA Date: Tue, 8 May 2012 17:05:12 +0000 Message-ID: In-Reply-To: <4FA943F7.2020703@cslab.ntua.gr> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.255.147.4] Content-Type: text/plain; charset="us-ascii" Content-ID: <738A39AEEC275B40B2BA3F784E04B4DC@namprd04.prod.outlook.com> Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginatorOrg: explorysmedical.com Once a StoreFile is written to disk it is never updated. http://hbase.apache.org/book.html#regions.arch So the options are are either Puts (like you've been describing - which will create new StoreFiles per MemStore flush) or bulk loading HFiles (new StoreFiles). Out of curiosity, are you trying to update *every* row? Most of the rows? =20 On 5/8/12 12:04 PM, "Nikolaos Papailiou" wrote: >Hi all, > >I would like to sequentially read and update some HFiles. That means >that I want something like a HFileScanner but with the capability of >directly updating the current key/value pair. I have been trying to do >that with sequential puts and major compactions of the region but the >performance is very poor. Is there another way to do this efficiently? > >Thanks, >Nikos >