Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5E1ABFEE9 for ; Mon, 25 Mar 2013 19:39:42 +0000 (UTC) Received: (qmail 56357 invoked by uid 500); 25 Mar 2013 19:39:40 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 56256 invoked by uid 500); 25 Mar 2013 19:39:40 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 56247 invoked by uid 99); 25 Mar 2013 19:39:40 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Mar 2013 19:39:40 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of yuzhihong@gmail.com designates 209.85.212.172 as permitted sender) Received: from [209.85.212.172] (HELO mail-wi0-f172.google.com) (209.85.212.172) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Mar 2013 19:39:33 +0000 Received: by mail-wi0-f172.google.com with SMTP id hr17so3450643wib.17 for ; Mon, 25 Mar 2013 12:39:13 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:content-type; bh=0Hli0VwxuUcCwcohiLbGwXrH9tIW6pzPI8XWKFB7s8w=; b=axrpwpyQhdlwsrs9X9y0GFiI9MJefpnvhHgHEocf6CBO2s4HguA2COlX6VXe1jxVtz sKkpZAX0n5Zsp8tbUPO6K9cI3t+d/7W1S0TeFIuNTNNU2XJajagC3FjtbdXOe2VD8QJ+ +Lt/s1zIMUUglfgGEG7eZE+Nt4guyGDPXrro1rZTafMvncd9gQc++kFK6TyMVecZ5TGO 1ZM3TVBinc3s5S/Zg9l9lvj/X5FsBIh9fXFuAr+kW3LljF3XMo6hudM0mQibXjREQdVt 1qwxIVTbmlgdoYVJPD6wHtrF/g2GLcszPJtcCdJrwQ3Nfz2EF2MpAc3hg9k0cIWV9xop fmlQ== MIME-Version: 1.0 X-Received: by 10.194.92.231 with SMTP id cp7mr20247667wjb.19.1364240353379; Mon, 25 Mar 2013 12:39:13 -0700 (PDT) Received: by 10.194.35.130 with HTTP; Mon, 25 Mar 2013 12:39:13 -0700 (PDT) In-Reply-To: <51BE77A7231E9E488E58380AFBD7727225EF1983@mail1.impetus.co.in> References: <51BE77A7231E9E488E58380AFBD7727225EF1879@mail1.impetus.co.in> <51BE77A7231E9E488E58380AFBD7727225EF18C6@mail1.impetus.co.in> <51BE77A7231E9E488E58380AFBD7727225EF194C@mail1.impetus.co.in> <51BE77A7231E9E488E58380AFBD7727225EF1983@mail1.impetus.co.in> Date: Mon, 25 Mar 2013 12:39:13 -0700 Message-ID: Subject: Re: HBase Writes With Large Number of Columns From: Ted Yu To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=047d7bfd0c2ea6cc0d04d8c4f5ab X-Virus-Checked: Checked by ClamAV on apache.org --047d7bfd0c2ea6cc0d04d8c4f5ab Content-Type: text/plain; charset=ISO-8859-1 bq. These records are being written using batch mutation with thrift API This is an important information, I think. Batch mutation through Java API would incur lower overhead. On Mon, Mar 25, 2013 at 11:40 AM, Pankaj Misra wrote: > Firstly, Thanks a lot Jean and Ted for your extended help, very much > appreciate it. > > Yes Ted I am writing to all the 40 columns and 1.5 Kb of record data is > distributed across these columns. > > Jean, some columns are storing as small as a single byte value, while few > of the columns are storing as much as 80-125 bytes of data. The overall > record size is 1.5 KB. These records are being written using batch mutation > with thrift API, where in we are writing 100 records per batch mutation. > > Thanks and Regards > Pankaj Misra > > > ________________________________________ > From: Jean-Marc Spaggiari [jean-marc@spaggiari.org] > Sent: Monday, March 25, 2013 11:57 PM > To: user@hbase.apache.org > Subject: Re: HBase Writes With Large Number of Columns > > I just ran some LoadTest to see if I can reproduce that. > > bin/hbase org.apache.hadoop.hbase.util.LoadTestTool -write 4:512:100 > -num_keys 1000000 > 13/03/25 14:18:25 INFO util.MultiThreadedAction: [W:100] Keys=997172, > cols=3,8m, time=00:03:55 Overall: [keys/s= 4242, latency=23 ms] > Current: [keys/s=4413, latency=22 ms], insertedUpTo=-1 > > bin/hbase org.apache.hadoop.hbase.util.LoadTestTool -write 100:512:100 > -num_keys 1000000 > > This one crashed because I don't have enought disk space, so I'm > re-running it, but just before it crashed it was showing about 24.5 > slower. which is coherent since it's writing 25 more columns. > > What size of data do you have? Big cells? Small cells? I will retry > the test above with more lines and keep you posted. > > 2013/3/25 Pankaj Misra : > > Yes Ted, you are right, we are having table regions pre-split, and we > see that both regions are almost evenly filled in both the tests. > > > > This does not seem to be a regression though, since we were getting good > write rates when we had lesser number of columns. > > > > Thanks and Regards > > Pankaj Misra > > > > > > ________________________________________ > > From: Ted Yu [yuzhihong@gmail.com] > > Sent: Monday, March 25, 2013 11:15 PM > > To: user@hbase.apache.org > > Cc: ankitjaincs06@gmail.com > > Subject: Re: HBase Writes With Large Number of Columns > > > > Copying Ankit who raised the same question soon after Pankaj's initial > > question. > > > > On one hand I wonder if this was a regression in 0.94.5 (though > unlikely). > > > > Did the region servers receive (relatively) same write load for the > second > > test case ? I assume you have pre-split your tables in both cases. > > > > Cheers > > > > On Mon, Mar 25, 2013 at 10:18 AM, Pankaj Misra > > wrote: > > > >> Hi Ted, > >> > >> Sorry for missing that detail, we are using HBase version 0.94.5 > >> > >> Regards > >> Pankaj Misra > >> > >> > >> ________________________________________ > >> From: Ted Yu [yuzhihong@gmail.com] > >> Sent: Monday, March 25, 2013 10:29 PM > >> To: user@hbase.apache.org > >> Subject: Re: HBase Writes With Large Number of Columns > >> > >> If you give us the version of HBase you're using, that would give us > some > >> more information to help you. > >> > >> Cheers > >> > >> On Mon, Mar 25, 2013 at 9:55 AM, Pankaj Misra < > pankaj.misra@impetus.co.in > >> >wrote: > >> > >> > Hi, > >> > > >> > The issue that I am facing is around the performance drop of Hbase, > when > >> I > >> > was having 20 columns in a column family Vs now when I am having 40 > >> columns > >> > in a column family. The number of columns have doubled and the > >> > ingestion/write speed has also dropped by half. I am writing 1.5 KB of > >> data > >> > per row across 40 columns. > >> > > >> > Are there any settings that I should look into for tweaking Hbase to > >> write > >> > higher number of columns faster? > >> > > >> > I would request community's help to let me know how can I write to a > >> > column family with large number of columns efficiently. > >> > > >> > Would greatly appreciate any help /clues around this issue. > >> > > >> > Thanks and Regards > >> > Pankaj Misra > >> > > >> > ________________________________ > >> > > >> > > >> > > >> > > >> > > >> > > >> > NOTE: This message may contain information that is confidential, > >> > proprietary, privileged or otherwise protected by law. The message is > >> > intended solely for the named addressee. If received in error, please > >> > destroy and notify the sender. Any use of this email is prohibited > when > >> > received in error. Impetus does not represent, warrant and/or > guarantee, > >> > that the integrity of this communication has been maintained nor that > the > >> > communication is free of errors, virus, interception or interference. > >> > > >> > >> ________________________________ > >> > >> > >> > >> > >> > >> > >> NOTE: This message may contain information that is confidential, > >> proprietary, privileged or otherwise protected by law. The message is > >> intended solely for the named addressee. If received in error, please > >> destroy and notify the sender. Any use of this email is prohibited when > >> received in error. Impetus does not represent, warrant and/or guarantee, > >> that the integrity of this communication has been maintained nor that > the > >> communication is free of errors, virus, interception or interference. > >> > > > > ________________________________ > > > > > > > > > > > > > > NOTE: This message may contain information that is confidential, > proprietary, privileged or otherwise protected by law. The message is > intended solely for the named addressee. If received in error, please > destroy and notify the sender. Any use of this email is prohibited when > received in error. Impetus does not represent, warrant and/or guarantee, > that the integrity of this communication has been maintained nor that the > communication is free of errors, virus, interception or interference. > > ________________________________ > > > > > > > NOTE: This message may contain information that is confidential, > proprietary, privileged or otherwise protected by law. The message is > intended solely for the named addressee. If received in error, please > destroy and notify the sender. Any use of this email is prohibited when > received in error. Impetus does not represent, warrant and/or guarantee, > that the integrity of this communication has been maintained nor that the > communication is free of errors, virus, interception or interference. > --047d7bfd0c2ea6cc0d04d8c4f5ab--