hbase-user mailing list archives

From Jim X <jim.p...@gmail.com>
Subject Re: HTable.put(List<Put> puts) perform batch insert?
Date Sun, 30 Jan 2011 19:59:20 GMT
Which one did you finally use for batch processing, like a JDBC batch?

On Tue, Jan 18, 2011 at 11:31 AM, Weishung Chung <weishung@gmail.com> wrote:
> Thank you, I will look into these packages :)
>
> On Sun, Jan 16, 2011 at 4:17 AM, Otis Gospodnetic <otis_gospodnetic@yahoo.com> wrote:
>
>> Hi,
>>
>> Re HBaseHUT - Alex didn't mention it, but he did a really nice and clear
>> writeup of it in this post:
>>
>> http://blog.sematext.com/2010/12/16/deferring-processing-updates-to-increase-hbase-write-performance/
>>
>>
>> Otis
>> ----
>> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
>> Lucene ecosystem search :: http://search-lucene.com/
>>
>>
>>
>> ----- Original Message ----
>> > From: Alex Baranau <alex.baranov.v@gmail.com>
>> > To: user@hbase.apache.org
>> > Sent: Tue, January 11, 2011 10:51:28 AM
>> > Subject: Re: HTable.put(List<Put> puts) perform batch insert?
>> >
>> > Re HBaseHUT: J-D was correct, you will gain speed with it if you need
>> > Get & Put operations to perform your updates.
>> >
>> > Don't forget to play with the writeToWAL and writeBuffer (with
>> > autoFlush=false) attributes!
>> >
>> > Alex Baranau
>> > ----
>> > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - Hadoop - HBase
>> >
>> > On Mon, Jan 10, 2011 at 10:45 PM, Weishung Chung <weishung@gmail.com> wrote:
>> >
>> > > OK, I will test it, thanks again :)
>> > >
>> > > On Mon, Jan 10, 2011 at 1:53 PM, Jean-Daniel Cryans <jdcryans@apache.org> wrote:
>> > >
>> > > > Depending on the level of super fastness you need, it may or may not
>> > > > be fast enough. Better to test it, as usual.
>> > > >
>> > > > J-D
>> > > >
>> > > > On Mon, Jan 10, 2011 at 11:12 AM, Weishung Chung <weishung@gmail.com> wrote:
>> > > > > Multiple batches of 10k *new/updated* rows at any time to different tables
>> > > > > by different clients simultaneously. I want these multiple batches of
>> > > > > insertions to be done super fast. At the same time, I would like to be able
>> > > > > to scale up to 100k rows at a time (the goal). Now, I am building a cluster
>> > > > > of size 6 to 7 nodes.
>> > > > >
>> > > > > On Mon, Jan 10, 2011 at 1:03 PM, Jean-Daniel Cryans <jdcryans@apache.org> wrote:
>> > > > >
>> > > > >> lotsa rows? That's 1k or 1B? Inside an OLTP system or OLAP?
>> > > > >>
>> > > > >> J-D
>> > > > >>
>> > > > >> On Mon, Jan 10, 2011 at 10:58 AM, Weishung Chung <weishung@gmail.com> wrote:
>> > > > >> > Jonathan, awesome, best of breed APIs!
>> > > > >> > Jean, I would like to insert lotsa new rows with many columns in a
>> > > > >> > particular column family *programmatically in batch, just like the JDBC
>> > > > >> > addBatch method.*
>> > > > >> > *Thanks again.*
>> > > > >> >
>> > > > >> >
>> > > > >>
>> > > > >
>> > > >
>> > >
>> >
>>
>
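
For readers following Alex's tip above about writeBuffer with autoFlush=false, here is a minimal sketch of the idea behind client-side write buffering: puts accumulate in memory and are sent as one batch once a size threshold is crossed, plus one explicit flush at the end. It is a plain-Java illustration only, not the HBase client. The names Row, BufferedTable and maxBufferBytes are hypothetical, and the "send" step just counts rows. In the HBase client of that era the corresponding knobs were, as far as I know, HTable.setAutoFlush(false), HTable.setWriteBufferSize(...) and HTable.flushCommits().

```java
import java.util.ArrayList;
import java.util.List;

public class WriteBufferSketch {

    /** Hypothetical stand-in for a row mutation; NOT the HBase Put class. */
    static final class Row {
        final String key;
        final byte[] value;

        Row(String key, byte[] value) {
            this.key = key;
            this.value = value;
        }

        /** Rough size estimate used to decide when the buffer is full. */
        long size() {
            return key.length() + value.length;
        }
    }

    /** Hypothetical client-side buffer; NOT the HBase HTable class. */
    static final class BufferedTable {
        private final long maxBufferBytes;
        private final List<Row> buffer = new ArrayList<>();
        private long bufferedBytes = 0;
        int flushCount = 0; // number of simulated round trips
        int rowsSent = 0;   // total rows "sent" so far

        BufferedTable(long maxBufferBytes) {
            this.maxBufferBytes = maxBufferBytes;
        }

        /** Buffers each row and flushes whenever the threshold is exceeded. */
        void put(List<Row> rows) {
            for (Row row : rows) {
                buffer.add(row);
                bufferedBytes += row.size();
                if (bufferedBytes > maxBufferBytes) {
                    flush();
                }
            }
        }

        /** Sends everything currently buffered as one batch (simulated). */
        void flush() {
            if (buffer.isEmpty()) {
                return;
            }
            // A real client would issue one batched request here.
            rowsSent += buffer.size();
            flushCount++;
            buffer.clear();
            bufferedBytes = 0;
        }
    }

    public static void main(String[] args) {
        BufferedTable table = new BufferedTable(64 * 1024);
        List<Row> batch = new ArrayList<>();
        for (int i = 0; i < 10_000; i++) {
            batch.add(new Row("row" + i, new byte[100]));
        }
        table.put(batch);
        table.flush(); // push out whatever is still buffered
        System.out.println(table.rowsSent + " rows sent in "
                + table.flushCount + " flushes");
    }
}
```

The trade-off is the usual one: rows still sitting in the buffer are lost if the client dies before the final flush, and turning off writeToWAL similarly trades durability for speed on the server side. As J-D says, better to test it.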
