hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Pan, Thomas" <th...@ebay.com>
Subject Re: Scan performance on a big table as combination of multiple logic tables
Date Fri, 17 Feb 2012 21:26:55 GMT

Currently, bulk load is for bootstrapping the table(s) while random write
is the way to go, which we could assume that the operations are evenly
distributed across the time for all the column families. -Thomas

On 2/15/12 2:07 PM, "Stack" <stack@duboce.net> wrote:

>On Wed, Feb 15, 2012 at 2:02 PM, Todd Lipcon <todd@cloudera.com> wrote:
>> Hi Thomas,
>> The issue with combining multiple tables into different CFs of one
>> table is that the tables will get tied together for flush/compact
>> operations. If the workload between them differs significantly you
>> might introduce bad inefficiency for one or the other. See HBASE-3149.
>Are the two column families bulk loaded at the same time Thomas?
>Updates come in as trickles over the API but main loading is via bulk
>load (across the multiple column families?)?

View raw message