hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From hdev ml <hde...@gmail.com>
Subject Re: HBase Region splitting may times.
Date Fri, 16 Mar 2012 17:40:06 GMT
Does anybody have an answer to this?

Please let me know.

Thanks
Harshad

On Mar 15, 2012, at 11:12 PM, hdev ml wrote:
>>
>> > Hi,
>> >
>> > We are using HBase version 0.90.3 in a 2 node cluster. Maybe this
>> question
>> > has been asked too many times. But I could not find a good answer for
>> this.
>> >
>> > I created a test table with one column family "cf" with 2 columns "a"
>> and
>> > "b", each having value of a 3000 character long string. Maximum versions
>> > allowed is 3 and maxfilesize is at default 256M.
>> >
>> > In a loop, I put 100000 rows into it, with 3000 character long values
>> for
>> > both a and b. Row key is incremental like row00000000 to row00099999.
>> >
>> > I applied an outer loop which will run the above 100000 row put loop, 10
>> > times.
>> >
>> > After running it 10 times, I found that it split into following number
>> of
>> > regions for every run.
>> >
>> > Run     Regions
>> > 1            4
>> > 2            5
>> > 3            7
>> > 4           10
>> > 5           13
>> > 6           19
>> > 7           19
>> > 8           19
>> > 9           19
>> > 10          19
>> >
>> > Question is, why did it stabilize after the 6th run? Shouldn't it
>> stabilize
>> > after 3 runs, because number of versions is 3? After 3 runs, It should
>> not
>> > split further, because new versions are being added but old version
>> should
>> > be purged/deleted. Is that a correct statement?
>> >
>> > Any help is really appreciated.
>> >
>> > Thanks,
>> > Harshad
>>
>>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message