hbase-issues mailing list archives

From "Subbu M Iyer (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-3488) Allow RowCounter to retrieve multiple versions of rows
Date Mon, 28 Mar 2011 17:38:05 GMT

    [ https://issues.apache.org/jira/browse/HBASE-3488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13012150#comment-13012150 ]

Subbu M Iyer commented on HBASE-3488:
-------------------------------------

First draft of CellCounter submitted for review. Warning: this is a very early draft. :-)


Test Setup
===========

1. Table "planets" with 5 CFs.
2. All CFs are set up with max versions of 10.
3. 5 rows in total. The first row has all 5 CFs; the remaining 4 rows have only CF1.
4. R1->CF1[c1-c9] (9 quals), CF2[c21] (1 qual), CF3[c31] (1 qual),
   CF4[c41] (1 qual), CF5[c51] (1 qual)
5. R2->CF1[c1-c9], R3->CF1[c1-c8], R4->CF1[c9 -> 9 versions], R5->CF1[c1-c5]

========================================================================

CellCounter lists the following stats:
======================================
1. Total number of rows in the table
2. Total number of CFs across all rows
3. Total qualifiers across all rows
4. Total occurrences of each CF
5. Total occurrences of each qualifier
6. Total number of versions of each qualifier
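
For readers who want to see how counters like these can be derived, here is a minimal sketch in plain Java. It is not the submitted patch and does not use the HBase API: the class name CellCounterSketch, the method tally, and the {row, family, qualifier} array representation of a cell are all hypothetical stand-ins chosen for illustration. It assumes each stored version of a qualifier shows up as one entry in the input list, and that row and family names do not contain "/".

```java
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

// Illustrative only: a plain-Java stand-in for the counting logic, not the patch itself.
class CellCounterSketch {

    // Each cell is {row, family, qualifier} (hypothetical representation).
    // A qualifier stored with N versions appears N times in the input list.
    static Map<String, Long> tally(List<String[]> cells) {
        Map<String, Long> counters = new TreeMap<>();
        Set<String> rows = new HashSet<>();
        Set<String> rowFamilies = new HashSet<>();
        Set<String> rowQualifiers = new HashSet<>();

        for (String[] cell : cells) {
            String row = cell[0];
            String family = cell[1];
            String qualifier = cell[2];
            String column = family + ":" + qualifier;

            // Count each row once.
            if (rows.add(row)) {
                counters.merge("Total ROWS", 1L, Long::sum);
            }
            // Count each family once per row it occurs in.
            if (rowFamilies.add(row + "/" + family)) {
                counters.merge("Total Families Across all Rows", 1L, Long::sum);
                counters.merge(family, 1L, Long::sum);
            }
            // Count each qualifier once per row it occurs in.
            if (rowQualifiers.add(row + "/" + column)) {
                counters.merge("Total Qualifiers across all Rows", 1L, Long::sum);
                counters.merge(column, 1L, Long::sum);
            }
            // Every entry is one stored version of that row/column.
            counters.merge(row + ":" + column + "_Versions", 1L, Long::sum);
        }
        return counters;
    }

    public static void main(String[] args) {
        // Small made-up sample: two rows, one qualifier with three versions.
        List<String[]> cells = List.of(
            new String[] {"row_11", "cf1", "col1"},
            new String[] {"row_11", "cf2", "col21"},
            new String[] {"row_44", "cf1", "col9"},
            new String[] {"row_44", "cf1", "col9"},
            new String[] {"row_44", "cf1", "col9"});
        tally(cells).forEach((name, value) -> System.out.println(name + "\t" + value));
    }
}
```

The sets only track what has already been seen, so a family or qualifier is counted once per row, while the _Versions counters grow with every stored version. In a real MapReduce job the same tallies would more likely be emitted as key/count pairs and summed in a reducer; that part is left out here.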

==========================================================================
Running the CellCounter on the above setup produces the following report:
==========================================================================

Total Families Across all Rows  9
Total Qualifiers across all Rows        36
Total ROWS      5
cf1     5
cf1:col1        4
cf1:col2        4
cf1:col3        4
cf1:col4        4
cf1:col5        4
cf1:col6        3
cf1:col7        3
cf1:col8        3
cf1:col9        3
cf2     1
cf2:col21       1
cf3     1
cf3:col31       1
cf4     1
cf4:col41       1
cf5     1
cf5:col51       1
row_11:cf1:col1_Versions        1
row_11:cf1:col2_Versions        1
row_11:cf1:col3_Versions        1
row_11:cf1:col4_Versions        1
row_11:cf1:col5_Versions        1
row_11:cf1:col6_Versions        1
row_11:cf1:col7_Versions        1
row_11:cf1:col8_Versions        1
row_11:cf1:col9_Versions        1
row_11:cf2:col21_Versions       1
row_11:cf3:col31_Versions       1
row_11:cf4:col41_Versions       1
row_11:cf5:col51_Versions       1
row_22:cf1:col1_Versions        1
row_22:cf1:col2_Versions        1
row_22:cf1:col3_Versions        1
row_22:cf1:col4_Versions        1
row_22:cf1:col5_Versions        1
row_22:cf1:col6_Versions        1
row_22:cf1:col7_Versions        1
row_22:cf1:col8_Versions        1
row_22:cf1:col9_Versions        1
row_33:cf1:col1_Versions        1
row_33:cf1:col2_Versions        1
row_33:cf1:col3_Versions        1
row_33:cf1:col4_Versions        1
row_33:cf1:col5_Versions        1
row_33:cf1:col6_Versions        1
row_33:cf1:col7_Versions        1
row_33:cf1:col8_Versions        1
row_44:cf1:col9_Versions        9
row_55:cf1:col1_Versions        1
row_55:cf1:col2_Versions        1
row_55:cf1:col3_Versions        1
row_55:cf1:col4_Versions        1
row_55:cf1:col5_Versions        1
==============================================================================
 

> Allow RowCounter to retrieve multiple versions of rows
> ------------------------------------------------------
>
>                 Key: HBASE-3488
>                 URL: https://issues.apache.org/jira/browse/HBASE-3488
>             Project: HBase
>          Issue Type: Bug
>          Components: util
>    Affects Versions: 0.90.0
>            Reporter: Ted Yu
>             Fix For: 0.92.0
>
>
> Currently RowCounter only retrieves the latest version of each row.
> Some applications store multiple versions of the same row.
> RowCounter should accept a new parameter for the number of versions to return.
> The Scan object would be configured with this version parameter (for scan.maxVersions).
> Then the following API should be called:
> {code}
>   public KeyValue[] raw() {
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
