drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Carrot Hu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-3491) SELECT COUNT(*) FROM HBASE Returns Incorrect Row Count
Date Tue, 14 Jul 2015 01:40:05 GMT

     [ https://issues.apache.org/jira/browse/DRILL-3491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Carrot Hu updated DRILL-3491:
-----------------------------
    Description: 
Create a table 'test' in Hbase with 1 column family, 7 columns.
Inserting 100,000 rows into 'test' using Java API, each column with same value = "value".

SELECT COUNT(<all>) FROM hbase.test
returns an incorrect row count.

SELECT COUNT(row_key) is correct,
SELECT COUNT(<Any subset of the columns>) is also correct.

Clear the table, and changed to inserting 1000 rows, keep the number of columns, Drill returns
the right count. But when increasing the number of columns to 30. SELLECT COUNT(<all>)
returns an incorrect row count (only 673).

  was:
Create a table 'test' in Hbase with 1 column family, 7 columns.
Inserting 100,000 rows into 'test' using Java API, each column with same value "value".

SELECT COUNT(*) FROM hbase.test
returns an incorrect row count.

SELECT COUNT(row_key) is correct.
SELECT COUNT(<Any subset of the columns>) is also correct.

Clear the table, and changed to inserting 1000 rows, keep the number of columns, Drill returns
the right count. 

Clear the table, sitll insert 1000 rows but increase the number of columns to 30. SELLECT
COUNT(*) returns an incorrect row count (only 673).


> SELECT COUNT(*) FROM HBASE Returns Incorrect Row Count
> ------------------------------------------------------
>
>                 Key: DRILL-3491
>                 URL: https://issues.apache.org/jira/browse/DRILL-3491
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - HBase
>    Affects Versions: 1.0.0, 1.1.0
>         Environment: CentOS6.5
> jdk1.8.0_45
> hadoop-2.6.0-cdh5.4.2
> hbase-1.0.0-cdh5.4.2
> IntelliJ14.1.4
> Maven3.0.5
>            Reporter: Carrot Hu
>            Assignee: Aditya Kishore
>              Labels: hbase, sql
>
> Create a table 'test' in Hbase with 1 column family, 7 columns.
> Inserting 100,000 rows into 'test' using Java API, each column with same value = "value".
> SELECT COUNT(<all>) FROM hbase.test
> returns an incorrect row count.
> SELECT COUNT(row_key) is correct,
> SELECT COUNT(<Any subset of the columns>) is also correct.
> Clear the table, and changed to inserting 1000 rows, keep the number of columns, Drill
returns the right count. But when increasing the number of columns to 30. SELLECT COUNT(<all>)
returns an incorrect row count (only 673).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message