hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Josh Elser (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-13262) ResultScanner doesn't return all rows in Scan
Date Thu, 19 Mar 2015 05:05:38 GMT

    [ https://issues.apache.org/jira/browse/HBASE-13262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14368524#comment-14368524
] 

Josh Elser commented on HBASE-13262:
------------------------------------

Turns out the ever-so-slight variance between sizes on server and client is there from {{KeyValue#heapSize()}}.

{panel:title=RegionServer}
{noformat}
206228/cf:0/1426716998482/Put/vlen=50/seqid=41, size=152, length=81
KeyLength=21, TagsLength=0, RowLength=6, FamilyLength=2, QualiferLength=1
{noformat}
{panel}

{panel:title=Client}
{noformat}
206228/cf:0/1426716998482/Put/vlen=50/seqid=0, Size=144, Length=79
KeyLength=21, TagsLength=0, RowLength=6, FamilyLength=2, QualiferLength=1
{noformat}
{panel}

Best as I understand, the seqId should be causing a difference in the calculation of {{heapSize()}}.
Aside from that, I haven't been able to find a fundamental difference in the underlying {{byte[]}}
of the {{KeyValue}} on client and server. My present hunch is that the length of the array
happens to be 2-bytes longer than the data in the array and somehow those two extra bytes
get lopped off in the process of being sent from server to client? Just a guess at this point
-- will hook up a debugger again tmrw to see if I can figure out exactly why the length is
longer but the objects appear to be functionally equivalent.

> ResultScanner doesn't return all rows in Scan
> ---------------------------------------------
>
>                 Key: HBASE-13262
>                 URL: https://issues.apache.org/jira/browse/HBASE-13262
>             Project: HBase
>          Issue Type: Bug
>          Components: Client
>    Affects Versions: 2.0.0, 1.1.0
>         Environment: Single node, pseduo-distributed 1.1.0-SNAPSHOT
>            Reporter: Josh Elser
>            Assignee: Josh Elser
>            Priority: Blocker
>             Fix For: 2.0.0, 1.1.0
>
>         Attachments: testrun_0.98.txt, testrun_branch1.0.txt
>
>
> Tried to write a simple Java client again 1.1.0-SNAPSHOT.
> * Write 1M rows, each row with 1 family, and 10 qualifiers (values [0-9]), for a total
of 10M cells written
> * Read back the data from the table, ensure I saw 10M cells
> Running it against {{04ac1891}} (and earlier) yesterday, I would get ~20% of the actual
rows. Running against 1.0.0, returns all 10M records as expected.
> [Code I was running|https://github.com/joshelser/hbase-hwhat/blob/master/src/main/java/hbase/HBaseTest.java]
for the curious.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message