accumulo-notifications mailing list archives

From "Sean Busbey (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ACCUMULO-1892) examples.simple.RandomBatchWriter might not write the specified number of rowids
Date Tue, 19 Nov 2013 18:19:20 GMT

    [ https://issues.apache.org/jira/browse/ACCUMULO-1892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13826777#comment-13826777 ]

Sean Busbey commented on ACCUMULO-1892:
---------------------------------------

Yeah, 1.5.1-SNAPSHOT, 1.6.0-SNAPSHOT, and master all look correct now.

> examples.simple.RandomBatchWriter might not write the specified number of rowids
> --------------------------------------------------------------------------------
>
>                 Key: ACCUMULO-1892
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-1892
>             Project: Accumulo
>          Issue Type: Bug
>          Components: docs, test
>    Affects Versions: 1.4.0, 1.4.1, 1.4.2, 1.4.3, 1.4.4, 1.5.0
>            Reporter: Sean Busbey
>            Assignee: Sean Busbey
>            Priority: Minor
>             Fix For: 1.4.5, 1.5.1, 1.6.0
>
>         Attachments: ACCUMULO-1892.1.patch.txt, ACCUMULO-1892.2.patch.txt, ACCUMULO-1892.3.patch.txt
>
>
> examples.simple.RandomBatchWriter takes a command line arg for how many rowids to write.
> In the functional tests, this arg is used to coordinate a number of expected reads with
> the examples.simple.RandomBatchReader example.
> The reader implementation ensures that it generates the passed number of query rows,
> but the writer just makes num attempts at creating random rowids.
> For a small number of goal rows and a large range (like the examples used for testing
> bloom filter speeds early in the examples integration test), this works fine.
> For a larger number of goal rows (like the examples used to just write/read 10k rows
> later in the examples integration test), collisions in the selected row ids will cause the
> writer to actually generate fewer than the expected number. That, in turn, will cause the
> attempted read integration test to fail.
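
The collision effect described above can be reproduced outside Accumulo. The following is a minimal sketch, not the code from the attached patches: the class name RowIdCollisionSketch and its methods are hypothetical, and the way a row id is drawn from [min, max) is an assumption made for illustration. It contrasts making num attempts (which can yield fewer distinct row ids) with drawing until num distinct row ids exist.

```java
import java.util.HashSet;
import java.util.Random;
import java.util.Set;

// Hypothetical illustration only; this is NOT the RandomBatchWriter source
// and NOT the fix from the ACCUMULO-1892 patches.
class RowIdCollisionSketch {

  // Assumed row id scheme: a roughly uniform long in [min, max).
  static long randomRowId(Random r, long min, long max) {
    return min + Math.floorMod(r.nextLong(), max - min);
  }

  // Behaviour described in the report: make `num` attempts and see how
  // many distinct row ids actually come out. Duplicates are silently lost.
  static int distinctAfterAttempts(Random r, long min, long max, int num) {
    Set<Long> seen = new HashSet<>();
    for (int i = 0; i < num; i++) {
      seen.add(randomRowId(r, min, max));
    }
    return seen.size();
  }

  // One possible remedy: keep drawing until exactly `num` distinct row ids
  // have been produced. Requires the range to hold at least `num` values.
  static Set<Long> exactlyDistinct(Random r, long min, long max, int num) {
    if (max - min < num) {
      throw new IllegalArgumentException("range too small for " + num + " distinct row ids");
    }
    Set<Long> seen = new HashSet<>();
    while (seen.size() < num) {
      seen.add(randomRowId(r, min, max));
    }
    return seen;
  }
}
```

With a range that is large relative to num, distinctAfterAttempts will almost always return num, which matches the observation that the small-goal, large-range case works fine. As the range shrinks toward num, the gap between attempts and distinct row ids grows, and a reader expecting num rows would come up short.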



--
This message was sent by Atlassian JIRA
(v6.1#6144)
