drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "B Anil Kumar (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-4173) Query did not return all documents if collection using a hashed shard key
Date Mon, 04 Jan 2016 11:56:39 GMT

    [ https://issues.apache.org/jira/browse/DRILL-4173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15081057#comment-15081057

B Anil Kumar commented on DRILL-4173:

[~ITDonald] Thanks for reporting this issue.

We are also able to reproduce issue, the root cause is, in sharded env, we construct a mapping
b/w chunk and Drillbit and assigned Drillbit will read the chunk based on minKey and maxKey

In the case of hashed sharding the minKey and maxKey is not matching with actual shardKey
values. That's why there is a data loss through Drill.

We are looking into this issue.

> Query did not return all documents if collection using a hashed shard key
> -------------------------------------------------------------------------
>                 Key: DRILL-4173
>                 URL: https://issues.apache.org/jira/browse/DRILL-4173
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - MongoDB
>    Affects Versions: 1.3.0
>         Environment: Windows 2012
>            Reporter: Yuqing Tang
>            Assignee: Jason Altekruse
> MongoDB 3.0.6
> If a collection used a hashed shard key ({ "shardkey": "hashed" }), queries like "select
* from ..." may not return all documents which should be returned from the collection.
> Test Case:
> Create 3 mongos, 3 config, 3 replicaset each with 3 mongod.
> Create one collection with hashed shard key.
> Insert 6 documents to this collection with shard key value 1,2,3,4,5,6
> Do a query select * from ....
> Only 2,3,4 will be returned.

This message was sent by Atlassian JIRA

View raw message