Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 5F6AA200C4C for ; Tue, 4 Apr 2017 11:42:48 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 5DFD7160B81; Tue, 4 Apr 2017 09:42:48 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 9BEEC160BA5 for ; Tue, 4 Apr 2017 11:42:47 +0200 (CEST) Received: (qmail 52725 invoked by uid 500); 4 Apr 2017 09:42:46 -0000 Mailing-List: contact issues-help@drill.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.apache.org Delivered-To: mailing list issues@drill.apache.org Received: (qmail 52654 invoked by uid 99); 4 Apr 2017 09:42:46 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Apr 2017 09:42:46 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 48ED018063D for ; Tue, 4 Apr 2017 09:42:46 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -100.002 X-Spam-Level: X-Spam-Status: No, score=-100.002 tagged_above=-999 required=6.31 tests=[RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id Dlos2_-J5weP for ; Tue, 4 Apr 2017 09:42:45 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 908AF5FC90 for ; Tue, 4 Apr 2017 09:42:44 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id E40B8E0D22 for ; Tue, 4 Apr 2017 09:42:43 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 37C2D24021 for ; Tue, 4 Apr 2017 09:42:42 +0000 (UTC) Date: Tue, 4 Apr 2017 09:42:42 +0000 (UTC) From: =?utf-8?Q?Lucas_Alvarez_Arg=C3=BCero_=28JIRA=29?= To: issues@drill.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (DRILL-5411) Getting 0 rows when there are more than 100000 in the mongoDB collection MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 04 Apr 2017 09:42:48 -0000 Lucas Alvarez Arg=C3=BCero created DRILL-5411: -------------------------------------------- Summary: Getting 0 rows when there are more than 100000 in the= mongoDB collection Key: DRILL-5411 URL: https://issues.apache.org/jira/browse/DRILL-5411 Project: Apache Drill Issue Type: Bug Components: Storage - MongoDB Affects Versions: 1.10.0 Environment: VM1("ubuntu/trusty64"): mongo1 =E2=80=A2=09mongoS (mongo server) =E2=80=A2=09MongoD shard1 (Primary, secondary,secondary) =E2=80=A2=09Mongo config server=20 =E2=80=A2=09Drillbit VM2("ubuntu/trusty64"): mongo2 =E2=80=A2=09MongoD shard2 (Primary, secondary,secondary) =E2=80=A2=09Mongo config server=20 =E2=80=A2=09Drillbit VM3("ubuntu/trusty64"): mongo3 =E2=80=A2=09MongoD shard3 (Primary, secondary,secondary) =E2=80=A2=09Mongo config server=20 =E2=80=A2=09Drillbit VM4("ubuntu/trusty64"): zk1 =E2=80=A2=09Zookeeper in quorum VM5("ubuntu/trusty64"): zk2 =E2=80=A2=09Zookeeper in quorum VM6("ubuntu/trusty64"): zk3 =E2=80=A2=09Zookeeper in quorum Reporter: Lucas Alvarez Arg=C3=BCero Getting 0 rows when there are more than 100000 in the mongoDB collection Drills works perfectly when I am using mongo as storage when there are less= than 100000(aprox) documents in the collection (partitioned) but when ther= e are more documents, drill return zero rows but still can count all docume= nts (but it can=E2=80=99t count documents using where). Less than 100000: select v.measInfo_id,v.endTime from mongo.mandarinaTime3.MeasValue v lim= it 3;=20 +--------------+-------------+ | measInfo_id | endTime | +--------------+-------------+ | [B@1a7d4b45 | 2016-09-19 | | [B@17d8ac99 | 2016-09-19 | | [B@122b7d0a | 2016-09-19 | +--------------+-------------+ 3 rows selected (0.313 seconds) More than 100000: 0: jdbc:drill:> select v.measInfo_id,v.endTime from mongo.mandarinaTime3.Me= asValue v limit 3; = =20 +--------------+----------+ | measInfo_id | endTime | +--------------+----------+ +--------------+----------+ No rows selected (0.341 seconds) 0: jdbc:drill:> select count() from mongo.mandarinaTime3.MeasValue v ; = =20 +---------+ | EXPR$0 | +---------+ | 502068 | +---------+ 1 row selected (0.426 seconds) 0: jdbc:drill:> select count() from mongo.mandarinaTime3.MeasValue v Whe= re endtime=3D'2016-09-19'; +---------+ | EXPR$0 | +---------+ | 0 | +---------+ 1 row selected (0.98 seconds) If the collection isn=E2=80=99t partitioned, drill also works perfectly drill mongo plugin: { "type": "mongo", "connection": "mongodb://mongo1:27017/", "enabled": true } mongo sharded collection: { "_id" : "mandarinaTime3", "primary" : "b", "partitioned" : true } mandarinaTime3.MeasCollecFile shard key: { "_id" : 1 } unique: false balancing: true chunks: b 1 { "_id" : { "$minKey" : 1 } } -->> { "_id" : { "$ma= xKey" : 1 } } on : b Timestamp(1, 0)=20 mandarinaTime3.MeasInfo shard key: { "_id" : 1 } unique: false balancing: true chunks: a 1 b 1 c 1 { "_id" : { "$minKey" : 1 } } -->> { "_id" : Object= Id("58e364dddc7a033f5c08c7c6") } on : a Timestamp(2, 0)=20 { "_id" : ObjectId("58e364dddc7a033f5c08c7c6") } --= >> { "_id" : ObjectId("58e364e0dc7a033f5c08c8b0") } on : c Timestamp(3, 0)= =20 { "_id" : ObjectId("58e364e0dc7a033f5c08c8b0") } --= >> { "_id" : { "$maxKey" : 1 } } on : b Timestamp(3, 1)=20 mandarinaTime3.MeasValue shard key: { "_id" : 1 } unique: false balancing: true chunks: a 7 b 7 c 7 too many chunks to print, use verbose if you want t= o force print -- This message was sent by Atlassian JIRA (v6.3.15#6346)