bahir-reviews mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From emlaver <...@git.apache.org>
Subject [GitHub] bahir pull request #41: [BAHIR-102] Initial support of Cloudant Query with p...
Date Mon, 10 Apr 2017 16:52:49 GMT
Github user emlaver commented on a diff in the pull request:

    https://github.com/apache/bahir/pull/41#discussion_r110706772
  
    --- Diff: sql-cloudant/README.md ---
    @@ -62,6 +62,8 @@ cloudant.protocol|https|protocol to use to transfer data: http or https
     cloudant.host||cloudant host url
     cloudant.username||cloudant userid
     cloudant.password||cloudant password
    +cloudant.useQuery|false|When enabled, for query not using index or view, _find will be
used instead of _all_docs, some query predicates will be driven into datastore. However, RDD
partition is ONE during _find, so parallel loading is not achieved
    --- End diff --
    
    Ok - I'm still having a hard time understanding the statement `some of the query predicates
will be driven into datastore`.  What do you mean by driven? Does it have to do with `partitions
used to drive JsonStoreRDD loading query result in parallel` ?
    Also, `As RDD partition is ONE during _find` - one what? set to one partition for the
`jsonstore.rdd.partitions` configuration? 
    And for `parallel loading is not achieved` - This sounds awkward to me.  Is it not achieved
because the `_find` endpoint only supports one partition? 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

Mime
View raw message