hbase-issues mailing list archives

From "Ted Malaska (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-14789) Provide an alternative spark-hbase connector
Date Wed, 11 Nov 2015 21:06:11 GMT

    [ https://issues.apache.org/jira/browse/HBASE-14789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001083#comment-15001083 ]

Ted Malaska commented on HBASE-14789:

Hey Zhan,

I'm not sure I understand the question.

What I'm thinking is the changes you are asking for should fit nicely into the existing code.

And we can use the sub jiras to discuss the implementation of each.  For example, with the Scan
implementation I would like to ask whether that functionality could be added to TableInputFormat,
because it could be of value to more than just SparkSQL and because we can consolidate code.
For the BulkGet implementation I would like to see some performance tests to make sure we
are not introducing latency, and to discuss whether we should use the existing BulkGet functionality
in HBase-Spark, because we might want to execute the gets in more than one task.

But let's have these discussions in the sub jiras, since they are completely different components
that are not dependent on each other.


> Provide an alternative spark-hbase connector
> --------------------------------------------
>                 Key: HBASE-14789
>                 URL: https://issues.apache.org/jira/browse/HBASE-14789
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Zhan Zhang
>            Assignee: Zhan Zhang
>         Attachments: shc.pdf
> This JIRA is to provide users an option to choose a different Spark-HBase implementation
> based on requirements.
