hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yi Liang (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HBASE-17905) [hbase-spark] bulkload does not work when table not exist
Date Tue, 11 Apr 2017 22:39:42 GMT

     [ https://issues.apache.org/jira/browse/HBASE-17905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Yi Liang updated HBASE-17905:
    Attachment: HBASE-17905-V1.patch

The reason why bulkload fail when table not exist is the {code}BulkLoadPatitioner#numPartitions
{code} will return 0 if table not exist, if partition number equals 0, it means that spark
repartitionAndSortWithinPartitions(Partitioner) will return nothing, and the following transformation
will not be executed

This patch fix the errors for bulkload fail when table not exist, and also add some log information,
I wonder if we can also add a BulkLoad API that do not have tablename as parameter 

> [hbase-spark]  bulkload does not work when table not exist
> ----------------------------------------------------------
>                 Key: HBASE-17905
>                 URL: https://issues.apache.org/jira/browse/HBASE-17905
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Yi Liang
>            Assignee: Yi Liang
>         Attachments: HBASE-17905-V1.patch
> when using HBase-Spark bulkload api, an argument of tablename is needed, the bulkload
can run successfully only if  table exist in HBase.  If table not exist, the bulkload can
not run successfully and it even do not report any errors or throw exception. 

This message was sent by Atlassian JIRA

View raw message