hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jan Lukavsky (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-12321) Delete#deleteColumn seems not to work with bulkload
Date Wed, 22 Oct 2014 14:14:34 GMT

    [ https://issues.apache.org/jira/browse/HBASE-12321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14179936#comment-14179936

Jan Lukavsky commented on HBASE-12321:

Basically, I want to delete the lastest version of the column. Don't get me wrong, I *know*
that the usage is wrong on the client side (correct usage is to use {{Delete#deleteColumns}}).
What I see as a problem is that everything seems to be working just fine, except for the fact,
that no data gets deleted. The combination of KeyValue.Type.Delete, HConstants.LATEST_TIMESTAMP
and bulk load is IMHO wrong in all cases and the client should be notfied about it.

> Delete#deleteColumn seems not to work with bulkload
> ---------------------------------------------------
>                 Key: HBASE-12321
>                 URL: https://issues.apache.org/jira/browse/HBASE-12321
>             Project: HBase
>          Issue Type: Bug
>          Components: Deletes, HFile, mapreduce
>    Affects Versions: 0.94.6
>            Reporter: Jan Lukavsky
>            Priority: Minor
> When using call to {{Delete#deleteColumn(byte[], byte[])}} to produce KeyValues that
are subsequently written to HFileOutputFormat and bulk loaded into HBase, the Delete seems
to be ignored. The reason for this is likely to be the missing (HConstants.LATEST_TIMESTAMP)
timestamp in the KeyValue with type {{KeyValue.Type.Delete}}. I think the RegionServer than
cannot delete the contents of the column due to mismatch in the timestamp.
> When using {{Delete#deleteColumns}} everything works fine, because of different type

This message was sent by Atlassian JIRA

View raw message