hbase-issues mailing list archives

From "Hadoop QA (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-8753) Provide new delete flag which can delete all cells under a column-family which have a same designated timestamp
Date Mon, 08 Jul 2013 14:45:49 GMT

    [ https://issues.apache.org/jira/browse/HBASE-8753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13702036#comment-13702036 ]

Hadoop QA commented on HBASE-8753:
----------------------------------

{color:red}-1 overall{color}.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12591210/8753-trunk-v4.txt
  against trunk revision .

    {color:green}+1 @author{color}.  The patch does not contain any @author tags.

    {color:green}+1 tests included{color}.  The patch appears to include 6 new or modified
tests.

    {color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 1.0 profile.

    {color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 2.0 profile.

    {color:red}-1 javadoc{color}.  The javadoc tool appears to have generated 2 warning messages.

    {color:green}+1 javac{color}.  The applied patch does not increase the total number of
javac compiler warnings.

    {color:green}+1 findbugs{color}.  The patch does not introduce any new Findbugs (version
1.3.9) warnings.

    {color:green}+1 release audit{color}.  The applied patch does not increase the total number
of release audit warnings.

    {color:red}-1 lineLengths{color}.  The patch introduces lines longer than 100

    {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

    {color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//testReport/
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/6238//console

This message is automatically generated.
                
> Provide new delete flag which can delete all cells under a column-family which have a
same designated timestamp
> ---------------------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-8753
>                 URL: https://issues.apache.org/jira/browse/HBASE-8753
>             Project: HBase
>          Issue Type: New Feature
>          Components: Deletes, Scanners
>    Affects Versions: 0.95.1
>            Reporter: Feng Honghua
>            Assignee: Feng Honghua
>         Attachments: 8753-trunk-V2.patch, 8753-trunk-v4.txt, HBASE-8753-0.94-V0.patch,
HBASE-8753-0.94-V1.patch, HBASE-8753-trunk-V0.patch, HBASE-8753-trunk-V1.patch, HBASE-8753-trunk-V3.patch
>
>
> In one of our production scenarios (Xiaomi message search), multiple cells are put in a
batch under a specific column family, all with the same timestamp but different column names.
> After some time, these cells also need to be deleted in a batch, given that timestamp.
But the column names are parsed tokens, which can be arbitrary words, so such a batch delete
is impossible without first retrieving all KVs from that CF, collecting the list of columns
that have a KV with the given timestamp, and then issuing an individual deleteColumn for
each column in that list.
> Though such a batch delete is possible, its performance is poor, and customers also find
their code clumsy, because it must first retrieve and populate the column list and then
issue a deleteColumn for each column in it.
> This feature resolves the problem by introducing a new delete flag: DeleteFamilyVersion.
>   1). To delete all KVs under a column family with a given timestamp, just call
Delete.deleteFamilyVersion(cfName, timestamp); only a single KV of type DeleteFamilyVersion
is put to HBase (as with DeleteFamily / DeleteColumn / Delete), with no read operation;
>   2). Like other delete types, DeleteFamilyVersion takes effect in get/scan/flush/compact
operations: the ScanDeleteTracker now parses out DeleteFamilyVersion and uses it to prevent
any KV under the specific CF that has the same timestamp as the DeleteFamilyVersion KV from
showing up in a get/scan result (and likewise in flush/compact).
> Our customers find this feature efficient, clean, and easy to use, since it does its work
without needing to know the exact list of column names to be deleted.
> This feature has been running smoothly for a couple of months in our production clusters.
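
To make the difference between the two approaches in the description concrete, here is a
minimal, self-contained sketch. It is a toy in-memory model of one row in one column family,
not HBase code and not the patch itself: the class FamilyVersionDeleteSketch and its methods
(put, deleteColumnVersion, deleteFamilyVersion, scan, deleteByScanning, sampleRow) are all
hypothetical names chosen for illustration. It assumes only what the description states:
the workaround has to read the cells to learn the column names and then write one delete
per column, while a family-version marker is a single write that hides every cell in the
family with the matching timestamp at read time.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Toy model of a single row in one column family. Illustrative only; not HBase code. */
public class FamilyVersionDeleteSketch {

    /** One stored cell: column qualifier, timestamp, value. */
    static final class Cell {
        final String qualifier;
        final long timestamp;
        final String value;

        Cell(String qualifier, long timestamp, String value) {
            this.qualifier = qualifier;
            this.timestamp = timestamp;
            this.value = value;
        }
    }

    private final List<Cell> cells = new ArrayList<>();
    // Markers that each hide one (qualifier, timestamp) pair.
    private final Set<String> columnVersionMarkers = new HashSet<>();
    // Markers that each hide every cell in the family carrying a given timestamp.
    private final Set<Long> familyVersionMarkers = new HashSet<>();

    void put(String qualifier, long timestamp, String value) {
        cells.add(new Cell(qualifier, timestamp, value));
    }

    /** Per-column marker: the caller must already know the qualifier. */
    void deleteColumnVersion(String qualifier, long timestamp) {
        columnVersionMarkers.add(qualifier + "@" + timestamp);
    }

    /** Family-wide marker: a single write, no qualifier and no prior read needed. */
    void deleteFamilyVersion(long timestamp) {
        familyVersionMarkers.add(timestamp);
    }

    /** Read path: cells covered by either kind of marker are filtered out. */
    List<Cell> scan() {
        List<Cell> visible = new ArrayList<>();
        for (Cell c : cells) {
            boolean masked = familyVersionMarkers.contains(c.timestamp)
                    || columnVersionMarkers.contains(c.qualifier + "@" + c.timestamp);
            if (!masked) {
                visible.add(c);
            }
        }
        return visible;
    }

    /** The workaround: read first, then write one marker per matching column. */
    int deleteByScanning(long timestamp) {
        int markersWritten = 0;
        for (Cell c : scan()) {
            if (c.timestamp == timestamp) {
                deleteColumnVersion(c.qualifier, timestamp);
                markersWritten++;
            }
        }
        return markersWritten;
    }

    /** Three token columns written at timestamp 100, one at timestamp 200. */
    static FamilyVersionDeleteSketch sampleRow() {
        FamilyVersionDeleteSketch row = new FamilyVersionDeleteSketch();
        row.put("hello", 100L, "doc1");
        row.put("world", 100L, "doc1");
        row.put("hbase", 100L, "doc1");
        row.put("hello", 200L, "doc2");
        return row;
    }

    public static void main(String[] args) {
        FamilyVersionDeleteSketch viaScan = sampleRow();
        System.out.println("markers written by workaround: " + viaScan.deleteByScanning(100L));
        System.out.println("cells visible after workaround: " + viaScan.scan().size());

        FamilyVersionDeleteSketch viaFlag = sampleRow();
        viaFlag.deleteFamilyVersion(100L);
        System.out.println("cells visible after one family-version marker: " + viaFlag.scan().size());
    }
}
```

In this toy model both paths leave the same cell visible (the one at timestamp 200), but the
workaround needs a read plus one marker per column, whereas the family-version path writes
one marker without reading anything. How the real ScanDeleteTracker tracks these markers is
not modeled here.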

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
