hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jayesh (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-6469) skipTrash option in hive command line
Date Tue, 22 Apr 2014 02:11:15 GMT

    [ https://issues.apache.org/jira/browse/HIVE-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13976297#comment-13976297
] 

Jayesh commented on HIVE-6469:
------------------------------

Xuefu,

This is really miner convenient feature which definitely has a use-case for our enterprise
customer.
are you suggesting providing this feature via hive configuration that works in following way
?

set hive.warehouse.data.skipTrash = true	        -- explicitly set
drop table large10TBTable							-- this will skip trash
drop table anyOtherTable							-- this will skip trash
set hive.warehouse.data.skipTrash = false      		-- if you forget this, it will skipTrash
forever, until corrected.
drop table regularTable                           	-- this will start placing data in trash

I believe that approach is not very intuitive and will lead to human error that creates disaster
if necessary steps are not done, which ultimately violates hive feature of providing trash
as backup.  

Also, different environment with different HS2 instance may not be the scenario here. This
has proven to be very helpful on same environment by different users.

Also, I dont think this pollute SQL Syntax, think of this as PURGE option in Oracle DB and
hence I totally see use it being used by enterprise customer.
http://docs.oracle.com/cd/B19306_01/server.102/b14200/statements_9003.htm

Did you get a chance to look at the links I put earlier, where people seen to be searching
for this little convenient feature ?
Also did you get a chance to talk to any customers who would like such feature? Please let
us know.

Thanks
Jayesh

> skipTrash option in hive command line
> -------------------------------------
>
>                 Key: HIVE-6469
>                 URL: https://issues.apache.org/jira/browse/HIVE-6469
>             Project: Hive
>          Issue Type: New Feature
>          Components: CLI
>    Affects Versions: 0.12.0
>            Reporter: Jayesh
>             Fix For: 0.12.1
>
>         Attachments: HIVE-6469.patch
>
>
> hive drop table command deletes the data from HDFS warehouse and puts it into Trash.
> Currently there is no way to provide flag to tell warehouse to skip trash while deleting
table data.
> This ticket is to add skipTrash feature in hive command-line, that looks as following.

> hive -e "drop table skipTrash testTable"
> This would be good feature to add, so that user can specify when not to put data into
trash directory and thus not to fill hdfs space instead of relying on trash interval and policy
configuration to take care of disk filling issue.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message