hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ravi Prakash (JIRA)" <>
Subject [jira] [Commented] (HIVE-6469) skipTrash option in hive command line
Date Tue, 20 May 2014 20:18:42 GMT


Ravi Prakash commented on HIVE-6469:


The patch as it has gone in, doesn't enable our use case that I am reiterating here:
bq. The use case that is being targeted here is that a user may on 1 instance choose to drop
a (possibly big) table without sending it to Trash to avoid filling up her/his quota. We believe
that the default Hive behavior of sending to Trash should be maintained (to prevent accidental
data loss).
This is is because the environment variable is not being communicated from the client to the

I can sympathize with Xuefu's concern to not pollute SQL syntax. Hence I am going to open
a new JIRA for providing that functionality without extending the SQL syntax.

> skipTrash option in hive command line
> -------------------------------------
>                 Key: HIVE-6469
>                 URL:
>             Project: Hive
>          Issue Type: New Feature
>          Components: CLI
>    Affects Versions: 0.12.0
>            Reporter: Jayesh
>            Assignee: Jayesh
>             Fix For: 0.14.0
>         Attachments: HIVE-6469.1.patch, HIVE-6469.2.patch, HIVE-6469.3.patch, HIVE-6469.patch
> hive drop table command deletes the data from HDFS warehouse and puts it into Trash.
> Currently there is no way to provide flag to tell warehouse to skip trash while deleting
table data.
> This ticket is to add skipTrash feature in hive command-line, that looks as following.

> hive -e "drop table skipTrash testTable"
> This would be good feature to add, so that user can specify when not to put data into
trash directory and thus not to fill hdfs space instead of relying on trash interval and policy
configuration to take care of disk filling issue.

This message was sent by Atlassian JIRA

View raw message