hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Deepak Jaiswal (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (HIVE-18392) load data should rename files consistent with insert statements (non bucketed tables only)
Date Sat, 06 Jan 2018 23:22:03 GMT

     [ https://issues.apache.org/jira/browse/HIVE-18392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Deepak Jaiswal reassigned HIVE-18392:
-------------------------------------


> load data should rename files consistent with insert statements (non bucketed tables
only)
> ------------------------------------------------------------------------------------------
>
>                 Key: HIVE-18392
>                 URL: https://issues.apache.org/jira/browse/HIVE-18392
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Deepak Jaiswal
>            Assignee: Deepak Jaiswal
>
> Insert statements create files of format ending with 0000_0, 0001_0 etc. However, the
load data uses the input file name. That results in inconsistent naming convention which makes
SMB joins difficult in some scenarios and may cause trouble for other types of queries in
future.
> We need consistent naming convention.
> For non-bucketed table, hive renames all the files regardless of how they were named
by the user.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message