hive-dev mailing list archives

Subject [jira] [Created] (HIVE-18603) Use Hash For Partition HDFS File Path
Date Thu, 01 Feb 2018 14:23:00 GMT
BELUGA BEHR created HIVE-18603:

             Summary: Use Hash For Partition HDFS File Path
                 Key: HIVE-18603
             Project: Hive
          Issue Type: Improvement
          Components: HiveServer2
    Affects Versions: 2.3.0, 1.2.0, 3.0.0, 2.4.0
            Reporter: BELUGA BEHR

Currently, for partitioned tables, Hive uses the literal partition values in the HDFS
file path.  Instead, perhaps we can use a hash of each value so that:

 # The partition values are obscured to a casual observer in HDFS
 # There is no chance of a very long HDFS file name when faced with a very long partition value
 # There is no need to worry about special characters in the partition path name, as the
hash value would contain only hexadecimal characters.


The suggestion here is that we retain the partition values, just as is done now, but the default
HDFS location for each partition will use the hash of the value instead of the value itself.
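
To make the suggestion concrete, here is a minimal sketch in Python of how a default partition path might be derived. It is not Hive code. The function names, the choice of SHA-256, and the decision to keep the partition key name in clear text while hashing only the value are all assumptions made for illustration. The literal variant also ignores any escaping Hive applies to special characters today.

```python
import hashlib


def literal_partition_path(spec):
    """Today's style of layout: key=value for each partition column.

    `spec` is an ordered list of (key, value) string pairs. Escaping of
    special characters is deliberately left out of this sketch.
    """
    return "/".join(f"{key}={value}" for key, value in spec)


def hashed_partition_path(spec):
    """Hypothetical layout: key=<hex digest of value>.

    SHA-256 is an assumption; the proposal does not name a hash function.
    Every component has a fixed length and contains only [0-9a-f] after
    the '=' sign, regardless of what the original value looked like.
    """
    parts = []
    for key, value in spec:
        digest = hashlib.sha256(value.encode("utf-8")).hexdigest()
        parts.append(f"{key}={digest}")
    return "/".join(parts)


if __name__ == "__main__":
    spec = [("country", "US/CA: north & south"), ("dt", "2018-02-01")]
    print(literal_partition_path(spec))
    print(hashed_partition_path(spec))
```

In a scheme like this the original values could no longer be read back from the path, so they would have to be retained elsewhere, as the proposal already assumes.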

This message was sent by Atlassian JIRA