hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Raghunath, Ranjith" <>
Subject bucketing in hive
Date Thu, 15 Dec 2011 02:23:29 GMT
Can one use bucketing in hive to emulate hash partitions on a database? Is there also a way
to segment data into buckets dynamically based on values in the column. For example,

Col1                       Col2
Apple                    1
Orange                 2
Apple                    2
Banana                 1

If the file above were inserted into a table with Col1 as the bucket column, can we dynamically
allow all of the rows with “Apple” in one file and “Orange” in one file and so on.
Is there a way to do this without specifying the bucket size to be 3.
Thank you,
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message