hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Koert Kuipers <>
Subject remove duplicates based on one (or a few) columns
Date Wed, 14 Sep 2011 21:10:35 GMT
what is the easiest way to remove rows which are considered duplicates based
upon a few columns in the rows?
so "create table deduped as select distinct * from table" won't do...

View raw message