hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mohit Anchlia <mohitanch...@gmail.com>
Subject Dealing with changing file format
Date Mon, 02 Jul 2012 21:09:56 GMT
I am wondering what's the right way to go about designing reading input and
output where file format may change over period. For instance we might
start with "field1,field2,field3" but at some point we add new field4 in
the input. What's the best way to deal with such scenarios? Keep a catalog
of changes that timestamped?

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message