hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From John Lilley <john.lil...@redpoint.net>
Subject typical JSON data sets
Date Tue, 02 Jul 2013 16:04:30 GMT
I would like to hear your experiences working with large JSON data sets, specifically:

1)      How large is each JSON document?

2)      Do they tend to be a single JSON doc per file, or multiples per file?

3)      Do the JSON schemas change over time?

4)      Are there interesting public data sets you would recommend for experiment?
Thanks
John


Mime
View raw message