flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stephan Ewen <se...@apache.org>
Subject Re: JSON data source for Flink Job
Date Thu, 28 May 2015 10:39:23 GMT

This depends a bit how the JSON is formatted.

If you want the source to be parallelizable, you need to have a way of
splitting the file at object boundaries. Is there a character on which you
can split? If yes, you can use theTextInputFormat (with a custom line break
character), take the strings and parse them to JSON with your favorite
library (like Jackson or so).


On Thu, May 28, 2015 at 12:24 PM, Tamara Mendt <tammymendt@gmail.com> wrote:

> Hello,
> I have a JSON file containing multiple JSON objects and wish to use this
> as a data source for a Flink Job.
> What is the best way to do this?
> Cheers,
> Tamara

View raw message