Mailing-List: contact issues-help@spark.apache.org; run by ezmlm
Precedence: bulk
Date: Tue, 8 Nov 2016 07:08:58 +0000 (UTC)
From: "Reynold Xin (JIRA)" <jira@apache.org>
To: issues@spark.apache.org
Message-ID: <JIRA.13019124.1478588881000.218668.1478588938321@Atlassian.JIRA>
In-Reply-To: <JIRA.13019124.1478588881000@Atlassian.JIRA>
References: <JIRA.13019124.1478588881000@Atlassian.JIRA> <JIRA.13019124.1478588881475@arcas>
Subject: [jira] [Created] (SPARK-18352) Parse normal JSON files (not just
 JSON Lines)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
archived-at: Tue, 08 Nov 2016 07:09:00 -0000

Reynold Xin created SPARK-18352:
-----------------------------------

             Summary: Parse normal JSON files (not just JSON Lines)
                 Key: SPARK-18352
                 URL: https://issues.apache.org/jira/browse/SPARK-18352
             Project: Spark
          Issue Type: New Feature
          Components: SQL
            Reporter: Reynold Xin


Spark can currently parse only JSON files in the JSON Lines format, i.e. each record occupies exactly one line and records are separated by newlines. In reality, a lot of users want to use Spark to parse regular JSON files, and are surprised to learn that it can't.

We can introduce a new mode (wholeJsonFile?) in which we don't split the files, but instead stream through each file as a whole to parse it.
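
To illustrate the difference between the two input shapes, here is a minimal sketch in plain Python (standard library only, not Spark code). The function names parse_json_lines and parse_whole_document and the sample data are hypothetical and exist only for this example. Note that the whole-document function loads the entire text at once, whereas the proposal above is to stream through the file.

```python
import json


def parse_json_lines(text):
    """JSON Lines: every non-empty line is a complete record.

    Because each line stands alone, the input can be split at
    newline boundaries and the pieces parsed independently.
    """
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def parse_whole_document(text):
    """Regular JSON: the text must be parsed as one unit.

    A top-level array yields one record per element; any other
    top-level value is treated as a single record.
    """
    doc = json.loads(text)
    return doc if isinstance(doc, list) else [doc]


# The same two records in both shapes (hypothetical sample data).
json_lines = '{"id": 1}\n{"id": 2}\n'
pretty = '[\n  {"id": 1},\n  {"id": 2}\n]\n'

records_from_lines = parse_json_lines(json_lines)
records_from_document = parse_whole_document(pretty)
```

Feeding the pretty-printed text to parse_json_lines fails with json.JSONDecodeError, because its first line ("[") is not a complete JSON value on its own. That is the situation described above: a file that is valid JSON but cannot be read line by line.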


--
This message was sent by Atlassian JIRA
(v6.3.4#6332)