Return-Path: X-Original-To: apmail-hawq-dev-archive@minotaur.apache.org Delivered-To: apmail-hawq-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A2B7E18E95 for ; Mon, 1 Feb 2016 17:31:49 +0000 (UTC) Received: (qmail 76166 invoked by uid 500); 1 Feb 2016 17:31:43 -0000 Delivered-To: apmail-hawq-dev-archive@hawq.apache.org Received: (qmail 76111 invoked by uid 500); 1 Feb 2016 17:31:43 -0000 Mailing-List: contact dev-help@hawq.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hawq.incubator.apache.org Delivered-To: mailing list dev@hawq.incubator.apache.org Received: (qmail 76099 invoked by uid 99); 1 Feb 2016 17:31:43 -0000 Received: from Unknown (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Feb 2016 17:31:43 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id B2A4118049D for ; Mon, 1 Feb 2016 17:31:42 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.427 X-Spam-Level: X-Spam-Status: No, score=0.427 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-0.554, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id CEGGSU-vDT2L for ; Mon, 1 Feb 2016 17:31:38 +0000 (UTC) Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with SMTP id 042CF31AE4 for ; Mon, 1 Feb 2016 17:31:33 +0000 (UTC) Received: (qmail 75813 invoked by uid 99); 1 Feb 2016 17:31:33 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Feb 2016 17:31:33 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id 1DF10DFE61; Mon, 1 Feb 2016 17:31:33 +0000 (UTC) From: adamjshook To: dev@hawq.incubator.apache.org Reply-To: dev@hawq.incubator.apache.org References: In-Reply-To: Subject: [GitHub] incubator-hawq pull request: HAWQ-178: Add JSON plugin support in ... Content-Type: text/plain Message-Id: <20160201173133.1DF10DFE61@git1-us-west.apache.org> Date: Mon, 1 Feb 2016 17:31:33 +0000 (UTC) Github user adamjshook commented on the pull request: https://github.com/apache/incubator-hawq/pull/302#issuecomment-178085903 Good to hear from you as well Christian :) I think you're right on the `identifier `as the means to have an end-tag and new-tag. It's slowly coming back. The `StreamReader `seeks until it finds the identifier, then flushes the `StringBuilder` and appends characters until the final end brace is met, accounting for any nested begin brace/end brace combinations. The big problem that happens is if the identifier is embedded as a string inside a JSON record, it would create an incomplete record. If your identifier was `record` in the below example where the split is the horizontal line, bad stuff would happen. This is why the expectation was one file = one split. ``` record : { -------------------------- text : "this is a record" } ``` --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastructure@apache.org or file a JIRA ticket with INFRA. ---