drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From ssriniva123 <...@git.apache.org>
Subject [GitHub] drill issue #518: DRILL-4653.json - Malformed JSON should not stop the entir...
Date Tue, 27 Sep 2016 06:34:47 GMT
Github user ssriniva123 commented on the issue:

    https://github.com/apache/drill/pull/518
  
    Paul,
    The code you have listed is semantically equivalent to that of what I already I have submitted
for pull and will not solve handling of all malformed json records. Also the code for reporting
the 
    error records is working correctly as long as is it is reported by the Parser correctly.
    
    As I explained earlier the JSON parser is not just a simple tokenizer, it keeps track
of internal state,
    hence the issue. SERDE's in hive etc work because they  are record oriented with clean
record demarkations using a new line.
    
    One solution is to submit a patch to jackson parser to expose a method to skip to new
line in the
    event of a parsing exception. This can be parametrized so that behavior can customized.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

Mime
View raw message