hive-dev mailing list archives

From "Paul Yang (JIRA)" <>
Subject [jira] Commented: (HIVE-1575) get_json_object does not support JSON array at the root level
Date Tue, 26 Oct 2010 17:50:21 GMT


Paul Yang commented on HIVE-1575:

It's hard to say without measuring, but my guess is that a regex will be slower than those
string operations. The same goes for the cache. It would be good to compare the performance
before and after these changes. Do you have a dataset that you could use to test?
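
A before/after comparison of the kind suggested above could be sketched as follows. The
patch itself is not shown in this message, so the two tokenizers here are hypothetical
stand-ins (one regex-based, one using plain string operations) for a simplified path
syntax such as "$.store.book[0].title". This is Python rather than the Java of UDFJson,
so the timings say nothing about Hive itself; a real test would run the UDF before and
after the patch on an actual dataset.

```python
import re
import timeit

# Hypothetical, simplified path syntax: "$", then ".name" or "[index]" steps.
_STEP_RE = re.compile(r"\.([^.\[\]]+)|\[(\d+)\]")


def split_path_regex(path):
    """Tokenize with a regex: names become str, array indexes become int."""
    tokens = []
    for name, index in _STEP_RE.findall(path[1:]):  # skip the leading "$"
        tokens.append(name if name else int(index))
    return tokens


def split_path_strings(path):
    """Tokenize the same syntax using only split() and partition()."""
    tokens = []
    for part in path[1:].split("."):
        if not part:
            continue
        name, bracket, rest = part.partition("[")
        if name:
            tokens.append(name)
        while bracket:  # consume every "[n]" that follows the name
            index, _, rest = rest.partition("]")
            tokens.append(int(index))
            _, bracket, rest = rest.partition("[")
    return tokens


def compare(paths, repeat=5, number=2000):
    """Check both tokenizers agree, then return the best time for each."""
    for path in paths:
        assert split_path_regex(path) == split_path_strings(path), path
    results = {}
    for fn in (split_path_regex, split_path_strings):
        timings = timeit.repeat(
            lambda fn=fn: [fn(p) for p in paths], repeat=repeat, number=number
        )
        results[fn.__name__] = min(timings)
    return results


if __name__ == "__main__":
    sample = ["$.store.book[0].title", "$[1].a", "$.a[0][2]"]
    for name, seconds in compare(sample).items():
        print(f"{name}: {seconds:.4f}s")
```

Checking that both variants return identical tokens before timing them keeps the
comparison honest: a faster version that changes results is not an improvement.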

> get_json_object does not support JSON array at the root level
> -------------------------------------------------------------
>                 Key: HIVE-1575
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: UDF
>    Affects Versions: 0.7.0
>            Reporter: Steven Wong
>            Assignee: Mike Lewis
>         Attachments: 0001-Updated-UDFJson-to-allow-arrays-as-a-root-object.patch
> Currently, get_json_object(json_txt, path) always returns null if json_txt is not a JSON
> object (e.g. is a JSON array) at the root level.
> I have a table column of JSON arrays at the root level, but I can't parse it because
> of that.
> get_json_object should accept any JSON value (string, number, object, array, true, false,
> null), not just object, at the root level. In other words, it should behave as if it were
> named get_json_value or simply get_json.
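
The behavior requested in the description can be illustrated with a minimal sketch. It is
not the attached patch and not Hive's implementation: get_json below is a hypothetical
Python function with a simplified path syntax ("$", then ".name" or "[index]" steps), and
it returns parsed Python values rather than JSON strings. Note that a root-level JSON null
is indistinguishable here from a failed lookup, since both yield None.

```python
import json
import re

# Hypothetical, simplified path syntax (an assumption, not Hive's grammar).
_PATH_RE = re.compile(r"\$(?:\.[^.\[\]]+|\[\d+\])*")
_STEP_RE = re.compile(r"\.([^.\[\]]+)|\[(\d+)\]")


def get_json(json_txt, path):
    """Evaluate path against any root JSON value; return None on failure."""
    if not isinstance(json_txt, str) or not _PATH_RE.fullmatch(path):
        return None
    try:
        value = json.loads(json_txt)  # the root may be any JSON value
    except ValueError:
        return None
    for key, index in _STEP_RE.findall(path[1:]):
        if key:
            # ".name" step: only valid on an object that has the key
            if not isinstance(value, dict) or key not in value:
                return None
            value = value[key]
        else:
            # "[n]" step: only valid on an array that is long enough
            i = int(index)
            if not isinstance(value, list) or i >= len(value):
                return None
            value = value[i]
    return value


if __name__ == "__main__":
    print(get_json('[1, {"a": 2}]', "$[1].a"))
    print(get_json('"hi"', "$"))
```

The key point is that the root is never required to be an object: each step checks the
type of the current value, so arrays and scalars at the root work the same way as nested
ones.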

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
