hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ayazur Rehman <rehman.ayazu...@gmail.com>
Subject Re: Reading Json files in hive
Date Fri, 05 Jun 2015 15:55:05 GMT
What is the meaning of fix column...does it mean non-nested columns?
So I decided to use the UDF ...
But I am still struggling with a command to create a table .... Is it
necessary to use String datatype...even if I have integers....and how do I
specify the nested data types....like the following ....

"columnD": {"some_string":10.0}

Also, how can I run group by queries on this file...

Apologies for the silly questions...I haven't dealt with complicated things
on hive and..the latest stable release of Pig has a bug...where it can't
read array data type...


On Fri, Jun 5, 2015 at 11:33 AM, DU DU <willddy@gmail.com> wrote:

> Using Json serde requires you have fix column. In your case, you should
> consider using HIVE UDF mentioned above
>
> On Fri, Jun 5, 2015 at 11:13 AM, Wollert, Fabian <
> fabian.wollert@zalando.de> wrote:
>
>> you can either pursuit the approaches from the others (reading the whole
>> table with JSON Input Format) or you read the File with normal Input Format
>> and STRING Datatype and then apply the function i mentioned afterwards.
>> Both Approaches work, i usually use mine. I don't know though which one is
>> best for performance.
>>
>> Cheers
>> Fabian
>>
>> 2015-06-05 17:03 GMT+02:00 Ayazur Rehman <rehman.ayazur90@gmail.com>:
>>
>>> Hi Fabian,
>>>
>>> The documentation only points to how to run some queries with json
>>> format table and I could not find the documentation for creating a Json
>>> Table. Did I miss something in the documentation?
>>> I apologize if my question was not clear.
>>> Thanks
>>>
>>> On Fri, Jun 5, 2015 at 10:38 AM, Wollert, Fabian <
>>> fabian.wollert@zalando.de> wrote:
>>>
>>>> Hey Ayazur, please see here for doc's on how to read JSON in Hive:
>>>>
>>>>
>>>> https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-get_json_object
>>>>
>>>> Cheers
>>>> Fabian
>>>>
>>>> 2015-06-05 16:30 GMT+02:00 Ayazur Rehman <rehman.ayazur90@gmail.com>:
>>>>
>>>>> Hi,
>>>>>
>>>>> Can someone tell me how to read Json file in hive
>>>>> My Hive version is 0.14
>>>>>
>>>>> the schema is something like this-
>>>>>
>>>>> {"columnA": "some_string", "columnB": 227958.0, "columnC": [0.0, 0.0,
0.0, 0.0], "columnD": {"some_string":10.0}}
>>>>>
>>>>> If there is documentation about it, please let me know as well
>>>>>
>>>>> --
>>>>> Thanking You,
>>>>> Ayazur Rehman
>>>>> +1 720 917 8540
>>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> *Fabian Wollert*
>>>> Business Intelligence
>>>>
>>>>
>>>>
>>>> *POSTAL ADDRESS*
>>>> Zalando SE
>>>> 11501 Berlin
>>>>
>>>> *OFFICE*
>>>> Zalando SE
>>>> Mollstraße 1
>>>> 10178 Berlin
>>>> Germany
>>>>
>>>> Phone: +49 30 20968 1819
>>>> Fax:   +49 30 27594 693
>>>> E-Mail: fabian.wollert@zalando.de
>>>> Web: www.zalando.de
>>>> Jobs: jobs.zalando.de
>>>>
>>>> Zalando SE, Tamara-Danz-Straße 1, 10243 Berlin
>>>> Company registration: Amtsgericht Charlottenburg, HRB 158855 B
>>>> Tax ID: 29/560/00596 * VAT registration number: DE 260543043
>>>> Management Board: Robert Gentz, David Schneider, Rubin Ritter
>>>> Chairperson of the Supervisory Board: Cristina Stenbeck
>>>> Registered office: Berlinn
>>>>
>>>
>>>
>>>
>>> --
>>> Thanking You,
>>> Ayazur Rehman
>>> +1 720 917 8540
>>>
>>
>>
>>
>> --
>> *Fabian Wollert*
>> Business Intelligence
>>
>>
>>
>> *POSTAL ADDRESS*
>> Zalando SE
>> 11501 Berlin
>>
>> *OFFICE*
>> Zalando SE
>> Mollstraße 1
>> 10178 Berlin
>> Germany
>>
>> Phone: +49 30 20968 1819
>> Fax:   +49 30 27594 693
>> E-Mail: fabian.wollert@zalando.de
>> Web: www.zalando.de
>> Jobs: jobs.zalando.de
>>
>> Zalando SE, Tamara-Danz-Straße 1, 10243 Berlin
>> Company registration: Amtsgericht Charlottenburg, HRB 158855 B
>> Tax ID: 29/560/00596 * VAT registration number: DE 260543043
>> Management Board: Robert Gentz, David Schneider, Rubin Ritter
>> Chairperson of the Supervisory Board: Cristina Stenbeck
>> Registered office: Berlinn
>>
>
>
>
> --
> Thanks,
> Will
>



-- 
Thanking You,
Ayazur Rehman
+1 720 917 8540

Mime
View raw message