hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "r7raul1984@163.com" <r7raul1...@163.com>
Subject Re: Re: hive 0.14 on some platform return some not NULL value as NULL
Date Thu, 02 Apr 2015 09:12:20 GMT
Sorry ,I check my production jdk is java version "1.7.0_45"   not   java version "1.6.0_35"




r7raul1984@163.com
 
From: r7raul1984@163.com
Date: 2015-04-02 17:01
To: dev
Subject: Re: Re: hive 0.14 on some platform return some not NULL value as NULL
I download full data from hdfs.  Then load data into my table. In my test enviroment. Everything
is ok.
My production is  hadoop 2.3.0-cdh 5.0.2   REDHAT 5.8   java version "1.6.0_35" 





r7raul1984@163.com
 
From: r7raul1984@163.com
Date: 2015-04-02 16:57
To: dev
Subject: Re: Re: hive 0.14 on some platform return some not NULL value as NULL
In my test enviroment . I use hive 0.14 ,hive 1.1.0  ,the result is ok.
But in production enviroment  ,the result is not correct.



r7raul1984@163.com
 
From: Thejas Nair
Date: 2015-04-02 16:41
To: r7raul1984@163.com
CC: dev
Subject: Re: Re: hive 0.14 on some platform return some not NULL value as NULL
I am unable to reproduce this issue using the sample data -
 
For this query, using 1.1.0, i get the following result-
87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM        2357378283356
91501191044440048       7326356 2015-01-19 10:44:442015-01-19
 
On Thu, Apr 2, 2015 at 12:36 AM, r7raul1984@163.com <r7raul1984@163.com> wrote:
>
> DDL is
> CREATE TABLE dw.fct_traffic_navpage_path_detl(
> date_id string,
> chanl_id bigint,
> sessn_id string,
> gu_id string,
> prov_id string,
> city_id string,
> landing_page_type_id string,
> landing_track_time string,
> landing_url string,
> nav_refer_tracker_id string,
> nav_refer_page_type_id string,
> nav_refer_page_value string,
> nav_refer_link_position string,
> nav_tracker_id string,
> nav_page_categ_id string,
> nav_page_type_id string,
> nav_page_value string,
> nav_srce_type string,
> internal_keyword string,
> internal_result_sum string,
> pltfm_id int,
> app_vers string,
> nav_link_position string,
> nav_button_position string,
> nav_track_time string,
> nav_next_tracker_id string,
> sessn_last_time string,
> sessn_pv int,
> detl_tracker_id string,
> detl_page_type_id string,
> detl_page_value string,
> detl_pm_id bigint,
> detl_link_position string,
> detl_position_track_id string,
> cart_tracker_id string,
> cart_page_type_id string,
> cart_page_value string,
> cart_link_postion string,
> cart_button_position string,
> cart_position_track_id string,
> cart_prod_id bigint,
> ordr_tracker_id string,
> ordr_page_type_id string,
> ordr_code string,
> updt_time string,
> cart_pm_id bigint,
> brand_code string,
> categ_type int,
> os string,
> end_user_id string,
> add_cart_flag string,
> navgation_page_flag int,
> nav_page_url string,
> detl_button_position string,
> manul_flag int,
> manul_track_date string,
> nav_refer_tpa string,
> nav_refer_tpa_id string,
> nav_refer_tpc string,
> nav_refer_tpi string,
> nav_refer_tcs string,
> nav_refer_tcsa string,
> nav_refer_tcdt string,
> nav_refer_tcd string,
> nav_refer_tci string,
> nav_refer_postn_type string,
> nav_tpa_id string,
> nav_tpa string,
> nav_tpc string,
> nav_tpi string,
> nav_tcs string,
> nav_tcsa string,
> nav_tcdt string,
> nav_tcd string,
> nav_tci string,
> nav_postn_type string,
> detl_tpa_id string,
> detl_tpa string,
> detl_tpc string,
> detl_tpi string,
> detl_tcs string,
> detl_tcsa string,
> detl_tcdt string,
> detl_tcd string,
> detl_tci string,
> detl_postn_type string,
> cart_tpa_id string,
> cart_tpa string,
> cart_tpc string,
> cart_tpi string,
> cart_tcs string,
> cart_tcsa string,
> cart_tcdt string,
> cart_tcd string,
> cart_tci string,
> cart_postn_type string,
> sessn_chanl_id bigint,
> gu_sec_flg bigint,
> detl_refer_page_type_id string,
> detl_refer_page_value string,
> detl_event_id string,
> nav_refer_intrn_reslt_sum string,
> nav_intrn_reslt_sum string,
> nav_refer_intrn_kw string,
> nav_intrn_kw string,
> detl_track_time string,
> cart_track_time string)
> PARTITIONED BY (
> ds string)
> ROW FORMAT SERDE
> 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
> STORED AS INPUTFORMAT
> 'org.apache.hadoop.mapred.TextInputFormat'
> OUTPUTFORMAT
> 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
> LOCATION
> '/user/hive/dw/fct_traffic_navpage_path_detl'
> TBLPROPERTIES (
> 'numPartitions'='265',
> 'numFiles'='26677',
> 'last_modified_by'='bi_etl',
> 'last_modified_time'='1423633028',
> 'transient_lastDdlTime'='1427870517',
> 'numRows'='0',
> 'totalSize'='8268127466928',
> 'rawDataSize'='0')
>
> My query is :
>
> SELECT a1.sessn_id,
>
>        a1.ordr_code,
>
>        a1.cart_tracker_id,
>
>        a1.end_user_id,
>
>        a1.cart_track_time
>
> FROM   dw.fct_traffic_navpage_path_detl a1
>
> WHERE  a1.ds = '2015-01-19'
>
> AND    a1.cart_tracker_id > 0
>
> AND    (a1.cart_button_position IS NULL OR length(a1.cart_button_position) =
> 0)
>
> AND    a1.sessn_id IN ('Y49EY895ACABHS95DRQEE8DVFEB8JSE1',
>
>                        'UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG',
>
>                        '87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM')
>
>
>
> I attach my sample data.
>
>
> ________________________________
> r7raul1984@163.com
>
>
> From: Thejas Nair
> Date: 2015-04-02 15:28
> To: dev
> Subject: Re: hive 0.14 on some platform return some not NULL value as NULL
> Can you give more details
> - the query you are running
> - schema of the table
> - serialization format of the table, sample records if possible.
>
>
> On Wed, Apr 1, 2015 at 6:32 PM, r7raul1984@163.com <r7raul1984@163.com>
> wrote:
>>
>> I use   hive 1.1.0 cli  on computer A (linux)   the result is
>>
>> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM        2357378283356   91501191044440048
>> 7326356         NULL
>>
>> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM        2357378283356   121501191035580028
>> 7326356         NULL
>>
>> UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG        2362223711289   161501191549050061
>> 14837289      NULL
>>
>> Y49EY895ACABHS95DRQEE8DVFEB8JSE1        2360853052224   111501191426280023
>> 115883224       NULL
>>
>> But I use hive0.14 cli in my test enviroment the result is correct.
>>
>>
>> I use  hive 0.10 on computer B (linux) the result is
>>
>> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM        2357378283356   91501191044440048
>> 7326356        2015-01-19 10:44:44
>>
>> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM        2357378283356   121501191035580028
>> 7326356        2015-01-19 10:35:58
>>
>> UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG        2362223711289   161501191549050061
>> 14837289     2015-01-19 15:49:05
>>
>> Y49EY895ACABHS95DRQEE8DVFEB8JSE1        2360853052224   111501191426280023
>> 115883224       2015-01-19 14:26:28
>>
>>
>> Why ?
>>
>> I attach my log. Also in my log I found 2015-04-01 09:55:38,409 WARN
>> [main]
>> org.apache.hadoop.hive.serde2.lazy.LazyStruct: Extra bytes detected at the
>> end of the row! Ignoring similar problems.
>>
>>
>> ________________________________
>> r7raul1984@163.com
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message