hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "r7raul1984@163.com" <r7raul1...@163.com>
Subject Re: Re: hive 0.14 on some platform return some not NULL value as NULL
Date Thu, 02 Apr 2015 07:36:22 GMT

DDL is
CREATE TABLE dw.fct_traffic_navpage_path_detl( 
date_id string, 
chanl_id bigint, 
sessn_id string, 
gu_id string, 
prov_id string, 
city_id string, 
landing_page_type_id string, 
landing_track_time string, 
landing_url string, 
nav_refer_tracker_id string, 
nav_refer_page_type_id string, 
nav_refer_page_value string, 
nav_refer_link_position string, 
nav_tracker_id string, 
nav_page_categ_id string, 
nav_page_type_id string, 
nav_page_value string, 
nav_srce_type string, 
internal_keyword string, 
internal_result_sum string, 
pltfm_id int, 
app_vers string, 
nav_link_position string, 
nav_button_position string, 
nav_track_time string, 
nav_next_tracker_id string, 
sessn_last_time string, 
sessn_pv int, 
detl_tracker_id string, 
detl_page_type_id string, 
detl_page_value string, 
detl_pm_id bigint, 
detl_link_position string, 
detl_position_track_id string, 
cart_tracker_id string, 
cart_page_type_id string, 
cart_page_value string, 
cart_link_postion string, 
cart_button_position string, 
cart_position_track_id string, 
cart_prod_id bigint, 
ordr_tracker_id string, 
ordr_page_type_id string, 
ordr_code string, 
updt_time string, 
cart_pm_id bigint, 
brand_code string, 
categ_type int, 
os string, 
end_user_id string, 
add_cart_flag string, 
navgation_page_flag int, 
nav_page_url string, 
detl_button_position string, 
manul_flag int, 
manul_track_date string, 
nav_refer_tpa string, 
nav_refer_tpa_id string, 
nav_refer_tpc string, 
nav_refer_tpi string, 
nav_refer_tcs string, 
nav_refer_tcsa string, 
nav_refer_tcdt string, 
nav_refer_tcd string, 
nav_refer_tci string, 
nav_refer_postn_type string, 
nav_tpa_id string, 
nav_tpa string, 
nav_tpc string, 
nav_tpi string, 
nav_tcs string, 
nav_tcsa string, 
nav_tcdt string, 
nav_tcd string, 
nav_tci string, 
nav_postn_type string, 
detl_tpa_id string, 
detl_tpa string, 
detl_tpc string, 
detl_tpi string, 
detl_tcs string, 
detl_tcsa string, 
detl_tcdt string, 
detl_tcd string, 
detl_tci string, 
detl_postn_type string, 
cart_tpa_id string, 
cart_tpa string, 
cart_tpc string, 
cart_tpi string, 
cart_tcs string, 
cart_tcsa string, 
cart_tcdt string, 
cart_tcd string, 
cart_tci string, 
cart_postn_type string, 
sessn_chanl_id bigint, 
gu_sec_flg bigint, 
detl_refer_page_type_id string, 
detl_refer_page_value string, 
detl_event_id string, 
nav_refer_intrn_reslt_sum string, 
nav_intrn_reslt_sum string, 
nav_refer_intrn_kw string, 
nav_intrn_kw string, 
detl_track_time string, 
cart_track_time string) 
PARTITIONED BY ( 
ds string) 
ROW FORMAT SERDE 
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' 
STORED AS INPUTFORMAT 
'org.apache.hadoop.mapred.TextInputFormat' 
OUTPUTFORMAT 
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat' 
LOCATION 
'/user/hive/dw/fct_traffic_navpage_path_detl' 
TBLPROPERTIES ( 
'numPartitions'='265', 
'numFiles'='26677', 
'last_modified_by'='bi_etl', 
'last_modified_time'='1423633028', 
'transient_lastDdlTime'='1427870517', 
'numRows'='0', 
'totalSize'='8268127466928', 
'rawDataSize'='0') 

My query is :
SELECT a1.sessn_id,
       a1.ordr_code,
       a1.cart_tracker_id,
       a1.end_user_id,
       a1.cart_track_time
FROM   dw.fct_traffic_navpage_path_detl a1
WHERE  a1.ds = '2015-01-19'
AND    a1.cart_tracker_id > 0
AND    (a1.cart_button_position IS NULL OR length(a1.cart_button_position) = 0)
AND    a1.sessn_id IN ('Y49EY895ACABHS95DRQEE8DVFEB8JSE1',
                       'UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG',
                       '87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM')


I attach my sample data.



r7raul1984@163.com
 
From: Thejas Nair
Date: 2015-04-02 15:28
To: dev
Subject: Re: hive 0.14 on some platform return some not NULL value as NULL
Can you give more details
- the query you are running
- schema of the table
- serialization format of the table, sample records if possible.
 
 
On Wed, Apr 1, 2015 at 6:32 PM, r7raul1984@163.com <r7raul1984@163.com> wrote:
>
> I use   hive 1.1.0 cli  on computer A (linux)   the result is
>
> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM        2357378283356   91501191044440048
> 7326356         NULL
>
> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM        2357378283356   121501191035580028
> 7326356         NULL
>
> UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG        2362223711289   161501191549050061
> 14837289      NULL
>
> Y49EY895ACABHS95DRQEE8DVFEB8JSE1        2360853052224   111501191426280023
> 115883224       NULL
>
> But I use hive0.14 cli in my test enviroment the result is correct.
>
>
> I use  hive 0.10 on computer B (linux) the result is
>
> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM        2357378283356   91501191044440048
> 7326356        2015-01-19 10:44:44
>
> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM        2357378283356   121501191035580028
> 7326356        2015-01-19 10:35:58
>
> UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG        2362223711289   161501191549050061
> 14837289     2015-01-19 15:49:05
>
> Y49EY895ACABHS95DRQEE8DVFEB8JSE1        2360853052224   111501191426280023
> 115883224       2015-01-19 14:26:28
>
>
> Why ?
>
> I attach my log. Also in my log I found 2015-04-01 09:55:38,409 WARN [main]
> org.apache.hadoop.hive.serde2.lazy.LazyStruct: Extra bytes detected at the
> end of the row! Ignoring similar problems.
>
>
> ________________________________
> r7raul1984@163.com
Mime
  • Unnamed multipart/mixed (inline, None, 0 bytes)
View raw message