drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Venki Korukanti (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-3266) Drill's hive storage plugin cannot find RegexSerDe
Date Wed, 10 Jun 2015 18:16:01 GMT

    [ https://issues.apache.org/jira/browse/DRILL-3266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14580902#comment-14580902
] 

Venki Korukanti commented on DRILL-3266:
----------------------------------------

This particular SerDe is part of the hive-contrib.jar which we don't package with Drill. Fix
would be to fetch this jar as part of the hive storage plugin, so that SerDes shipped with
Hive are by default available in Drill.

> Drill's hive storage plugin cannot find RegexSerDe
> --------------------------------------------------
>
>                 Key: DRILL-3266
>                 URL: https://issues.apache.org/jira/browse/DRILL-3266
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Hive
>            Reporter: Rahul Challapalli
>            Assignee: Venki Korukanti
>         Attachments: apache_log.txt, error.log
>
>
> git.commit.id.abbrev=5f26b8b
> HIve DDL :
> {code}
> CREATE EXTERNAL TABLE IF NOT EXISTS hive_storage.raw_access_log(
> remote_host STRING,
> remote_logname STRING,
> remote_user STRING,
> request_time STRING,
> first_line STRING,
> http_status STRING,
> bytes STRING
> )
> ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
> WITH SERDEPROPERTIES (
> "input.regex" = "([^ ]*) ([^ ]*) ([^ ]*) (-|\\[[^\\]]*\\]) ([^ \"]*|\"[^\"]*\") (-|[0-9]*)
(-|[0-9]*)",
> "output.format.string" = "%1$s %2$s %3$s %4$s %5$s %6$s %7$s"
> )
> LOCATION '/drill/testdata/hive_storage/raw_access_log';
> {code}
> When I try to query this table from sqlline, I get the below error
> {code}
> 0: jdbc:drill:schema=dfs_eea> select * from raw_access_log;
> Error: PARSE ERROR: MetaException(message:java.lang.ClassNotFoundException Class org.apache.hadoop.hive.contrib.serde2.RegexSerDe
not found)
> [Error Id: 5961c4c4-1cf5-495c-9c8a-0616b58d2488 on qa-node190.qa.lab:31010] (state=,code=0)
> {code}
> The query works from hive without any issues. I attached the data set and the error log



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message