drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rahul Challapalli (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-3266) Drill's hive storage plugin cannot find RegexSerDe
Date Tue, 09 Jun 2015 18:58:00 GMT
Rahul Challapalli created DRILL-3266:

             Summary: Drill's hive storage plugin cannot find RegexSerDe
                 Key: DRILL-3266
                 URL: https://issues.apache.org/jira/browse/DRILL-3266
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Hive
            Reporter: Rahul Challapalli
            Assignee: Venki Korukanti


HIve DDL :
CREATE EXTERNAL TABLE IF NOT EXISTS hive_storage.raw_access_log(
remote_host STRING,
remote_logname STRING,
remote_user STRING,
request_time STRING,
first_line STRING,
http_status STRING,
bytes STRING
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
"input.regex" = "([^ ]*) ([^ ]*) ([^ ]*) (-|\\[[^\\]]*\\]) ([^ \"]*|\"[^\"]*\") (-|[0-9]*)
"output.format.string" = "%1$s %2$s %3$s %4$s %5$s %6$s %7$s"
LOCATION '/drill/testdata/hive_storage/raw_access_log';

When I try to query this table from sqlline, I get the below error
0: jdbc:drill:schema=dfs_eea> select * from raw_access_log;
Error: PARSE ERROR: MetaException(message:java.lang.ClassNotFoundException Class org.apache.hadoop.hive.contrib.serde2.RegexSerDe
not found)

[Error Id: 5961c4c4-1cf5-495c-9c8a-0616b58d2488 on qa-node190.qa.lab:31010] (state=,code=0)

The query works from hive without any issues. I attached the data set and the error log

This message was sent by Atlassian JIRA

View raw message