spark-issues mailing list archives

From "Steve Loughran (JIRA)"
Subject [jira] [Commented] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version
Date Mon, 07 May 2018 13:50:00 GMT


Steve Loughran commented on SPARK-18673:

Josh Rosen added some changes, particularly:

* 8f5918ad3dc7f3aa84ea04f3ef7761493c009d22 Update version to 1.2.1.spark2
* 10d91dca6c602a9f6c6fa428f341f135054c2c16 Re-shade Kryo
* 721aa7e4904a8a6069afe815af7cbf5ed3bde936 Change groupId to org.spark-project.hive; keep relocated Kryo under Hive namespace.
* aa9f5557b60facfe862f1f6c0a60537da8e88076 Put shaded protobuf classes under Hive package

Int-HDP patches/changes that I also plan to pull in, on the basis that (a) they were clearly
deemed important and (b) they apparently work:
* HIVE-11102 ReaderImpl: getColumnIndicesFromNames does not work for some cases
* Allowing the repo for publishing artifacts to be reconfigured from the normal Sonatype one
* Updating the group assembly plugin to use the same package names as in 721aa7e4

> Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version
> ------------------------------------------------------------------
>                 Key: SPARK-18673
>                 URL:
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.1.0
>         Environment: Spark built with -Dhadoop.version=3.0.0-alpha2-SNAPSHOT 
>            Reporter: Steve Loughran
>            Priority: Major
>
> Spark Dataframes fail to run on Hadoop 3.0.x, because hive.jar's shimloader considers
> 3.x to be an unknown Hadoop version.
> Hive itself will have to fix this; as Spark uses its own hive 1.2.x JAR, it will need
> to be updated to match.
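
For illustration only, the failure mode described in the issue can be sketched as a version check that knows about major versions 1 and 2 and rejects anything else. This is a minimal sketch under assumptions, not the actual Hive ShimLoader source: the class name `ShimVersionSketch`, the method `shimNameFor`, the returned shim names, and the exception messages are all hypothetical.

```java
// Illustrative sketch only; NOT the real Hive ShimLoader code.
// Class, method, and shim names are hypothetical.
final class ShimVersionSketch {

    /**
     * Maps a Hadoop version string to a (hypothetical) shim name.
     * Only major versions 1 and 2 are recognised; anything else is rejected,
     * which is the behaviour the issue describes for Hadoop 3.x.
     */
    static String shimNameFor(String hadoopVersion) {
        String[] parts = hadoopVersion.split("\\.");
        if (parts.length < 2) {
            throw new IllegalArgumentException(
                "Illegal Hadoop version: " + hadoopVersion);
        }
        int major = Integer.parseInt(parts[0]);
        switch (major) {
            case 1:
                return "hadoop-1-shim";
            case 2:
                return "hadoop-2-shim";
            default:
                // A 3.x version string ends up here.
                throw new IllegalArgumentException(
                    "Unrecognized Hadoop major version number: " + hadoopVersion);
        }
    }

    public static void main(String[] args) {
        System.out.println(shimNameFor("2.7.3"));
        try {
            // The version string from the issue's Environment field.
            shimNameFor("3.0.0-alpha2-SNAPSHOT");
        } catch (IllegalArgumentException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

With a check of this shape, supporting a new major version means changing the library that owns the check, which is why the issue says that Hive has to fix this and that the Hive 1.2.x JAR used by Spark must be updated to match.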
