hudi-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xing Pan (Jira)" <>
Subject [jira] [Commented] (HUDI-376) AWS Glue dependency issue for EMR 5.28.0
Date Mon, 06 Jan 2020 01:54:00 GMT


Xing Pan commented on HUDI-376:

[~xleesf] sorry for the delay of response.

I'd like to send a PR,  but I think the script "" in github repo is different
from the script in EMR.

I am not sure where the source code of EMR version of "" is.

But surely I can send a PR to add document of aws-configs.

> AWS Glue dependency issue for EMR 5.28.0
> ----------------------------------------
>                 Key: HUDI-376
>                 URL:
>             Project: Apache Hudi (incubating)
>          Issue Type: Improvement
>          Components: Usability
>            Reporter: Xing Pan
>            Priority: Minor
>             Fix For: 0.5.1
> Hi hudi team, it's really encouraging that Hudi is finally officially supported application
on AWS EMR. Great job!
> I found a *ClassNotFound* exception when using:
> {code:java}
> /usr/lib/hudi/bin/
> {code}
> in emr master.
> And I think is due to demand of aws glue data sdk dependency. (I used aws glue as hive
meta data)
> So I added a line to to get a quick fix for this:
> {code:java}
> HIVE_JARS=$HIVE_JARS:/usr/lib/hive/auxlib/aws-glue-datacatalog-hive2-client.jar:/usr/share/aws/emr/emr-metrics-collector/lib/aws-java-sdk-glue-1.11.475.jar{code}
> not sure if any more jars needed, but these two jar fixed my problem.
> I think it would be great if take glue in consideration for emr scripts.

This message was sent by Atlassian Jira

View raw message