From common-issues-return-206247-archive-asf-public=cust-asf.ponee.io@hadoop.apache.org Wed Sep 16 18:51:03 2020 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mailroute1-lw-us.apache.org (mailroute1-lw-us.apache.org [207.244.88.153]) by mx-eu-01.ponee.io (Postfix) with ESMTPS id D2B9C1804BB for ; Wed, 16 Sep 2020 20:51:03 +0200 (CEST) Received: from mail.apache.org (localhost [127.0.0.1]) by mailroute1-lw-us.apache.org (ASF Mail Server at mailroute1-lw-us.apache.org) with SMTP id 088CA123104 for ; Wed, 16 Sep 2020 18:51:03 +0000 (UTC) Received: (qmail 19643 invoked by uid 500); 16 Sep 2020 18:51:02 -0000 Mailing-List: contact common-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list common-issues@hadoop.apache.org Received: (qmail 19615 invoked by uid 99); 16 Sep 2020 18:51:02 -0000 Received: from mailrelay1-us-west.apache.org (HELO mailrelay1-us-west.apache.org) (209.188.14.139) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 16 Sep 2020 18:51:02 +0000 Received: from jira-he-de.apache.org (static.172.67.40.188.clients.your-server.de [188.40.67.172]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 89247429FD for ; Wed, 16 Sep 2020 18:51:01 +0000 (UTC) Received: from jira-he-de.apache.org (localhost.localdomain [127.0.0.1]) by jira-he-de.apache.org (ASF Mail Server at jira-he-de.apache.org) with ESMTP id 2F7CB7809A8 for ; Wed, 16 Sep 2020 18:51:00 +0000 (UTC) Date: Wed, 16 Sep 2020 18:51:00 +0000 (UTC) From: "ASF GitHub Bot (Jira)" To: common-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Work logged] (HADOOP-17125) Using snappy-java in SnappyCodec MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HADOOP-17125?focusedWorklogId= =3D485313&page=3Dcom.atlassian.jira.plugin.system.issuetabpanels:worklog-ta= bpanel#worklog-485313 ] ASF GitHub Bot logged work on HADOOP-17125: ------------------------------------------- Author: ASF GitHub Bot Created on: 16/Sep/20 18:50 Start Date: 16/Sep/20 18:50 Worklog Time Spent: 10m=20 Work Description: dbtsai commented on a change in pull request #2297: URL: https://github.com/apache/hadoop/pull/2297#discussion_r489663248 ########## File path: hadoop-common-project/hadoop-common/pom.xml ########## @@ -363,6 +363,10 @@ wildfly-openssl-java provided + + org.xerial.snappy Review comment: We can make it provided, and once we create a `hadoop-compression` m= odule, we can add back the jar. @viirya since the jar will be provided, we = need to check if the class exists so we can log it with right message. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 485313) Time Spent: 10.5h (was: 10h 20m) > Using snappy-java in SnappyCodec > -------------------------------- > > Key: HADOOP-17125 > URL: https://issues.apache.org/jira/browse/HADOOP-17125 > Project: Hadoop Common > Issue Type: New Feature > Components: common > Affects Versions: 3.3.0 > Reporter: DB Tsai > Priority: Major > Labels: pull-request-available > Time Spent: 10.5h > Remaining Estimate: 0h > > In Hadoop, we use native libs for snappy codec which has several disadvan= tages: > * It requires native *libhadoop* and *libsnappy* to be installed in syst= em *LD_LIBRARY_PATH*, and they have to be installed separately on each node= of the clusters, container images, or local test environments=C2=A0which a= dds huge complexities from deployment point of view. In some environments, = it requires compiling the natives from sources which is non-trivial. Also, = this approach is platform dependent; the binary may not work in different p= latform, so it requires recompilation. > * It requires extra configuration of *java.library.path* to load the nat= ives, and it results higher application deployment and maintenance cost for= users. > Projects such as *Spark* and *Parquet*=C2=A0use [snappy-java|[https://git= hub.com/xerial/snappy-java]] which is JNI-based implementation. It contains= native binaries for Linux, Mac, and IBM in jar file, and it can automatica= lly load the native binaries into JVM from jar without any setup.=C2=A0If a= native implementation can not be found for a platform, it can fallback to = pure-java implementation of snappy based on [aircompressor|[https://github.= com/airlift/aircompressor/tree/master/src/main/java/io/airlift/compress/sna= ppy]]. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: common-issues-help@hadoop.apache.org