From issues-return-44326-archive-asf-public=cust-asf.ponee.io@tez.apache.org Mon Jun 7 11:53:24 2021 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mxout1-ec2-va.apache.org (mxout1-ec2-va.apache.org [3.227.148.255]) by mx-eu-01.ponee.io (Postfix) with ESMTPS id 8D13118064E for ; Mon, 7 Jun 2021 13:53:24 +0200 (CEST) Received: from mail.apache.org (mailroute1-lw-us.apache.org [207.244.88.153]) by mxout1-ec2-va.apache.org (ASF Mail Server at mxout1-ec2-va.apache.org) with SMTP id 423793FDA9 for ; Mon, 7 Jun 2021 11:53:08 +0000 (UTC) Received: (qmail 13024 invoked by uid 500); 7 Jun 2021 11:53:03 -0000 Mailing-List: contact issues-help@tez.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@tez.apache.org Delivered-To: mailing list issues@tez.apache.org Received: (qmail 12934 invoked by uid 99); 7 Jun 2021 11:53:03 -0000 Received: from mailrelay1-he-de.apache.org (HELO mailrelay1-he-de.apache.org) (116.203.21.61) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Jun 2021 11:53:03 +0000 Received: from jira2-he-de.apache.org (jira2-he-de.apache.org [168.119.33.54]) by mailrelay1-he-de.apache.org (ASF Mail Server at mailrelay1-he-de.apache.org) with ESMTPS id 4B18D3E8B3 for ; Mon, 7 Jun 2021 11:53:00 +0000 (UTC) Received: from jira2-he-de.apache.org (localhost.localdomain [127.0.0.1]) by jira2-he-de.apache.org (ASF Mail Server at jira2-he-de.apache.org) with ESMTP id 2BEEEC80587 for ; Mon, 7 Jun 2021 11:53:00 +0000 (UTC) Date: Mon, 7 Jun 2021 11:53:00 +0000 (UTC) From: =?utf-8?Q?L=C3=A1szl=C3=B3_Bodor_=28Jira=29?= To: issues@tez.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (TEZ-4295) Could not decompress data. Buffer length is too small. MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/TEZ-4295?page=3Dcom.atlassian.j= ira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D173585= 51#comment-17358551 ]=20 L=C3=A1szl=C3=B3 Bodor commented on TEZ-4295: ----------------------------------- in the meantime I realized that synchronization is only needed on CodecUtil= s.getDecompressedInputStreamWithBufferSize, because compressor.reinit(conf)= is only used for compressors and not for decompressors, moreover we don't = use getCompressor with conf parameter from tez attached a unit test for proving the fix > Could not decompress data. Buffer length is too small. > ------------------------------------------------------ > > Key: TEZ-4295 > URL: https://issues.apache.org/jira/browse/TEZ-4295 > Project: Apache Tez > Issue Type: Bug > Affects Versions: 0.10.0 > Reporter: junnan.yang > Assignee: L=C3=A1szl=C3=B3 Bodor > Priority: Major > Fix For: 0.10.1 > > Attachments: TEZ-4295.01.patch > > Time Spent: 0.5h > Remaining Estimate: 0h > > tez =E4=BD=BF=E7=94=A8snappy=E5=8E=8B=E7=BC=A9=E6=96=B9=E5=BC=8F=E6=97=B6= =EF=BC=8C=E4=BC=9A=E6=8A=A5=E9=94=99=E7=BC=93=E5=86=B2=E5=8C=BA=E5=A4=AA=E5= =B0=8F=EF=BC=9A > java.io.IOException: java.lang.InternalError: Could not decompress data. = Buffer length is too small.java.io.IOException: java.lang.InternalError: Co= uld not decompress data. Buffer length is too small. at org.apache.tez.runt= ime.library.common.shuffle.ShuffleUtils.shuffleToMemory(ShuffleUtils.java:1= 37) at org.apache.tez.runtime.library.common.shuffle.orderedgrouped.Fetcher= OrderedGrouped.copyMapOutput(FetcherOrderedGrouped.java:550) at org.apache.= tez.runtime.library.common.shuffle.orderedgrouped.FetcherOrderedGrouped.cop= yFromHost(FetcherOrderedGrouped.java:283) at org.apache.tez.runtime.library= .common.shuffle.orderedgrouped.FetcherOrderedGrouped.fetchNext(FetcherOrder= edGrouped.java:182) at org.apache.tez.runtime.library.common.shuffle.ordere= dgrouped.FetcherOrderedGrouped.callInternal(FetcherOrderedGrouped.java:194)= at org.apache.tez.runtime.library.common.shuffle.orderedgrouped.FetcherOrd= eredGrouped.callInternal(FetcherOrderedGrouped.java:57) at org.apache.tez.c= ommon.CallableWithNdc.call(CallableWithNdc.java:36) at com.google.common.ut= il.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.ru= nInterruptibly(TrustedListenableFutureTask.java:111) at com.google.common.u= til.concurrent.InterruptibleTask.run(InterruptibleTask.java:58) at com.goog= le.common.util.concurrent.TrustedListenableFutureTask.run(TrustedListenable= FutureTask.java:75) at java.util.concurrent.ThreadPoolExecutor.runWorker(Th= readPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Work= er.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748= )Caused by: java.lang.InternalError: Could not decompress data. Buffer leng= th is too small. at org.apache.hadoop.io.compress.snappy.SnappyDecompressor= .decompressBytesDirect(Native Method) at org.apache.hadoop.io.compress.snap= py.SnappyDecompressor.decompress(SnappyDecompressor.java:238) at org.apache= .hadoop.io.compress.BlockDecompressorStream.decompress(BlockDecompressorStr= eam.java:88) at org.apache.hadoop.io.compress.DecompressorStream.read(Decom= pressorStream.java:105) at org.apache.hadoop.io.IOUtils.readFully(IOUtils.j= ava:210) at org.apache.tez.runtime.library.common.sort.impl.IFile$Reader.re= adToMemory(IFile.java:833) at org.apache.tez.runtime.library.common.shuffle= .ShuffleUtils.shuffleToMemory(ShuffleUtils.java:121) ... 12 more -- This message was sent by Atlassian Jira (v8.3.4#803005)