Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id E7489200B67 for ; Tue, 16 Aug 2016 22:42:33 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id E6022160AA8; Tue, 16 Aug 2016 20:42:33 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 137CB160A74 for ; Tue, 16 Aug 2016 22:42:32 +0200 (CEST) Received: (qmail 8141 invoked by uid 500); 16 Aug 2016 20:42:32 -0000 Mailing-List: contact dev-help@poi.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "POI Developers List" Delivered-To: mailing list dev@poi.apache.org Received: (qmail 8122 invoked by uid 99); 16 Aug 2016 20:42:31 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Aug 2016 20:42:31 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 86C76180A29 for ; Tue, 16 Aug 2016 20:42:31 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.298 X-Spam-Level: * X-Spam-Status: No, score=1.298 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx2-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id DF5kSClKeyPW for ; Tue, 16 Aug 2016 20:42:28 +0000 (UTC) Received: from mout.gmx.net (mout.gmx.net [212.227.15.19]) by mx2-lw-us.apache.org (ASF Mail Server at mx2-lw-us.apache.org) with ESMTPS id E0AD05FBB5 for ; Tue, 16 Aug 2016 20:42:27 +0000 (UTC) Received: from mail-wm0-f49.google.com ([74.125.82.49]) by mail.gmx.com (mrgmx002) with ESMTPSA (Nemesis) id 0MbgWb-1bs7YB1Xz3-00IyXX for ; Tue, 16 Aug 2016 22:42:14 +0200 Received: by mail-wm0-f49.google.com with SMTP id f65so163068772wmi.0 for ; Tue, 16 Aug 2016 13:42:04 -0700 (PDT) X-Gm-Message-State: AEkoout+XEay3+DCZXZ0phf6GUmagY4iHfToHp6heNvjQVxlngYu+nblqFQg5sD/kW9Q+z4y8dQBQXPgvqO7IQ== X-Received: by 10.28.134.14 with SMTP id i14mr22052505wmd.59.1471380123854; Tue, 16 Aug 2016 13:42:03 -0700 (PDT) MIME-Version: 1.0 Received: by 10.28.159.205 with HTTP; Tue, 16 Aug 2016 13:42:03 -0700 (PDT) In-Reply-To: References: <57ABAEED.1010501@apache.org> <712c7c0b-7279-2977-1bea-2bfeefeda834@corefiling.co.uk> From: Dominik Stadler Date: Tue, 16 Aug 2016 22:42:03 +0200 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [VOTE] Apache POI 3.15-beta3 To: POI Developers List Content-Type: multipart/alternative; boundary=001a114416b89cb3a9053a3661fa X-Provags-ID: V03:K0:rMk2kqSxy+T0Vzlw+QV1KkQ6y/Io139R66Nj8QsTRNgskZ8fTv/ k19R35sRi2chEMc7VKhamHL23xeAHKob8O+AGMT2hTCDWUrPdLVC0T3DHkuhkjqlRVwhwsh fx5TSonN887AtNWxyTCCEkb34a3sWjr0zDarzk8/V+UyxucwWfWVVuxyWz7q1ujXCg1M0H+ 3jKlzONDblIb5mAy8RTfg== X-UI-Out-Filterresults: notjunk:1;V01:K0:SW7Ogg1jUII=:AEMe2kFSWRFf5XFEurZEh9 rUMQmoL6JvNys/9NMazgHs+uCfvLwK15L4wxvaeSx0/nZZmeYxIEcAuItt5ZdgkT/zsCik9mK G1snthLxvanikjmDsgo+FO8ke6Qkg6oO4I8FpQ+j5woRQunh553B2IiU86/bN7l2KYgOHNjff RSkQ33bBNtznNz5c57XRxpOOJ4WfWkq6CNeJGQFVfomlU5zS0niJHyUdA997JPP/FcHkUxFVh pJum4lN5prGnDFE79f/q+PWssn3uYq+L7Tt6lFdVyBG5ew/1v8+tpvdbobTJWnWIuaqyXFjDd 4EdeJ7vxaP7xDh98R75UUhStyv6p6BRU6W/DrRnypVOS3UEyVKa2RCz1RObNuw1r+wi1jXBd+ bCt4zcmUKiFFY2TvtXcGwxsQjESNByMGDCyZtUJna1UVQAnAffrb1L1duMfAQFVYECuSGo4li jGK9oQd67Qs3lJNlxXT+zeL7WZ1pWv1M4kyMSt3c7LpUrHkmBXpCKZHrzvSyVV7lRUK1N8zuV hWtPR6CFpLRkw1Uhw2X3pIVRp5uYGi/M6pKJKcA1tGez/tmPhs8zzQGV04eGGEaK0wseyWl06 /oo7UwHgRoU+BknVAQnK3+HdGuMnQQW8dfMerun3bFl1DvSWEEO+q5vkh60clKxVQiFQ1TDCi 3KyeVGBowySIHqYU79Or3JmJ9jdn4Wt2Op77MtnPnoul8lVUAQmjn5vksOFKD/jB2E1EBSHNr j28lq3217yyOkLhLUj4A2gUREqBme9kulxkwjFjNxG51go1dZz4Fx/8sKURKi+yHTOzrgJGoA 3U+L9NJWgYx/XpSmJbMycDBUyUAwg== archived-at: Tue, 16 Aug 2016 20:42:34 -0000 --001a114416b89cb3a9053a3661fa Content-Type: text/plain; charset=UTF-8 Hi, The 2nd item is a regression, so I didn't think it should be related to those code-pieces as we did not change them compared to previous versions. Therefore I git-bisected this and it seems the following change causes this regression: https://svn.apache.org/viewvc?view=rev&rev=1753048 It seems the extraction of the call to getOperand()/getOperation() outside of the switch causes it to be called in more cases than before, namely in some of the switch-cases it was not called before, but is now, thus triggering this issue. Javen, I think we can simply undo this change for now as it was not intended to change behavior, or? The additional log can stay in place, naturally. Dominik. On Mon, Aug 15, 2016 at 6:09 PM, Javen O'Neal wrote: > > * 4 times NullPointerException in XSLFTextParagraph.getDefaultFontSize() > I opened bug 60005 [1] to fix the NPE in XSLFTextParagraph. > getDefaultFontSize() > This has been fixed. > > > * 456 times: ArrayIndexOutOfBoundsException in SprmOperation.getOperand() > The assumption that the operand length is "surely shorter than an int" > seems to incorrect for some files, since the test failed with AIOOB 4, > meaning the operandLength is at least 5 bytes. > I would need to look at the HWPF document specification ([MS-DOC].pdf, > [3]) to see if the operandLength may be longer than an int. > I do not know what the best way to fix this code is > > // surely shorter than an int... > byte operandLength = _grpprl[_gOffset + 1]; > > // initialized to zeros by JVM > byte[] codeBytes = new byte[LittleEndian.INT_SIZE]; > for ( int i = 0; i < operandLength; i++ ) > if ( _gOffset + i < _grpprl.length ) > codeBytes[i] = _grpprl[_gOffset + 1 + i]; > > return LittleEndian.getInt( codeBytes, 0 ); > > > [1] https://bz.apache.org/bugzilla/show_bug.cgi?id=60005 > [2] https://svn.apache.org/viewvc/poi/trunk/src/scratchpad/src/ > org/apache/poi/hwpf/sprm/SprmOperation.java?revision= > 1753052&view=markup#l106 > [3] https://interoperability.blob.core.windows.net/files/ > OfficeFileFormatsProtocols.zip > > On Mon, Aug 15, 2016 at 3:09 AM, Dominik Stadler > wrote: > > Hi, > > > > Running the regression tests for POI 3.15-beta3 against the CommonCrawl > > corpus is now finished, initial results are as follows: > > > > * 11966 fail because I did not add commons-collections4, I'll trigger a > > re-run to get document-counts correctly show the number of regressing > > documents > > > > * 456 times: ArrayIndexOutOfBoundsException in SprmOperation.getOperand() > > > > java.lang.RuntimeException: java.lang.ArrayIndexOutOfBoundsException: * > > at o.a.p.hwpf.extractor.WordExtractor.getText( > WordExtractor.java:317) > > at o.a.p.stress.AbstractFileHandler.handleExtractingInternal( > AbstractFileHandler.java:85) > > at o.a.p.stress.AbstractFileHandler.handleExtracting( > AbstractFileHandler.java:60) > > at org.dstadler.commoncrawl.FileHandlingRunnable.run( > FileHandlingRunnable.java:58) > > > > Caused by: java.lang.ArrayIndexOutOfBoundsException: 4 > > at o.a.p.hwpf.sprm.SprmOperation.getOperand(SprmOperation.java: > 113) > > at o.a.p.hwpf.sprm.SectionSprmUncompressor. > unCompressSEPOperation(SectionSprmUncompressor.java:62) > > at o.a.p.hwpf.sprm.SectionSprmUncompressor.uncompressSEP( > SectionSprmUncompressor.java:44) > > at o.a.p.hwpf.model.SEPX.getSectionProperties(SEPX.java:61) > > at o.a.p.hwpf.usermodel.Section.(Section.java:36) > > at o.a.p.hwpf.usermodel.Range.getSection(Range.java:745) > > at o.a.p.hwpf.converter.AbstractWordConverter.processDocument( > AbstractWordConverter.java:721) > > at o.a.p.hwpf.extractor.WordExtractor.getText( > WordExtractor.java:299) > > ... 9 more > > > > * 4 times NullPointerException in XSLFTextParagraph.getDefaultFontSize() > > > > java.lang.NullPointerException > > at o.a.p.xslf.usermodel.XSLFTextParagraph.getDefaultFontSize( > XSLFTextParagraph.java:935) > > at o.a.p.sl.draw.DrawTextParagraph.getAttributedString( > DrawTextParagraph.java:567) > > at o.a.p.sl.draw.DrawTextParagraph.breakText( > DrawTextParagraph.java:235) > > at o.a.p.sl.draw.DrawTextShape.drawParagraphs(DrawTextShape. > java:158) > > at o.a.p.sl.draw.DrawTextShape.getTextHeight(DrawTextShape. > java:219) > > at o.a.p.sl.draw.DrawTextShape.drawContent(DrawTextShape. > java:102) > > at o.a.p.sl.draw.DrawSimpleShape.draw(DrawSimpleShape.java:93) > > at o.a.p.sl.draw.DrawSheet.draw(DrawSheet.java:67) > > at o.a.p.sl.draw.DrawSlide.draw(DrawSlide.java:39) > > at o.a.p.xslf.usermodel.XSLFSlide.draw(XSLFSlide.java:301) > > at o.a.p.stress.SlideShowHandler.renderSlides(SlideShowHandler. > java:120) > > at o.a.p.stress.SlideShowHandler.handleSlideShow( > SlideShowHandler.java:43) > > at o.a.p.stress.XSLFFileHandler.handleFile(XSLFFileHandler. > java:43) > > at org.dstadler.commoncrawl.FileHandlingRunnable.run( > FileHandlingRunnable.java:58) > > > > > > > > The others are probably flaky things where files caused OOM/Timeout > before > > and thus were not reported with these errors before. > > > > > > See http://people.apache.org/~centic/poi_regression/reports/ and > > http://people.apache.org/~centic/poi_regression/reportsAll/ for detailed > > results. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org > For additional commands, e-mail: dev-help@poi.apache.org > > --001a114416b89cb3a9053a3661fa--