Return-Path: X-Original-To: apmail-crunch-dev-archive@www.apache.org Delivered-To: apmail-crunch-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8F377D2CA for ; Thu, 23 May 2013 19:57:53 +0000 (UTC) Received: (qmail 64044 invoked by uid 500); 23 May 2013 19:57:53 -0000 Delivered-To: apmail-crunch-dev-archive@crunch.apache.org Received: (qmail 64024 invoked by uid 500); 23 May 2013 19:57:53 -0000 Mailing-List: contact dev-help@crunch.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@crunch.apache.org Delivered-To: mailing list dev@crunch.apache.org Received: (qmail 64016 invoked by uid 99); 23 May 2013 19:57:53 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 May 2013 19:57:53 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of harsh@cloudera.com designates 209.85.223.170 as permitted sender) Received: from [209.85.223.170] (HELO mail-ie0-f170.google.com) (209.85.223.170) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 May 2013 19:57:47 +0000 Received: by mail-ie0-f170.google.com with SMTP id aq17so9946177iec.15 for ; Thu, 23 May 2013 12:57:26 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:x-gm-message-state; bh=oJyGaQUCxKCV9DKf8VAgNZtoz5oFsxdQWqy74Czr4hI=; b=NbK5CMH905vuBCb7q8EJRlOXwEsZKxjJw62F9ptV72MKGBoWMeQRxQgKUVIDdsoXTH WFBHNreRNu9Mj6ORSxBTlAwBws6r4grKJy1RrTQrYpQYRHB2Fdnqe/HzoQjM+wQRNKi3 4yN+8sEfw59+SY5VvMxaBg9tpz545iyB6adcMneq99e7XAwzjEJcEpjqSyuDXCPXQEcU hfc0ESOwuQywVbIsa5xf8lc6yACES3mx+MFfCaFFaa/TAtOyqDK+6gHApp7GB5Qgg+cH T/i/qXxA9x/nv+z71fywHB2B7utNY66CBIFAZcfxe+/ldylaQUkF6bVYo/Hv+lHCIW2f FSzA== X-Received: by 10.42.11.74 with SMTP id t10mr11193351ict.49.1369339046089; Thu, 23 May 2013 12:57:26 -0700 (PDT) MIME-Version: 1.0 Received: by 10.50.33.39 with HTTP; Thu, 23 May 2013 12:57:05 -0700 (PDT) In-Reply-To: References: <77F0498D3B83AB409C64E0CEE52178410445CDF5@mbx025-e1-nj-6.exch025.domain.local> <3955725B-C06F-4288-90B0-F7815F2A8DE1@gmail.com> From: Harsh J Date: Fri, 24 May 2013 01:27:05 +0530 Message-ID: Subject: Re: [jira] [Updated] (CRUNCH-209) Jobs with large numbers of directory inputs will fail with odd inputsplit exceptions To: dev@crunch.apache.org Content-Type: multipart/alternative; boundary=20cf301d425a6be96e04dd681731 X-Gm-Message-State: ALoCoQnHSBaRGhNGqMpStokgCZPzX05/BSM8iWCZaMhnZRJMOQ5F4uCRgXM78GomquZ/qigSFcJL X-Virus-Checked: Checked by ClamAV on apache.org --20cf301d425a6be96e04dd681731 Content-Type: text/plain; charset=ISO-8859-1 There is a limit in MR1, mapred.user.jobconf.limit per http://hadoop.apache.org/docs/stable/mapred-default.html, that limits it to 5 MB (but this is applied at the JT level). I am not aware of any serialization-time limits and think there are none as I've seen Hive use the same code to write enormous sized files. Worth noting that MR2, suitable to its service-less architecture, has no such limits on jobconf size and the property isn't present in it anymore. On Fri, May 24, 2013 at 12:13 AM, Josh Wills wrote: > On Thu, May 23, 2013 at 11:39 AM, Gabriel Reid >wrote: > > > Yep, definitely looks like an improvement! > > > > What was the actual cause of John's issue in the beginning? Is there a > > physical > > limit (or bug) in the serialization of Configuration values? > > > > It seems like there must be, although I couldn't figure out where it was > happening exactly, and Googling around for limits about jobconf > serialization didn't turn up anything, either. > > > > > > - Gabriel > > > > On 23 May 2013, at 20:26, Josh Wills wrote: > > > > > Glorious. That had been on my TODO list for awhile, I'm glad we found a > > > problem that forced me to fix it. ;-) Will commit to master. We should > > also > > > probably consider a point release (0.6.1) with that fix, esp. due to > the > > > startup improvements. > > > > > > J > > > > > > > > > On Thu, May 23, 2013 at 11:00 AM, John Jensen < > jensen@richrelevance.com > > >wrote: > > > > > >> > > >> Thanks, Josh. That worked perfectly! > > >> > > >> It has the added benefit of dramatically improving the startup time. I > > >> assume because we're no longer copying the monstrous jobconfs around. > > >> > > >> -- John > > >> > > >> ________________________________________ > > >> From: Josh Wills (JIRA) [jira@apache.org] > > >> Sent: Wednesday, May 22, 2013 5:27 PM > > >> To: crunch-dev@incubator.apache.org > > >> Subject: [jira] [Updated] (CRUNCH-209) Jobs with large numbers of > > >> directory inputs will fail with odd inputsplit exceptions > > >> > > >> [ > > >> > > > https://issues.apache.org/jira/browse/CRUNCH-209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel > > ] > > >> > > >> Josh Wills updated CRUNCH-209: > > >> ------------------------------ > > >> > > >> Attachment: CRUNCH-209.patch > > >> > > >> A hypothetical fix for John to test out. > > >> > > >>> Jobs with large numbers of directory inputs will fail with odd > > >> inputsplit exceptions > > >>> > > >> > > > ------------------------------------------------------------------------------------ > > >>> > > >>> Key: CRUNCH-209 > > >>> URL: https://issues.apache.org/jira/browse/CRUNCH-209 > > >>> Project: Crunch > > >>> Issue Type: Bug > > >>> Components: Core > > >>> Affects Versions: 0.5.0, 0.6.0 > > >>> Reporter: Josh Wills > > >>> Assignee: Josh Wills > > >>> Attachments: CRUNCH-209.patch > > >>> > > >>> > > >>> From John Jensen on the user mailing list: > > >>> I have a curious problem when running a crunch job on (avro) files > in a > > >> fairly large set of directories (just slightly less than 100). > > >>> After running some fraction of the mappers they start failing with > the > > >> exception below. Things work fine with a smaller number of > directories. > > >>> The magic > > >> > > > 'zdHJpbmcifSx7Im5hbWUiOiJ2YWx1ZSIsInR5cGUiOiJzdHJpbmcifV19fSwiZGVmYXVsdCI' > > >> string shows up in the 'crunch.inputs.dir' entry in the job config, > so I > > >> assume it has something to do with deserializing that value, but > reading > > >> through the code I don't see any obvious way how. > > >>> Furthermore, the crunch.inputs.dir config entry is just under 1.5M, > so > > >> it would not surprise me if I'm running up against a hadoop limit > > somewhere. > > >>> Stack trace: > > >>> java.io.IOException: Split class zdHJp > > >>> bmcifSx7Im5hbWUiOiJ2YWx1ZSIsInR5cGUiOiJzdHJpbmcifV19fSwiZGVmYXVsdCI > not > > >> found > > >>> at > > >> org.apache.hadoop.mapred.MapTask.getSplitDetails(MapTask.java:342) > > >>> at > org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:614) > > >>> at org.apache.hadoop.mapred.MapTask.run(MapTask.java:325) > > >>> at org.apache.hadoop.mapred.Child$4.run(Child.java:268) > > >>> at java.security.AccessController.doPrivileged(Native Method) > > >>> at javax.security.auth.Subject.doAs(Subject.java:415) > > >>> at > > >> > > > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408) > > >>> at org.apache.hadoop.mapred.Child.main(Child.java:262) > > >>> Caused by: java.lang.ClassNotFoundException: Class zdHJp > > >>> bmcifSx7Im5hbWUiOiJ2YWx1ZSIsInR5cGUiOiJzdHJpbmcifV19fSwiZGVmYXVsdCI > not > > >> found > > >>> at > > >> > > > org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:1493) > > >>> at > > >> org.apache.hadoop.mapred.MapTask.getSplitDetails(MapTask.java:340) > > >>> ... 7 more > > >> > > >> -- > > >> This message is automatically generated by JIRA. > > >> If you think it was sent incorrectly, please contact your JIRA > > >> administrators > > >> For more information on JIRA, see: > > http://www.atlassian.com/software/jira > > >> > > > > > > > > > > > > -- > > > Director of Data Science > > > Cloudera > > > Twitter: @josh_wills > > > > > -- Harsh J --20cf301d425a6be96e04dd681731--