From: Josh Wills
Date: Mon, 4 Feb 2013 19:19:12 -0800
Subject: Re: Visualize DAG of a pipeline
To: "crunch-user@incubator.apache.org", Gabriel Reid

+greid

Gabriel wrote one, IIRC. I think that a .dot file with the plan for the job gets embedded in the Configuration object returned from the planner.

On Mon, Feb 4, 2013 at 7:13 PM, Chao Shi <stepinto@live.com> wrote:
> Hi crunch users,
>
> I would like to know if there is any tool to help me understand Crunch's
> optimized MR stages.
>
> In particular, I think I need to see the DAG of job stages. I'm writing a
> pipeline that consists of several joins. The pipeline produces significantly
> more intermediate output than I expect. I want to investigate what's going
> wrong there.
>
> Thanks,
> Chao

--
Director of Data Science
Cloudera
Twitter: @josh_wills
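
A minimal sketch of what pulling that plan out might look like, assuming the plan is stored as a plain string property. A `Map<String, String>` stands in for the Hadoop Configuration so the example runs without Crunch on the classpath, and the key name `crunch.planner.dotfile` is an assumption that this thread does not confirm; check the planner source for the actual property name. The class and method names are hypothetical.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Map;

public class PlanDotDump {
    // Assumed property name; not confirmed anywhere in this thread.
    static final String DOT_KEY = "crunch.planner.dotfile";

    /**
     * Writes the DOT text stored under DOT_KEY to the target file.
     * Returns false (and writes nothing) if the property is missing or empty.
     */
    static boolean dumpPlan(Map<String, String> conf, Path target) throws IOException {
        String dot = conf.get(DOT_KEY);
        if (dot == null || dot.isEmpty()) {
            return false;
        }
        Files.write(target, dot.getBytes(StandardCharsets.UTF_8));
        return true;
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for the Configuration returned from the planner.
        Map<String, String> conf = new HashMap<>();
        conf.put(DOT_KEY, "digraph G { \"read\" -> \"join\"; \"join\" -> \"write\"; }");

        Path out = Files.createTempFile("plan", ".dot");
        if (dumpPlan(conf, out)) {
            System.out.println("Wrote plan to " + out);
        }
    }
}
```

The resulting file can then be rendered with Graphviz, for example `dot -Tpng plan.dot -o plan.png`, to see which joins end up in which MR stage.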