Return-Path: Delivered-To: apmail-hadoop-pig-dev-archive@www.apache.org Received: (qmail 35752 invoked from network); 1 Apr 2010 00:34:50 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 1 Apr 2010 00:34:50 -0000 Received: (qmail 76762 invoked by uid 500); 1 Apr 2010 00:34:50 -0000 Delivered-To: apmail-hadoop-pig-dev-archive@hadoop.apache.org Received: (qmail 76748 invoked by uid 500); 1 Apr 2010 00:34:50 -0000 Mailing-List: contact pig-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: pig-dev@hadoop.apache.org Delivered-To: mailing list pig-dev@hadoop.apache.org Received: (qmail 76740 invoked by uid 99); 1 Apr 2010 00:34:50 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Apr 2010 00:34:50 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Apr 2010 00:34:48 +0000 Received: from brutus.apache.org (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 3EEC3234C4C8 for ; Thu, 1 Apr 2010 00:34:27 +0000 (UTC) Message-ID: <143835218.621711270082067256.JavaMail.jira@brutus.apache.org> Date: Thu, 1 Apr 2010 00:34:27 +0000 (UTC) From: "Chao Wang (JIRA)" To: pig-dev@hadoop.apache.org Subject: [jira] Updated: (PIG-1342) [Zebra] Avoid making unnecessary name node calls for writes in Zebra In-Reply-To: <1671334511.593131269994467181.JavaMail.jira@brutus.apache.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/PIG-1342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Wang updated PIG-1342: --------------------------- Attachment: PIG-1342.patch > [Zebra] Avoid making unnecessary name node calls for writes in Zebra > -------------------------------------------------------------------- > > Key: PIG-1342 > URL: https://issues.apache.org/jira/browse/PIG-1342 > Project: Pig > Issue Type: Improvement > Affects Versions: 0.6.0, 0.7.0 > Reporter: Chao Wang > Assignee: Chao Wang > Fix For: 0.8.0 > > Attachments: PIG-1342.patch > > > Currently, table and column group level meta data is extracted from job configuration object and written onto HDFS disk within checkOutputSpec(). Later on, writers at back end will open these files to access the meta data for doing writes. This puts extra load to name node since all writers need to make name node calls to open files. > We propose the following approach to this problem: > For writers at back end, they extract meta information from job configuration object directly, rather than making name node calls and going to HDFS disk to fetch the information. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.