Return-Path: X-Original-To: apmail-tajo-dev-archive@minotaur.apache.org Delivered-To: apmail-tajo-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F37D810BA0 for ; Fri, 6 Dec 2013 00:55:01 +0000 (UTC) Received: (qmail 737 invoked by uid 500); 6 Dec 2013 00:55:01 -0000 Delivered-To: apmail-tajo-dev-archive@tajo.apache.org Received: (qmail 665 invoked by uid 500); 6 Dec 2013 00:55:01 -0000 Mailing-List: contact dev-help@tajo.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@tajo.incubator.apache.org Delivered-To: mailing list dev@tajo.incubator.apache.org Received: (qmail 656 invoked by uid 99); 6 Dec 2013 00:55:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 06 Dec 2013 00:55:01 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.3] (HELO mail.apache.org) (140.211.11.3) by apache.org (qpsmtpd/0.29) with SMTP; Fri, 06 Dec 2013 00:54:58 +0000 Received: (qmail 284 invoked by uid 99); 6 Dec 2013 00:54:36 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 06 Dec 2013 00:54:36 +0000 Date: Fri, 6 Dec 2013 00:54:36 +0000 (UTC) From: "Keuntae Park (JIRA)" To: dev@tajo.incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (TAJO-317) Improve TajoResourceManager to support more elaborate resource management MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/TAJO-317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13840755#comment-13840755 ] Keuntae Park commented on TAJO-317: ----------------------------------- You are right, Jihoon. I'm confused between default configuration and my working configuration :) I agree with your fixed documentation, which is correct and much easier to understand. Thank you for the review !! > Improve TajoResourceManager to support more elaborate resource management > ------------------------------------------------------------------------- > > Key: TAJO-317 > URL: https://issues.apache.org/jira/browse/TAJO-317 > Project: Tajo > Issue Type: Improvement > Components: resource manager > Reporter: Hyunsik Choi > Assignee: Keuntae Park > Fix For: 0.8-incubating > > Attachments: TAJO-317.doc.patch, TAJO-317.doc_2.patch, TAJO-317.patch, TAJO-317_2.patch, TAJO-317_3.patch, TAJO-317_4.patch, TAJO-317_5.patch > > > h3. Status of the current Tajo Resource Manager (RM) > * Tajo RM manages CPU, DISK resource incompletely, and it only provides resource management through memory allocations. > * In addition, Tajo RM considers the memory resource as the fixed number of slots. > h3. Problem > In many cases, workloads can be categorized into I/O intensive job and CPU and memory consuming job. For example, scan and hash partition or INSERT OVERWRITE may be belong to I/O intensive job. In general, Aggregation can be belong to CPU-memory consuming job. The current RM is not fit to support selectively I/O intensive job or CPU-memory consuming job because it provides only memory slots. We need more elaborate resource management mechanism. > In addition, in most resource management systems, the remain resource less than required resource is not allocated in response to a resource request. It is not good to fully utilize the cluster resources. In order to mitigate this problem, we need to add resilience to allocation mechanism. For example, min-max request would be useful for it. > h3. Proposal > * Tajo RM should provides resource management for disk and cpu-memory. > ** Tajo RM should provide allocation request call with min, max memory request, and min, max disk request. > *** min-max request will be useful to fully utilize remain cluster resources. > * Each resource request should have a priority. The priority can be disk or memory. > ** If the priority is disk > *** disk allocation will be limited depending on the remain disk resource > *** memory allocation will be not limited regardless of the remain memory resource, and just reduce the remain memory resource. > ** If the priority is memory > *** memory allocation will be limited depending on the remain memory resource > *** disk allocation will be not limited regardless of the remain disk resource, and just reduce the remain disk resource. > * disk resource in each worker is represented as a float value. > ** The initial disk resource will be the number of disks which participate in HDFS data directory. -- This message was sent by Atlassian JIRA (v6.1#6144)