Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 67205EBA7 for ; Mon, 18 Feb 2013 11:31:28 +0000 (UTC) Received: (qmail 95554 invoked by uid 500); 18 Feb 2013 11:31:23 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 95467 invoked by uid 500); 18 Feb 2013 11:31:23 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 95448 invoked by uid 99); 18 Feb 2013 11:31:22 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 18 Feb 2013 11:31:22 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.220.51] (HELO mail-pa0-f51.google.com) (209.85.220.51) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 18 Feb 2013 11:31:13 +0000 Received: by mail-pa0-f51.google.com with SMTP id hz1so2782968pad.10 for ; Mon, 18 Feb 2013 03:30:50 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=x-received:message-id:date:from:user-agent:mime-version:to:subject :references:in-reply-to:x-forwarded-message-id:content-type :content-transfer-encoding:x-gm-message-state; bh=smZdmbfQv6MgIEyrziqKI1Ej1MjfsaK7VSYLAYOy6FY=; b=nb9FXZz74fKmV6WUSPonsrcR3ETKSKCFTGTkpyLgoNx38Ga0DRo/AQaRjv7jAp9ROK 7TdcFaYXDRakPwl8B3p26gfoNdmgNLwg6/76/7Sj6AADRMzHgwXgDEvuIihCrs0gdn7g JXvZPoUIzfpYayMFzGHgYzav/obbgdQO2nQoX3aTDRVFqgQNX7LoIdIw6EFjsvcyzByo NcqEgzNoDdWMvXGehT8CrxkWY2Iz+AdKyDJTymvGL9WNKPT27AS+ONSkPyXnGsj3LhHp BMi4gR9cxG7d/j7Y9s6yl2MK9xnY5+xHco4fvFtCyboho7Zt/K9zVueuwoKjZw7R+TZ2 6eqA== X-Received: by 10.66.251.162 with SMTP id zl2mr23239870pac.36.1361187050608; Mon, 18 Feb 2013 03:30:50 -0800 (PST) Received: from [166.104.144.124] ([166.104.144.124]) by mx.google.com with ESMTPS id zm1sm14215803pbc.26.2013.02.18.03.30.48 (version=TLSv1 cipher=RC4-SHA bits=128/128); Mon, 18 Feb 2013 03:30:49 -0800 (PST) Message-ID: <512210E7.7040004@agape.hanyang.ac.kr> Date: Mon, 18 Feb 2013 20:30:47 +0900 From: Masoud User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130110 Thunderbird/17.0.2 MIME-Version: 1.0 To: user@hadoop.apache.org Subject: Database insertion by HAdoop References: <51220E58.8090803@agape.hanyang.ac.kr> In-Reply-To: <51220E58.8090803@agape.hanyang.ac.kr> X-Forwarded-Message-Id: <51220E58.8090803@agape.hanyang.ac.kr> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Gm-Message-State: ALoCoQmE1/oykC0gw6c2Qbx9V46aWJ675mLXMN0VUaKyHGSTnR9DvAHk1hWqJUzAW3f/WtarFU/3 X-Virus-Checked: Checked by ClamAV on apache.org Dear All, We are going to do our experiment of a scientific papers, ] We must insert data in our database for later consideration, it almost 300 tables each one has 2/000/000 records. as you know It takes lots of time to do it with a single machine, we are going to use our Hadoop cluster (32 machines) and divide 300 insertion tasks between them, I need some hint to progress faster, 1- as i know we dont need to Reduser, just Mapper in enough. 2- so wee need just implement Mapper class with needed code. Please let me know if there is any point, Best Regards Masoud