Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 208ABD91D for ; Tue, 18 Dec 2012 01:13:31 +0000 (UTC) Received: (qmail 71866 invoked by uid 500); 18 Dec 2012 01:13:26 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 71758 invoked by uid 500); 18 Dec 2012 01:13:25 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 71751 invoked by uid 99); 18 Dec 2012 01:13:25 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 18 Dec 2012 01:13:25 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of masmertoz@gmail.com designates 209.85.214.177 as permitted sender) Received: from [209.85.214.177] (HELO mail-ob0-f177.google.com) (209.85.214.177) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 18 Dec 2012 01:13:18 +0000 Received: by mail-ob0-f177.google.com with SMTP id uo13so44815obb.22 for ; Mon, 17 Dec 2012 17:12:58 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=4cW034ttMQZFb3c3tcmqQKGKgp4oo6kVoXrlUiypx6c=; b=PoUXAfxKh2rMUoKFJkvGq8ACMXpkLpuIgd3yvL0Jk7jBMOOozRyjrDhyilcG5fTvVa OxrbFCYxQ5P8EAsrS4W7XfNAPQhywGfouXEMlSxfNlqmEphOYGXmXsMmL/MUj1nyzHk4 sJsRrdYOIsroR0ACLLwaT6+0LAdQpkO8uHZn3CcKIY36a5NgUYBNbxBPFc+H3zWw90c8 piT981i0CoP3+JVd3X4bpgXl0o2PAlpWh+b4Natl5YmUX4BFb3aCUi+pTwfKZ3RgH9TH GFxbxIuOgyguL7Qhn9jKDxvfv17/swL6kyCEOvewnLZR+biaE86Aab1j00wo4MmdmrCl +0ag== MIME-Version: 1.0 Received: by 10.60.31.84 with SMTP id y20mr184647oeh.91.1355793178372; Mon, 17 Dec 2012 17:12:58 -0800 (PST) Received: by 10.60.116.97 with HTTP; Mon, 17 Dec 2012 17:12:58 -0800 (PST) Date: Tue, 18 Dec 2012 02:12:58 +0100 Message-ID: Subject: Cluster administration advice needed From: Merto Mertek To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=e89a8fb1ee7ac929b704d116321a X-Virus-Checked: Checked by ClamAV on apache.org --e89a8fb1ee7ac929b704d116321a Content-Type: text/plain; charset=ISO-8859-1 Hi, I was working with a small Hadoop cluster while I was developing a new scheduler, however the cluster was used only for development purposes and never in production so I am wondering what obstacles are you facing in a typical day-to-day cluster administration? We have been discussing with an ad-company (which has their own development team) about building a platform with hbase, hadoop and maybe some in-memory database for caching. My part would be to establish a small cluster (~ 5nodes) that would satisfy their requirements and to monitor its behavior. Because of my current job probably I will not be available at their site for full-time, so I am wondering: a) What things are taking most of your time in cluster administration? b) How many hours should I plan to administer the cluster when the infrastructure and data is ready (probably this will be a long process) ... c) What tasks besides software updates, schema updates, monitoring, additional provisioning should I plan ? Thank you... --e89a8fb1ee7ac929b704d116321a Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hi,

I was working with a small= Hadoop cluster while I was developing a new scheduler, however the cluster= was used only for development purposes and never in production so I am won= dering what obstacles are you facing in a typical day-to-day cluster admini= stration?

We have been discussing with an ad-company (which has their own develop= ment team) about building a platform with hbase, hadoop and maybe some in-m= emory database for caching. My part would be to establish a small cluster (= ~ 5nodes) that would satisfy their requirements and to monitor its behavior= . Because of my current job probably I will not be available at their site = for full-time, so I am wondering:


a) What things are taking most of your time in clu= ster administration?
b) How many hours should I plan to administ= er the cluster when the=20 infrastructure and data is ready (probably this will be a long process)=20 ...
c) What tasks besides software updates, schema updates, monitoring,= additional provisioning should I plan ?


<= br>
Thank you...









--e89a8fb1ee7ac929b704d116321a--