Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F1D6311ABA for ; Fri, 21 Feb 2014 04:21:13 +0000 (UTC) Received: (qmail 37816 invoked by uid 500); 21 Feb 2014 04:21:06 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 37319 invoked by uid 500); 21 Feb 2014 04:21:04 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 37312 invoked by uid 99); 21 Feb 2014 04:21:03 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 21 Feb 2014 04:21:03 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of suresh@hortonworks.com designates 209.85.192.169 as permitted sender) Received: from [209.85.192.169] (HELO mail-pd0-f169.google.com) (209.85.192.169) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 21 Feb 2014 04:20:55 +0000 Received: by mail-pd0-f169.google.com with SMTP id v10so2756855pde.0 for ; Thu, 20 Feb 2014 20:20:34 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :message-id:references:to:content-type:content-transfer-encoding; bh=Zb4Y6sbrcAvrRDGrCSCFMpAtrN65l3+mks4/h34dozQ=; b=Er/O5uWaWfyZlHjcncd7fe03wcdsxkXgd+VXXUEqwZhHSlXCdNKHkjHtzdc0nizxyL ZZrR/aPf7WRFlMT2bm1l4f+0CM6Q6CODSpd+XM8R6FZGL/Ih7qcfyHHCkfLVA86VEuCq otIn+02F93KjRD4TYqR6cITaH63lyp/XmXyewlD8n+463JwMEiPvfcO4emPPe1qu3GIU fjUOEwEEF/7fhfH2aEimJ/i0lx2JCZj8QWOZRosLI3bUwMbc/gFxDB24lEL3Nsh5gkhk TOrnjuiLidLjiNiqvT5tFLfX5Hh0FPdwhJpdsFhBfOU9jzRx0D9O7hvQgqM5IF2HyGTF TbIQ== X-Gm-Message-State: ALoCoQmN0fIcRxwu7ZJy7PKstlakbm+xUI/ZTJXM/aH1XHl/871MklYdk1O3BoM2UbPFzOGLREYT5R/H9q+Dy8qrYdoMvjoWYy6RaDIkUl+UedHhWQbvFqY= X-Received: by 10.66.142.42 with SMTP id rt10mr6614385pab.1.1392956434144; Thu, 20 Feb 2014 20:20:34 -0800 (PST) Received: from [192.168.1.82] (172-10-237-5.lightspeed.sntcca.sbcglobal.net. [172.10.237.5]) by mx.google.com with ESMTPSA id sx8sm39935452pab.5.2014.02.20.20.20.33 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Thu, 20 Feb 2014 20:20:33 -0800 (PST) Mime-Version: 1.0 (1.0) Subject: Re: any optimize suggestion for high concurrent write into hdfs? From: Suresh Srinivas X-Mailer: iPhone Mail (11B554a) In-Reply-To: Date: Thu, 20 Feb 2014 20:20:32 -0800 Cc: "user@hadoop.apache.org" Message-Id: <933F52AD-C9EC-4CEF-8620-38917E3937F5@hortonworks.com> References: To: "user@hadoop.apache.org" Content-Type: multipart/alternative; boundary=Apple-Mail-9ED84BF9-B955-43BA-9463-968707A7232D Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org --Apple-Mail-9ED84BF9-B955-43BA-9463-968707A7232D Content-Type: text/plain; charset=ISO-8859-1 Another alternative is to write block sized chunks into multiple hdfs files concurrently followed by concat to all those into a single file. Sent from phone > On Feb 20, 2014, at 8:15 PM, Chen Wang wrote: > > Ch, > you may consider using flume as it already has a flume sink that can sink to hdfs. What I did is to set up a flume listening on an Avro sink, and then sink to hdfs. Then in my application, i just send my data to avro socket. > Chen > > >> On Thu, Feb 20, 2014 at 5:07 PM, ch huang wrote: >> hi,maillist: >> is there any optimize for large of write into hdfs in same time ? thanks > -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. --Apple-Mail-9ED84BF9-B955-43BA-9463-968707A7232D Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Another alternative is to write bl= ock sized chunks into multiple hdfs files concurrently followed by concat t= o all those into a single file. 

Sent from phone

= On Feb 20, 2014, at 8:15 PM, Chen Wang <chen.apache.solr@gmail.com> wrote:

Ch,
you may consider using fl= ume as it already has a flume sink that can sink to hdfs. What I did is to = set up a flume listening on an Avro sink, and then sink to hdfs. Then in my= application, i just send my data to avro socket.
Chen

CONFIDENTIALITY NOTICE
NOTICE: This message is = intended for the use of the individual or entity to which it is addressed a= nd may contain information that is confidential, privileged and exempt from= disclosure under applicable law. If the reader of this message is not the = intended recipient, you are hereby notified that any printing, copying, dis= semination, distribution, disclosure or forwarding of this communication is= strictly prohibited. If you have received this communication in error, ple= ase contact the sender immediately and delete it from your system. Thank Yo= u. --Apple-Mail-9ED84BF9-B955-43BA-9463-968707A7232D--