Return-Path: Delivered-To: apmail-hadoop-core-user-archive@www.apache.org Received: (qmail 16645 invoked from network); 2 Apr 2009 18:59:48 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 2 Apr 2009 18:59:48 -0000 Received: (qmail 12196 invoked by uid 500); 2 Apr 2009 18:59:46 -0000 Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org Received: (qmail 12112 invoked by uid 500); 2 Apr 2009 18:59:46 -0000 Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-user@hadoop.apache.org Delivered-To: mailing list core-user@hadoop.apache.org Received: (qmail 12102 invoked by uid 99); 2 Apr 2009 18:59:46 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 02 Apr 2009 18:59:46 +0000 X-ASF-Spam-Status: No, hits=2.4 required=10.0 tests=HTML_MESSAGE,SPF_PASS,WHOIS_MYPRIVREG X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of airbot@gmail.com designates 74.125.44.30 as permitted sender) Received: from [74.125.44.30] (HELO yx-out-2324.google.com) (74.125.44.30) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 02 Apr 2009 18:59:38 +0000 Received: by yx-out-2324.google.com with SMTP id 3so434667yxj.29 for ; Thu, 02 Apr 2009 11:59:17 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type; bh=2d3ThPCmTAC6WwCXC5AS5PL1nyvsUbUnmVKcq/521QE=; b=N7dfjPrizC/BDIe+DGhgwlI0aWQdtZx/gRZuZ35u0PNoUYT4Q2WbX17Y3rluBSn7Q2 9Kof8535xadDrxj03eZa8d9P4CccB1VvWtpuXegnqXcMp3Too2UrF1Gprq+YHZwn3Qmz YZvDBOpmQnanj+MkP0G1pG7tNumVN3yabNUYo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=DKYGvuWHAdhKymsNQ0ZDeZC5TTEcLSq7mtY6MLu2elWrDGVvXIMtAkrH3ihZkKKa7C zYEoqvzN1INNvR+oU44E566eWnMJHrtd8zMeMBjeJ6b8Hoc1TNwocqUkVjLObh5iWmKm /Ck/kUZ32rsoCci977nK0LysSVfVumToIWYQs= MIME-Version: 1.0 Received: by 10.151.47.7 with SMTP id z7mr644858ybj.64.1238698757010; Thu, 02 Apr 2009 11:59:17 -0700 (PDT) In-Reply-To: <3df32b6d0904021012r2619938cjb6185f7ee7ab38f4@mail.gmail.com> References: <22844277.post@talk.nabble.com> <22848070.post@talk.nabble.com> <3df32b6d0904021012r2619938cjb6185f7ee7ab38f4@mail.gmail.com> Date: Thu, 2 Apr 2009 13:59:16 -0500 Message-ID: Subject: Re: HELP: I wanna store the output value into a list not write to the disk From: He Chen To: core-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=0015174c352c6537b60466970755 X-Virus-Checked: Checked by ClamAV on apache.org --0015174c352c6537b60466970755 Content-Type: text/plain; charset=ISO-8859-2 Content-Transfer-Encoding: quoted-printable It seems like the InMemoryFileSystem class has been deprecated in Hadoop 0.19.1. Why? I want to reuse the result of reduce as the next time map's input. Cascadin= g does not work, because the data of each step is dependent. I set each timestep mapreduce job as synchronization. If the InMemoryFileSystem is deprecated. How can I reduce the I/O for each timestep's mapreduce job. 2009/4/2 Farhan Husain > Is there a way to implement some OutputCollector that can do what Andy > wants > to do? > > On Thu, Apr 2, 2009 at 10:21 AM, Rasit OZDAS wrote= : > > > Andy, I didn't try this feature. But I know that Yahoo had a > > performance record with this file format. > > I came across a file system included in hadoop code (probably that > > one) when searching the source code. > > Luckily I found it: org.apache.hadoop.fs.InMemoryFileSystem > > But if you have a lot of big files, this approach won't be suitable I > > think. > > > > Maybe someone can give further info. > > > > 2009/4/2 andy2005cst : > > > > > > thanks for your reply. Let me explain more clearly, since Map Reduce = is > > just > > > one step of my program, I need to use the output of reduce for furtur= e > > > computation, so i do not need to want to wirte the output into disk, > but > > > wanna to get the collection or list of the output in RAM. if it > directly > > > wirtes into disk, I have to read it back into RAM again. > > > you have mentioned a special file format, will you please show me wha= t > is > > > it? and give some example if possible. > > > > > > thank you so much. > > > > > > > > > Rasit OZDAS wrote: > > >> > > >> Hi, hadoop is normally designed to write to disk. There are a specia= l > > file > > >> format, which writes output to RAM instead of disk. > > >> But I don't have an idea if it's what you're looking for. > > >> If what you said exists, there should be a mechanism which sends > output > > as > > >> objects rather than file content across computers, as far as I know > > there > > >> is > > >> no such feature yet. > > >> > > >> Good luck. > > >> > > >> 2009/4/2 andy2005cst > > >> > > >>> > > >>> I need to use the output of the reduce, but I don't know how to do. > > >>> use the wordcount program as an example if i want to collect the > > >>> wordcount > > >>> into a hashtable for further use, how can i do? > > >>> the example just show how to let the result onto disk. > > >>> myemail is : andy2005cst@gmail.com > > >>> looking forward your help. thanks a lot. > > >>> -- > > >>> View this message in context: > > >>> > > > http://www.nabble.com/HELP%3A-I-wanna-store-the-output-value-into-a-list-= not-write-to-the-disk-tp22844277p22844277.html > > >>> Sent from the Hadoop core-user mailing list archive at Nabble.com. > > >>> > > >>> > > >> > > >> > > >> -- > > >> M. Ra=BAit =D6ZDA=AA > > >> > > >> > > > > > > -- > > > View this message in context: > > > http://www.nabble.com/HELP%3A-I-wanna-store-the-output-value-into-a-list-= not-write-to-the-disk-tp22844277p22848070.html > > > Sent from the Hadoop core-user mailing list archive at Nabble.com. > > > > > > > > > > > > > > -- > > M. Ra=BAit =D6ZDA=AA > > > > > > -- > Mohammad Farhan Husain > Research Assistant > Department of Computer Science > Erik Jonsson School of Engineering and Computer Science > University of Texas at Dallas > --=20 Chen He RCF CSE Dept. University of Nebraska-Lincoln US --0015174c352c6537b60466970755--