Date: Wed, 10 Jul 2013 10:15:58 -0700 (PDT)
From: "S. Zhou"
Subject: Re: MapReduce job with mixed data sources: HBase table and HDFS files
To: "user@hbase.apache.org"

Hi Azuryy,

I am testing the approach you suggested, and I am now getting a compilation error on the following statement:

conf.set(TableInputFormat.SCAN, TableMapReduceUtil.convertScanToString(new Scan()));

The error is: "method convertScanToString is not visible in TableMapReduceUtil". Could you help? It is blocking me.

BTW, I am using the HBase-server jar, version 0.95.1-hadoop1. I also tried other versions, such as 0.94.9, and got the same error.

Thanks!

________________________________
From: Azuryy Yu
To: user@hbase.apache.org
Sent: Wednesday, July 3, 2013 6:02 PM
Subject: Re: MapReduce job with mixed data sources: HBase table and HDFS files

Hi,
1) An MR job cannot take its input from two different clusters.
2) If your data is located in the same cluster, then:

    conf.set(TableInputFormat.SCAN,
        TableMapReduceUtil.convertScanToString(new Scan()));
    conf.set(TableInputFormat.INPUT_TABLE, tableName);

    MultipleInputs.addInputPath(conf, new Path(input_on_hdfs),
        TextInputFormat.class, MapperForHdfs.class);
    MultipleInputs.addInputPath(conf, new Path(input_on_hbase),
        TableInputFormat.class, MapperForHBase.class);

but new Path(input_on_hbase) can be any path; the value itself is meaningless.

Please refer to org.apache.hadoop.hbase.mapreduce.IndexBuilder, under
$HBASE_HOME/src/example, for how to read a table in an MR job.


On Thu, Jul 4, 2013 at 5:19 AM, Michael Segel wrote:

> You may want to pull your data from HBase first in a separate map-only
> job and then use its output along with the other HDFS input.
> There is a significant disparity between reads from HDFS and reads from
> HBase.
>
>
> On Jul 3, 2013, at 10:34 AM, S. Zhou wrote:
>
> > Azuryy, I am looking at the MultipleInputs doc, but I could not figure
> > out how to add an HBase table as a Path to the input. Do you have some
> > sample code? Thanks!
> >
> >
> > ________________________________
> > From: Azuryy Yu
> > To: user@hbase.apache.org; S. Zhou
> > Sent: Tuesday, July 2, 2013 10:06 PM
> > Subject: Re: MapReduce job with mixed data sources: HBase table and HDFS
> > files
> >
> >
> > Hi,
> >
> > Use MultipleInputs, which can solve your problem.
> >
> >
> > On Wed, Jul 3, 2013 at 12:34 PM, S. Zhou wrote:
> >
> >> Hi there,
> >>
> >> I know how to create a MapReduce job with only an HBase table or only
> >> HDFS files as the data source. Now I need to create a MapReduce job
> >> with mixed data sources, that is, the MR job needs to read data from
> >> both HBase and HDFS files. Is it possible? If yes, could you share
> >> some sample code?
> >>
> >> Thanks!
> >> Senqiang
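
For reference, the error above says that convertScanToString cannot be called from user code in the versions tried. The general pattern behind the conf.set(TableInputFormat.SCAN, ...) line is to pass a structured object through a string-valued job configuration by serializing it to bytes and Base64-encoding the result. Below is a minimal, self-contained sketch of that pattern only. It is not HBase's implementation: ScanSpec, ScanConfigSketch, SCAN_KEY, and the field layout are all hypothetical stand-ins, and a plain Map stands in for the Hadoop Configuration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.Base64;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a scan description; this is NOT an HBase class.
final class ScanSpec {
    final String startRow;
    final String stopRow;
    final int caching;

    ScanSpec(String startRow, String stopRow, int caching) {
        this.startRow = startRow;
        this.stopRow = stopRow;
        this.caching = caching;
    }
}

final class ScanConfigSketch {
    // Hypothetical configuration key, chosen for this example only.
    static final String SCAN_KEY = "example.mapreduce.scan";

    // Serialize the spec to bytes, then Base64-encode so it fits in a string property.
    static String encode(ScanSpec spec) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeUTF(spec.startRow);
            out.writeUTF(spec.stopRow);
            out.writeInt(spec.caching);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return Base64.getEncoder().encodeToString(bytes.toByteArray());
    }

    // Reverse of encode: Base64-decode, then read the fields back in the same order.
    static ScanSpec decode(String encoded) {
        byte[] raw = Base64.getDecoder().decode(encoded);
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(raw))) {
            String startRow = in.readUTF();
            String stopRow = in.readUTF();
            int caching = in.readInt();
            return new ScanSpec(startRow, stopRow, caching);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // A plain map stands in for the job configuration (assumption for this sketch).
        Map<String, String> conf = new HashMap<>();
        conf.put(SCAN_KEY, encode(new ScanSpec("row-000", "row-999", 500)));

        // What the reading side (for example, an input format) would do.
        ScanSpec restored = decode(conf.get(SCAN_KEY));
        System.out.println(restored.startRow + " .. " + restored.stopRow
                + " caching=" + restored.caching);
    }
}
```

The sketch only shows why a scan can travel inside a string property. Whether a hand-rolled encoding is compatible with what TableInputFormat expects depends on the HBase version, so treat it as an illustration rather than a drop-in replacement for the inaccessible method.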