Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ACEFB17933 for ; Sun, 24 May 2015 15:43:56 +0000 (UTC) Received: (qmail 77146 invoked by uid 500); 24 May 2015 15:43:54 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 77078 invoked by uid 500); 24 May 2015 15:43:54 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 77066 invoked by uid 99); 24 May 2015 15:43:54 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 24 May 2015 15:43:54 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 7C582C7EFD for ; Sun, 24 May 2015 15:43:53 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0 X-Spam-Level: X-Spam-Status: No, score=0 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (1024-bit key) header.d=messagingengine.com Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id RlF4NsEtYpdD for ; Sun, 24 May 2015 15:43:49 +0000 (UTC) Received: from out5-smtp.messagingengine.com (out5-smtp.messagingengine.com [66.111.4.29]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id E58C7428E3 for ; Sun, 24 May 2015 15:43:48 +0000 (UTC) Received: from compute5.internal (compute5.nyi.internal [10.202.2.45]) by mailout.nyi.internal (Postfix) with ESMTP id CA337206B3 for ; Sun, 24 May 2015 11:43:48 -0400 (EDT) Received: from web6 ([10.202.2.216]) by compute5.internal (MEProxy); Sun, 24 May 2015 11:43:48 -0400 DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d= messagingengine.com; h=content-transfer-encoding:content-type :date:from:message-id:mime-version:subject:to:x-sasl-enc :x-sasl-enc; s=smtpout; bh=iCtDQBlduWiAOgBFh9qU4ashCv0=; b=MID8a yrSoDMOvb5nKiW6bKOlfgTARHIkpnJMF8f8j2P1xGCeeNDQKe+qgOXCeKegL3zxB v2hHQuZO/9yi/EHMmkzx18PEurY8WewCiwiiFQA26owXQovJNTCig8A4pzlf2aBO QC2ina8FnDHiGk2q3zvRXDcMVhrmTaUGFSoB5w= Received: by web6.nyi.internal (Postfix, from userid 99) id 8BE5A4B847; Sun, 24 May 2015 11:43:48 -0400 (EDT) Message-Id: <1432482228.3215255.276924801.438762B0@webmail.messagingengine.com> X-Sasl-Enc: pCXRrTJ1u4O+SAtoiEGyvz8ufr7Bj5TntlRpuNc4VuTz 1432482228 From: apache@borkbork.net To: user@hbase.apache.org MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Type: text/plain X-Mailer: MessagingEngine.com Webmail Interface - ajax-75d7cc2d Subject: Issues with import from 0.92 into 0.98 Date: Sun, 24 May 2015 11:43:48 -0400 Hello all- I'm hoping someone can point me in the right direction as I've exhausted all my knowledge and abilities on the topic... I've inherited an old, poorly configured and brittle CDH4 cluster running HBase 0.92. I'm attempting to migrate the data to a new Ambari cluster running HBase 0.98. I'm attempting to do this without changing anything on the old cluster as I have hard enough time keeping it running as is. Also, due to configuration issues with the old cluster (on AWS), a direct HBase to HBase table copy, or even HDFS to HDFS copy is out of the question at the moment. I was able to use the export task on the old cluster to dump the HBase tables to HDFS, which I then distcp s3n copied up to S3, then back down to the new cluster, then used the HBase importer. This appears to work fine... ... except that on the new cluster table scans with column filters do not work. A sample row looks something this: A:9223370612274019807:twtr:56935907581904486 column=x:twitter:username, timestamp=1424592575087, value=Bilo Selhi Unfortunately, even though I can see the column is properly defined, I cannot filter on it: hbase(main):015:0> scan 'content' , {LIMIT=>10, COLUMNS=>'x:twitter:username'} ROW COLUMN+CELL 0 row(s) in 352.7990 seconds Any ideas what the heck is going here? Here's the rough process I used for the export/import: Old cluster: $ hbase org.apache.hadoop.hbase.mapreduce.Driver export content hdfs:///hbase_content $ hadoop distcp -Dfs.s3n.awsAccessKeyId='xxxx' -Dfs.s3n.awsSecretAccessKey='xxxx' -i hdfs:///hbase_content s3n://hbase_content New cluster: $ hadoop distcp -Dfs.s3n.awsAccessKeyId='xxxx' -Dfs.s3n.awsSecretAccessKey='xxxx' -i s3n://hbase_content hdfs:///hbase_content $ hbase -Dhbase.import.version=0.94 org.apache.hadoop.hbase.mapreduce.Driver import content hdfs:///hbase_content Thanks! Z