Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 06F19DAC5 for ; Thu, 4 Oct 2012 15:56:53 +0000 (UTC) Received: (qmail 82933 invoked by uid 500); 4 Oct 2012 15:56:48 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 82806 invoked by uid 500); 4 Oct 2012 15:56:48 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 82799 invoked by uid 99); 4 Oct 2012 15:56:48 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 04 Oct 2012 15:56:48 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.214.48] (HELO mail-bk0-f48.google.com) (209.85.214.48) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 04 Oct 2012 15:56:40 +0000 Received: by mail-bk0-f48.google.com with SMTP id ik5so402045bkc.35 for ; Thu, 04 Oct 2012 08:56:20 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:x-gm-message-state; bh=IQi8YZvpxFvAEXfd/zBZzYgiHGBcvhVTMqdC+R5de28=; b=OGynXxWOLBJSqXYDiXaw54SoG06DJfWQ+3g6jBCgZGrskN6oPoMdxQXNHz8r1KgUBq WvfDvNSdf3VRtnxoV+nf8u+tpFDMGde9bKidRdluA4AN30krRqp37X0Lkr0X5xuOl9+d AftAUfql7oQfyu898gj6C6F/PuKJWwbZGG7PJisVmLbPBXQ6M91UV+uECnheL/IARyQh n1cnhTUtYm4kj0IfGAfw0ocsclRdBvXk0jEhgzMLy5aJCoColdqwj98BKveio3DkW+yO iw55efyj6cj2WfTc2LPxD24jMLCY1+JxKoHtDYIc9YnNMehdx7ZBsXHeqLabXsv3IIq1 6ILg== Received: by 10.204.5.206 with SMTP id 14mr1820436bkw.79.1349366179755; Thu, 04 Oct 2012 08:56:19 -0700 (PDT) Received: from [192.168.1.106] (static.201.161.129.213.ibercom.com. [213.129.161.201]) by mx.google.com with ESMTPS id ht18sm1377676bkc.14.2012.10.04.08.56.18 (version=SSLv3 cipher=OTHER); Thu, 04 Oct 2012 08:56:18 -0700 (PDT) Message-ID: <506DB19A.6020605@planetalia.com> Date: Thu, 04 Oct 2012 17:56:10 +0200 From: Alexander Hristov User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:15.0) Gecko/20120907 Thunderbird/15.0.1 MIME-Version: 1.0 To: user@hadoop.apache.org Subject: Re: Hadoop Archives under 0.23 References: <506A85B3.90501@planetalia.com> In-Reply-To: Content-Type: multipart/alternative; boundary="------------050308020409050103090004" X-Gm-Message-State: ALoCoQm6Sls+NYwWNq1zfl/FrGWNvv+nWZ9Kz/htlIr0zpYLOqi+7LdEvjtsegne3z4MfJY/r6Nl This is a multi-part message in MIME format. --------------050308020409050103090004 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Thanks, but hadoop dfs is deprecated, hdfs is the recommended way. In any case, the result is exactly the same. > use bin/hadoop dfs -lsr har:///sample/test.har > not dfs -ls -R har:///sample/test.har > > > > On Tue, Oct 2, 2012 at 11:42 AM, Alexander Hristov > > wrote: > > Hello > > I'm trying to test the Hadoop archive functionality under 0.23 and > I can't get it working. > > I have in HDFS a /test folder with several text files. I created > a hadoop archive using > > hadoop archive -archiveName test.har -p /test *.txt /sample > > Ok, this creates a /sample/test.har with the appropriate parts > (_index, _SUCCESS,_masterindex,part-0). Performing a cat on > _index shows the texts files. > However, when I try to even list the contents of the HAR file using > > hdfs dfs -ls -R har:///sample/test.har > > I simply get "har:///sample/test.har : No such file or directory"! > WTF? > > Accessing the individual files does work, however: > > hdfs dfs -cat har:///sample/test.har/file.txt > > works > > Regards > > Alexander > > --------------050308020409050103090004 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit

Thanks, but hadoop dfs is deprecated, hdfs is the recommended way. In any case, the result is exactly the same.

use  bin/hadoop dfs -lsr har:///sample/test.har
 
not  dfs -ls -R har:///sample/test.har



On Tue, Oct 2, 2012 at 11:42 AM, Alexander Hristov <alexh@planetalia.com> wrote:
Hello

I'm trying to test the Hadoop archive functionality under 0.23 and I can't get it working.

I have in HDFS a /test folder with  several text files. I created a hadoop archive using

hadoop archive -archiveName test.har -p /test *.txt  /sample

Ok, this creates a /sample/test.har with the appropriate parts (_index, _SUCCESS,_masterindex,part-0).  Performing a cat on _index shows the texts files.
However, when I try to even list the contents of the HAR file using

hdfs dfs -ls -R har:///sample/test.har

I simply get "har:///sample/test.har : No such file or directory"! WTF?

Accessing the individual files does work, however:

hdfs dfs -cat har:///sample/test.har/file.txt

works

Regards

Alexander



--------------050308020409050103090004--