Return-Path: X-Original-To: apmail-crunch-user-archive@www.apache.org Delivered-To: apmail-crunch-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 82ADB173A7 for ; Mon, 3 Nov 2014 09:48:58 +0000 (UTC) Received: (qmail 42100 invoked by uid 500); 3 Nov 2014 09:48:58 -0000 Delivered-To: apmail-crunch-user-archive@crunch.apache.org Received: (qmail 42058 invoked by uid 500); 3 Nov 2014 09:48:58 -0000 Mailing-List: contact user-help@crunch.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@crunch.apache.org Delivered-To: mailing list user@crunch.apache.org Received: (qmail 42048 invoked by uid 99); 3 Nov 2014 09:48:58 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 03 Nov 2014 09:48:58 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of gabriel.reid@gmail.com designates 209.85.212.179 as permitted sender) Received: from [209.85.212.179] (HELO mail-wi0-f179.google.com) (209.85.212.179) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 03 Nov 2014 09:48:31 +0000 Received: by mail-wi0-f179.google.com with SMTP id h11so5833490wiw.0 for ; Mon, 03 Nov 2014 01:46:16 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=QGlreqNpg50FxPPz/hMzcNcDkY/xlB2eRAwp6bLPCl4=; b=xv9bCOoEQadu9vshk36YWoChkPiy8ZyRs5cmSYd17+LgJBTznsbtNE6H0tr2ydtNil 0cVHkK+txR3PzEzxYHU0ggTEZHqCtUsBrIFB3bVmYFg8ZtJkSXfU11Mhsy4pKqbzxYdd luI3STmj7xgxgHVf5kzTie3KDVx0WGZRS+74VgWUCjSMMi5zthWYuHaf5WkJFgfgouG1 XuoHhOqQmpceBdOZehaJSJpYpVCPnIXkfTwyZxUGccOHTrfKKLCGhx42oVlfsrbfwnWC 6xT2DAZp1t+ymFk5vTM+YoHh5qYJSYRXbY7n7vkHvpQgHvtw5jzjoz3S9cjn6n1rIbHt hXow== MIME-Version: 1.0 X-Received: by 10.194.134.3 with SMTP id pg3mr46038615wjb.24.1415007975756; Mon, 03 Nov 2014 01:46:15 -0800 (PST) Received: by 10.194.125.208 with HTTP; Mon, 3 Nov 2014 01:46:15 -0800 (PST) In-Reply-To: References: Date: Mon, 3 Nov 2014 10:46:15 +0100 Message-ID: Subject: Re: Input source Text() does not exist! From: Gabriel Reid To: "user@crunch.apache.org" Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org Hi Pavithra, This looks like there is probably a difference in where you're putting your input files and where Hadoop is expecting to find them. Are your input files on your local file system, or on HDFS? And is Hadoop set up to work with your local file system or HDFS? The path "/home/pavithra/in" looks like it is likely a local path (i.e. on the drive of your machine where you're running the command). It could be that when hadoop is trying to read the files, it's connecting to HDFS, and not your local file system. An easy way to check this is to type "hadoop dfs -ls /home/pavithra/in". That command should list the input files you want to use for your Crunch job, but I'm expecting that it will say that that path does not exist (because it's looking on HDFS). Could you try this out and then post back what you get? - Gabriel On Mon, Nov 3, 2014 at 10:05 AM, Pavithra Siriwardena wrote: > Hi folks, > > I am trying to run my first Crunch pipeline, using the example project given > in Crunch Home Page. I'm using Hadoop 2.51 and the Crunch0.11.0. > > When I try to run it locally with using Hadoop's launcher script, > > hadoop jar target/crunch-demo-1.0-SNAPSHOT-job.jar /home/pavithra/in > /home/pavithra/out > > I get the error following error saying 'Input source Text() does not > exist!'; Note that the input text files are in '/home/pavithra/in'. > > Exception in thread "main" java.lang.IllegalStateException: Input source > Text(/home/pavithra/in) does not exist! > at > org.apache.crunch.impl.dist.collect.BaseInputCollection.getSizeInternal(BaseInputCollection.java:66) > at > org.apache.crunch.impl.dist.collect.PCollectionImpl.getSize(PCollectionImpl.java:315) > at > org.apache.crunch.impl.dist.collect.BaseDoCollection.getSizeInternal(BaseDoCollection.java:49) > at > org.apache.crunch.impl.dist.collect.PCollectionImpl.getSize(PCollectionImpl.java:315) > at > org.apache.crunch.impl.dist.collect.BaseDoCollection.getSizeInternal(BaseDoCollection.java:49) > at > org.apache.crunch.impl.dist.collect.PCollectionImpl.getSize(PCollectionImpl.java:315) > at > org.apache.crunch.util.PartitionUtils.getRecommendedPartitions(PartitionUtils.java:51) > at > org.apache.crunch.util.PartitionUtils.getRecommendedPartitions(PartitionUtils.java:40) > at org.apache.crunch.lib.Aggregate.count(Aggregate.java:55) > at > org.apache.crunch.impl.dist.collect.PCollectionImpl.count(PCollectionImpl.java:245) > at com.example.WordCount.run(WordCount.java:54) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) > at com.example.WordCount.main(WordCount.java:22) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > at java.lang.reflect.Method.invoke(Method.java:606) > at org.apache.hadoop.util.RunJar.main(RunJar.java:212) > > Can you please guide me on this problem... > > Thanks > -Pavithra > >