hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Harsh J <ha...@cloudera.com>
Subject Re: Accessing only particular folder using hadoop streaming
Date Thu, 03 Oct 2013 05:21:24 GMT
You need to use globs when passing your input path, like below perhaps:

data/shard*/d1*

On Thu, Oct 3, 2013 at 1:28 AM, jamal sasha <jamalshasha@gmail.com> wrote:
> Hi,
>     I have data in this one folder like following:
>
> data-------shard1---d1_1
>             |          |_d2_1
>             Lshard2---d1_1
>             |          |_d2_2
>             Lshard3---d1_1
>             |          |_d2_3
>             Lshard4---d1_1
>                        |_d2_4
>
>
> Now, I want to search something in d1 (and excluding all the d2's) in it.
> So how do i do that in python?
> Thanks
>



-- 
Harsh J

Mime
View raw message