hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From praveenesh kumar <praveen...@gmail.com>
Subject Re: Giving filename as key to mapper ?
Date Fri, 15 Jul 2011 14:42:26 GMT
I am new to this hadoop API. Can anyone give me some tutorial or code snipet
on how to write your own input format to do these kind of things.
Thanks.

On Fri, Jul 15, 2011 at 8:07 PM, Robert Evans <evans@yahoo-inc.com> wrote:

> To add to that if you really want the file name to be the key instead of
> just calling a different API in your map to get it you will probably need to
> write your own input format to do it.  It should be fairly simple and you
> can base it off of an existing input format to do it.
>
> --Bobby
>
> On 7/15/11 7:40 AM, "Harsh J" <harsh@cloudera.com> wrote:
>
> You can retrieve the filename in the new API as described here:
>
>
> http://search-hadoop.com/m/ZOmmJ1PZJqt1/map+input+filename&subj=Retrieving+Filename
>
> In the old API, its available in the configuration instance of the
> mapper as key "map.input.file". See the table below this section
>
> http://hadoop.apache.org/common/docs/r0.20.2/mapred_tutorial.html#Task+JVM+Reuse
> for more such goodies.
>
> On Fri, Jul 15, 2011 at 5:44 PM, praveenesh kumar <praveenesh@gmail.com>
> wrote:
> > Hi,
> > How can I give filename as key to mapper ?
> > I want to know the occurence of word in set of docs, so I want to keep
> key
> > as filename. Is it possible to give input key as filename in map function
> ?
> > Thanks,
> > Praveenesh
> >
>
>
>
> --
> Harsh J
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message