hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Amareshwari Sri Ramadasu <amar...@yahoo-inc.com>
Subject Re: No KeyValueTextInputFormat in hadoop-0.20.2?
Date Mon, 21 Jun 2010 04:13:00 GMT
The new api KeyValueTextInputFormat is not available in branch 0.20. It is added in branch
0.21 through https://issues.apache.org/jira/browse/MAPREDUCE-655.

Thanks
Amareshwari

On 6/21/10 6:52 AM, "Kevin Tse" <kevintse.onjee@gmail.com> wrote:

Is there anybody knowing about this, please?

On Mon, Jun 14, 2010 at 10:21 PM, Kevin Tse <kevintse.onjee@gmail.com>wrote:

> Hi Ted,
> I mean the new API:
>
> org.apache.hadoop.mapreduce.Job.setInputFormatClass(org.apache.hadoop.mapreduce.InputFormat)
>
> "Job.setInputFormatClass()" only accepts
> "org.apache.hadoop.mapreduce.InputFormat"(of which there are several
> subclasses, while KeyValueTextInputFormat is not one of them) as its
> parameter.
>
> On Mon, Jun 14, 2010 at 10:03 PM, Ted Yu <yuzhihong@gmail.com> wrote:
>
>> Have you checked
>> src/mapred/org/apache/hadoop/mapred/KeyValueTextInputFormat.java ?
>>
>> On Mon, Jun 14, 2010 at 6:51 AM, Kevin Tse <kevintse.onjee@gmail.com>
>> wrote:
>>
>> > Hi,
>> > I am upgrading my code from hadoop-0.19.2 to hadoop-0.20.2, during the
>> > process I found that there was no KeyValueTextInputFormat class which
>> > exists
>> > in hadoop-0.19.2. It's so strange that this version of hadoop does not
>> come
>> > with this commonly used InputFormat. I have taken a look at the
>> > "SecondarySort.java" example code, it uses TextInputFormat and
>> > StringTokenizer to split each line, it is ok but kinda awkward to me.
>> >
>> > Do I have to implement a new InputFormat myself or there's
>> > a KeyValueTextInputFormat that exists somewhere I didn't notice?
>> >
>> > Thank you.
>> > Kevin Tse
>> >
>>
>
>


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message