hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tom White (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3566) Create an InputFormat for reading lines of text as Java Strings
Date Tue, 29 Jul 2008 14:06:31 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617793#action_12617793
] 

Tom White commented on HADOOP-3566:
-----------------------------------

The patch should be updated to use the new API in org.apache.hadoop.mapreduce which has a
RecordReader that is compatible with this approach, so there is no need to introduce NewInstanceRecordReader.

> Create an InputFormat for reading lines of text as Java Strings
> ---------------------------------------------------------------
>
>                 Key: HADOOP-3566
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3566
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Tom White
>            Assignee: Tom White
>             Fix For: 0.19.0
>
>         Attachments: hadoop-3566-v2.patch, hadoop-3566-v3.patch, hadoop-3566.patch
>
>
> Such a StringInputFormat would be like TextInputFormat but with input types of Long and
String, rather than LongWritable and Text. This would allow users to write MapReduce programs
that used only Java native types (i.e. no Writables).
> This is currently not possible to write without changes to Hadoop due to a limitation
in the RecordReader interface explained here: https://issues.apache.org/jira/browse/HADOOP-3413?focusedCommentId=12597935#action_12597935

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message