hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Pranay Varma (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-5635) FileInputFormat does not specify how the file is split
Date Wed, 20 Nov 2013 22:59:35 GMT
Pranay Varma created MAPREDUCE-5635:
---------------------------------------

             Summary: FileInputFormat does not specify how the file is split
                 Key: MAPREDUCE-5635
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5635
             Project: Hadoop Map/Reduce
          Issue Type: Bug
    Affects Versions: 2.2.0
         Environment: Does not matter.
            Reporter: Pranay Varma




Here is what the TextInputFormat javadoc says:
[TextInputFormat|http://hadoop.apache.org/docs/current/api/org/apache/hadoop/mapreduce/lib/input/TextInputFormat.html]

An InputFormat for plain text files. Files are broken into lines. Either linefeed or carriage-return
are used to signal end of line. Keys are the position in the file, and values are the line
of text..

FileInputFormat should say the same on
[FileInputFormat|http://hadoop.apache.org/docs/current/api/org/apache/hadoop/mapreduce/lib/input/FileInputFormat.html]





--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message