[ https://issues.apache.org/jira/browse/MAPREDUCE-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Aaron Kimball updated MAPREDUCE-1126:
-------------------------------------
Attachment: MAPREDUCE-1126.2.patch
Attaching a new patch for this issue that incorporates the API changes initially proposed
as HADOOP-6438.
This adds a package {{org.apache.hadoop.mapreduce.lib.jobdata}} containing static classes
that set class-based or schema-based metadata. Explicit subclasses exist for particular serialization
frameworks (e.g., WritableSerialization) so that users can denote which framework they are
using; in general they behave the same as their superclasses.
This keeps the mapreduce-specific job configuration data in the MapReduce project, but does
not clutter JobContext with per-serialization-framework setters for clients to use.
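
To illustrate the shape of this design, here is a minimal, self-contained sketch. It does not reproduce the classes in the attached patch: every name in it ({{JobDataSketch}}, {{ClassBasedJobData}}, {{WritableJobData}}, {{SchemaBasedJobData}}, and the {{sketch.*}} configuration keys) is hypothetical, and a plain {{Map<String, String>}} stands in for the job configuration.

```java
import java.util.HashMap;
import java.util.Map;

public class JobDataSketch {

    /** Hypothetical helper: records class-based metadata for the map output key. */
    static class ClassBasedJobData {
        static final String KEY_SERIALIZATION = "sketch.map.output.key.serialization";
        static final String KEY_CLASS = "sketch.map.output.key.class";

        static void setMapOutputKeyClass(Map<String, String> conf, Class<?> keyClass) {
            setMetadata(conf, "class-based", keyClass);
        }

        // Shared logic: note which framework is in use and which class it handles.
        static void setMetadata(Map<String, String> conf, String framework, Class<?> keyClass) {
            conf.put(KEY_SERIALIZATION, framework);
            conf.put(KEY_CLASS, keyClass.getName());
        }
    }

    /** Hypothetical subclass: same behavior, but names a specific framework. */
    static class WritableJobData extends ClassBasedJobData {
        static void setMapOutputKeyClass(Map<String, String> conf, Class<?> keyClass) {
            setMetadata(conf, "writable", keyClass);
        }
    }

    /** Hypothetical helper: records schema-based metadata (e.g. a JSON schema string). */
    static class SchemaBasedJobData {
        static final String KEY_SCHEMA = "sketch.map.output.key.schema";

        static void setMapOutputKeySchema(Map<String, String> conf, String schemaJson) {
            conf.put(ClassBasedJobData.KEY_SERIALIZATION, "schema-based");
            conf.put(KEY_SCHEMA, schemaJson);
        }
    }

    public static void main(String[] args) {
        // The client picks the helper for its framework; the job context
        // itself needs no per-framework setter.
        Map<String, String> conf = new HashMap<>();
        WritableJobData.setMapOutputKeyClass(conf, String.class);
        System.out.println(conf);
    }
}
```

The point of the sketch is only that the per-framework setters live in separate helper classes rather than on the job context.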
> shuffle should use serialization to get comparator
> --------------------------------------------------
>
> Key: MAPREDUCE-1126
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1126
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: task
> Reporter: Doug Cutting
> Assignee: Aaron Kimball
> Attachments: MAPREDUCE-1126.2.patch, MAPREDUCE-1126.patch
>
>
> Currently the key comparator is defined as a Java class. Instead we should use the Serialization
> API to create key comparators. This would permit, e.g., Avro-based comparators to be used,
> permitting efficient sorting of complex data types without having to write a RawComparator
> in Java.
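
The idea in the description above can be sketched as follows. This is a toy illustration, not Hadoop's Serialization API: the {{KeySerialization}} interface, the {{IntSerialization}} class, and the {{sortSerialized}} method are all hypothetical names, and the comparator works on whole byte arrays rather than on offsets into a shared buffer.

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

public class SerializationComparatorSketch {

    /** Hypothetical stand-in for a serialization framework plug-in. */
    interface KeySerialization<T> {
        byte[] serialize(T value);

        /** Comparator over serialized bytes, supplied by the framework itself. */
        Comparator<byte[]> rawComparator();
    }

    /** Toy framework: each int key is stored as 4 big-endian bytes. */
    static class IntSerialization implements KeySerialization<Integer> {
        public byte[] serialize(Integer value) {
            return ByteBuffer.allocate(4).putInt(value).array();
        }

        public Comparator<byte[]> rawComparator() {
            // Reads the primitive int straight from the serialized bytes.
            return (a, b) -> Integer.compare(
                    ByteBuffer.wrap(a).getInt(), ByteBuffer.wrap(b).getInt());
        }
    }

    /**
     * The "shuffle" asks the serialization for its comparator instead of
     * loading a comparator class named by the user.
     */
    static <T> List<byte[]> sortSerialized(KeySerialization<T> ser, List<T> keys) {
        List<byte[]> out = new ArrayList<>();
        for (T key : keys) {
            out.add(ser.serialize(key));
        }
        out.sort(ser.rawComparator());
        return out;
    }

    public static void main(String[] args) {
        List<byte[]> sorted = sortSerialized(new IntSerialization(), Arrays.asList(3, -1, 2));
        for (byte[] bytes : sorted) {
            System.out.println(ByteBuffer.wrap(bytes).getInt());
        }
    }
}
```

Under this arrangement a schema-driven framework could return a comparator derived from its schema, so the user would not write one by hand.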
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.