hadoop-mapreduce-user mailing list archives

From Kadu canGica Eduardo <kaducang...@gmail.com>
Subject Error using Streaming and old API
Date Wed, 04 Jan 2012 17:07:01 GMT
I'm using Streaming with Python, and I wrote my own InputFormat using the new
API, but when I run my job I get this error message:
"org.fasta.InputFormat.FastaInputFormat not

Well, as far as I know, the new API was introduced in 0.20 (I'm using, but it seems that Streaming doesn't work with it.

Is there any way to fix this without rewriting my whole InputFormat
against the old API?

Thanks in advance.

package org.fasta.InputFormat;
import java.io.IOException;

import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.io.compress.CompressionCodec;
import org.apache.hadoop.io.compress.CompressionCodecFactory;
import org.apache.hadoop.mapreduce.InputSplit;
import org.apache.hadoop.mapreduce.JobContext;
import org.apache.hadoop.mapreduce.RecordReader;
import org.apache.hadoop.mapreduce.TaskAttemptContext;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;

public class FastaInputFormat extends FileInputFormat<LongWritable,Text> {

    public static String addFistQualityValueProperty =

    protected boolean isSplitable(JobContext context, Path file) {
        CompressionCodec codec = new CompressionCodecFactory(context
        return codec == null;
    }

    public RecordReader<LongWritable, Text> createRecordReader(InputSplit split,
            TaskAttemptContext context) throws IOException,
            InterruptedException {

        return new FastaRecordReader();
    }
}
