hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Lucene-hadoop Wiki] Update of "FAQ" by mingyang
Date Thu, 18 Oct 2007 13:46:27 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by mingyang:
http://wiki.apache.org/lucene-hadoop/FAQ

------------------------------------------------------------------------------
   
  E.g. '''org.apache.hadoop.mapred.Sort``Validator.Record``Stats``Checker.Non``Splitable``Sequence``File``Input``Format'''
in [http://svn.apache.org/viewvc/lucene/hadoop/trunk/src/test/org/apache/hadoop/mapred/SortValidator.java
src/test/org/apache/hadoop/mapred/SortValidator.java]
  
+ In addition to implementing the InputFormat interface and having isSplitable(...) returning
false, it is also necessary to implement the RecordReader interface for returning the whole
content of the input file. (default is LineRecordReader, which splits the file into separate
lines)
+ 
  The other, quick-fix option, is to set [http://lucene.apache.org/hadoop/hadoop-default.html#mapred.min.split.size
mapred.min.split.size] to large enough value.
  
  

Mime
View raw message