crunch-commits mailing list archives

From build...@apache.org
Subject svn commit: r894126 - in /websites/staging/crunch/trunk/content: ./ user-guide.html
Date Tue, 14 Jan 2014 18:06:18 GMT
Author: buildbot
Date: Tue Jan 14 18:06:18 2014
New Revision: 894126

Log:
Staging update by buildbot for crunch

Modified:
    websites/staging/crunch/trunk/content/   (props changed)
    websites/staging/crunch/trunk/content/user-guide.html

Propchange: websites/staging/crunch/trunk/content/
------------------------------------------------------------------------------
--- cms:source-revision (original)
+++ cms:source-revision Tue Jan 14 18:06:18 2014
@@ -1 +1 @@
-1558124
+1558133

Modified: websites/staging/crunch/trunk/content/user-guide.html
==============================================================================
--- websites/staging/crunch/trunk/content/user-guide.html (original)
+++ websites/staging/crunch/trunk/content/user-guide.html Tue Jan 14 18:06:18 2014
@@ -240,7 +240,7 @@ functions (UDFs), then Crunch is most li
 of patterns. The following table illustrates the relationship between these patterns across the various data pipeline projects that run on
 top of Apache Hadoop:</p>
 <p><a name="rels"></a>
-<table border="1">
+<table border="1" align="center">
   <tr>
     <td>Concept</td>
     <td><a href="http://hadoop.apache.org">Apache Hadoop MapReduce</a></td>
@@ -758,7 +758,7 @@ can set the value of this parameter via 
 <p>Because we specified this parameter on the Source instance and not the Configuration object directly, we can process multiple
 different files using the NLineInputFormat and not have their different settings conflict with one another.</p>
 <p>Here is a table of commonly used Sources and their associated usage information:</p>
-<table border="1">
+<table border="1" align="center">
   <tr>
     <td>Input Type</td>
     <td>Source</td>
@@ -768,28 +768,28 @@ different files using the NLineInputForm
   </tr>
   <tr>
     <td>Text</td>
-    <td><pre>org.apache.crunch.io.text.TextFileSource</pre></td>
+    <td><a href="apidocs/0.9.0/org/apache/crunch/io/text/TextFileSource.html">org.apache.crunch.io.text.TextFileSource</a></td>
     <td>PCollection&lt;String&gt;</td>
-    <td><pre>textFile</pre></td>
+    <td>textFile</td>
     <td>Works for both TextInputFormat and AvroUtf8InputFormat</td>
   </tr>
   <tr>
     <td>Sequence</td>
-    <td><pre>org.apache.crunch.io.seq.SequenceFileTableSource</pre></td>
+    <td><a href="apidocs/0.9.0/org/apache/crunch/io/seq/SequenceFileTableSource.html">org.apache.crunch.io.seq.SequenceFileTableSource</a></td>
     <td>PTable&lt;K, V&gt;</td>
-    <td><pre>sequenceFile</pre></td>
-    <td>Also has a SequenceFileSource which reads the value and ignores the key.</td>
+    <td>sequenceFile</td>
+    <td>Also has a <a href="apidocs/0.9.0/org/apache/crunch/io/seq/SequenceFileSource.html">SequenceFileSource</a> which reads the value and ignores the key.</td>
   </tr>
   <tr>
     <td>Avro</td>
-    <td><pre>org.apache.crunch.io.avro.AvroFileSource</pre></td>
+    <td><a href="apidocs/0.9.0/org/apache/crunch/io/avro/AvroFileSource.html">org.apache.crunch.io.avro.AvroFileSource</a></td>
     <td>PCollection&lt;V&gt;</td>
-    <td><pre>avroFile</pre></td>
+    <td>avroFile</td>
     <td>No PTable analogue for Avro records.</td>
   </tr>
   <tr>
     <td>Parquet</td>
-    <td><pre>org.apache.crunch.io.parquet.AvroParquetFileSource</pre></td>
+    <td><a href="apidocs/0.9.0/org/apache/crunch/io/parquet/AvroParquetFileSource.html">org.apache.crunch.io.parquet.AvroParquetFileSource</a></td>
     <td>PCollection&lt;V&gt;</td>
     <td>N/A</td>
     <td>Reads Avro records from a parquet-formatted file; expects an Avro PType.</td>
@@ -835,7 +835,7 @@ parameters that this Target needs:</p>
 </pre>
 
 <p>Here is a table of commonly used Targets:</p>
-<table border="1">
+<table border="1" align="center">
   <tr>
     <td>Output Type</td>
     <td>Target</td>
@@ -844,25 +844,25 @@ parameters that this Target needs:</p>
   </tr>
   <tr>
     <td>Text</td>
-    <td><pre>org.apache.crunch.io.text.TextFileTarget</pre></td>
-    <td><pre>textFile</pre></td>
+    <td><a href="apidocs/0.9.0/org/apache/crunch/io/text/TextFileTarget.html">org.apache.crunch.io.text.TextFileTarget</a></td>
+    <td>textFile</td>
     <td>Will write out the string version of whatever it's given, which should be text. See also: Pipeline.writeTextFile.</td>
   </tr>
   <tr>
     <td>Sequence</td>
-    <td><pre>org.apache.crunch.io.seq.SequenceFileTarget</pre></td>
-    <td><pre>sequenceFile</pre></td>
+    <td><a href="apidocs/0.9.0/org/apache/crunch/io/seq/SequenceFileTarget.html">org.apache.crunch.io.seq.SequenceFileTarget</a></td>
+    <td>sequenceFile</td>
     <td>Works on both PCollection and PTable.</td>
   </tr>
   <tr>
     <td>Avro</td>
-    <td><pre>org.apache.crunch.io.avro.AvroFileTarget</pre></td>
-    <td><pre>avroFile</pre></td>
+    <td><a href="apidocs/0.9.0/org/apache/crunch/io/avro/AvroFileTarget.html">org.apache.crunch.io.avro.AvroFileTarget</a></td>
+    <td>avroFile</td>
     <td>Treats PTables as PCollections of Pairs.</td>
   </tr>
   <tr>
     <td>Parquet</td>
-    <td><pre>org.apache.crunch.io.parquet.AvroParquetFileTarget</pre></td>
+    <td><a href="apidocs/0.9.0/org/apache/crunch/io/parquet/AvroParquetFileTarget.html">org.apache.crunch.io.parquet.AvroParquetFileTarget</a></td>
     <td>N/A</td>
     <td>Writes Avro records to parquet-formatted files; expects an Avro PType.</td>
   </tr>
@@ -1308,7 +1308,7 @@ your jobs on the JobTracker or Applicati
 </ol>
 <p>There are a number of handy configuration parameters that can be used to adjust the behavior of MRPipeline that you should be
 aware of:</p>
-<table border="1">
+<table border="1" align="center">
   <tr>
     <td><b>Name</b></td>
     <td><b>Type</b></td>


