spark-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From pwend...@apache.org
Subject [2/3] spark git commit: CHANGES.txt and changelist updaets for Spark 1.4.
Date Tue, 19 May 2015 04:44:46 GMT
http://git-wip-us.apache.org/repos/asf/spark/blob/914ecd05/CHANGES.txt
----------------------------------------------------------------------
diff --git a/CHANGES.txt b/CHANGES.txt
new file mode 100644
index 0000000..3680d94
--- /dev/null
+++ b/CHANGES.txt
@@ -0,0 +1,14606 @@
+Spark Change Log
+----------------
+
+Release 1.4.0
+
+  Version updates for Spark 1.4.0
+  Patrick Wendell <patrick@databricks.com>
+  2015-05-18 21:38:37 -0700
+  Commit: 9d0b7fb
+
+  [SPARK-7681] [MLLIB] Add SparseVector support for gemv
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-18 21:32:36 -0700
+  Commit: dd9f873, github.com/apache/spark/pull/6209
+
+  [SPARK-7692] Updated Kinesis examples
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-18 18:24:15 -0700
+  Commit: 9c48548, github.com/apache/spark/pull/6249
+
+  [SPARK-7621] [STREAMING] Report Kafka errors to StreamingListeners
+  jerluc <jeremyalucas@gmail.com>
+  2015-05-18 18:13:29 -0700
+  Commit: 9188ad8, github.com/apache/spark/pull/6204
+
+  [SPARK-7624] Revert #4147
+  Davies Liu <davies@databricks.com>
+  2015-05-18 16:55:45 -0700
+  Commit: 60cb33d, github.com/apache/spark/pull/6172
+
+  [SQL] Fix serializability of ORC table scan
+  Michael Armbrust <michael@databricks.com>
+  2015-05-18 15:24:31 -0700
+  Commit: f8f23c4, github.com/apache/spark/pull/6247
+
+  [SPARK-7501] [STREAMING] DAG visualization: show DStream operations
+  Andrew Or <andrew@databricks.com>
+  2015-05-18 14:33:33 -0700
+  Commit: a475cbc, github.com/apache/spark/pull/6034
+
+  [HOTFIX] Fix ORC build break
+  Michael Armbrust <michael@databricks.com>
+  2015-05-18 14:04:04 -0700
+  Commit: ba502ab, github.com/apache/spark/pull/6244
+
+  [SPARK-7658] [STREAMING] [WEBUI] Update the mouse behaviors for the timeline graphs
+  zsxwing <zsxwing@gmail.com>
+  2015-05-18 13:34:43 -0700
+  Commit: 39add3d, github.com/apache/spark/pull/6168
+
+  [SPARK-6216] [PYSPARK] check python version of worker with driver
+  Davies Liu <davies@databricks.com>
+  2015-05-18 12:55:13 -0700
+  Commit: a833209, github.com/apache/spark/pull/6203
+
+  [SPARK-7673] [SQL] WIP: HadoopFsRelation and ParquetRelation2 performance optimizations
+  Cheng Lian <lian@databricks.com>
+  2015-05-18 12:45:37 -0700
+  Commit: 3962348, github.com/apache/spark/pull/6225
+
+  [SPARK-7567] [SQL] [follow-up] Use a new flag to set output committer based on mapreduce apis
+  Yin Huai <yhuai@databricks.com>
+  2015-05-18 12:17:10 -0700
+  Commit: a385f4b, github.com/apache/spark/pull/6130
+
+  [SPARK-7269] [SQL] Incorrect analysis for aggregation(use semanticEquals)
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-18 12:08:28 -0700
+  Commit: d6f5f37, github.com/apache/spark/pull/6173
+
+  [SPARK-7631] [SQL] treenode argString should not print children
+  scwf <wangfei1@huawei.com>
+  2015-05-18 12:05:14 -0700
+  Commit: dbd4ec8, github.com/apache/spark/pull/6144
+
+  [SPARK-2883] [SQL] ORC data source for Spark SQL
+  Zhan Zhang <zhazhan@gmail.com>, Cheng Lian <lian@databricks.com>
+  2015-05-18 12:03:27 -0700
+  Commit: 65d71bd, github.com/apache/spark/pull/6194
+
+  [SPARK-7380] [MLLIB] pipeline stages should be copyable in Python
+  Xiangrui Meng <meng@databricks.com>, Joseph K. Bradley <joseph@databricks.com>
+  2015-05-18 12:02:18 -0700
+  Commit: cf4e04a, github.com/apache/spark/pull/6088
+
+  [SQL] [MINOR] [THIS] use private for internal field in ScalaUdf
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-18 12:01:30 -0700
+  Commit: 7d44c01, github.com/apache/spark/pull/6235
+
+  [SPARK-7570] [SQL] Ignores _temporary during partition discovery
+  Cheng Lian <lian@databricks.com>
+  2015-05-18 11:59:44 -0700
+  Commit: c7623a2, github.com/apache/spark/pull/6091
+
+  [SPARK-6888] [SQL] Make the jdbc driver handling user-definable
+  Rene Treffer <treffer@measite.de>
+  2015-05-18 11:55:36 -0700
+  Commit: b41301a, github.com/apache/spark/pull/5555
+
+  [SPARK-7627] [SPARK-7472] DAG visualization: style skipped stages
+  Andrew Or <andrew@databricks.com>
+  2015-05-18 10:59:35 -0700
+  Commit: a0ae8ce, github.com/apache/spark/pull/6171
+
+  [SPARK-7272] [MLLIB] User guide for PMML model export
+  Vincenzo Selvaggio <vselvaggio@hotmail.it>
+  2015-05-18 08:46:33 -0700
+  Commit: a95d4e1, github.com/apache/spark/pull/6219
+
+  [SPARK-6657] [PYSPARK] Fix doc warnings
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-18 08:35:14 -0700
+  Commit: 2c94ffe, github.com/apache/spark/pull/6221
+
+  [SPARK-7299][SQL] Set precision and scale for Decimal according to JDBC metadata instead of returned BigDecimal
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-18 01:10:55 -0700
+  Commit: 0e7cd8f, github.com/apache/spark/pull/5833
+
+  [SPARK-7694] [MLLIB] Use getOrElse for getting the threshold of LR model
+  Shuo Xiang <shuoxiangpub@gmail.com>
+  2015-05-17 21:16:52 -0700
+  Commit: 0b6bc8a, github.com/apache/spark/pull/6224
+
+  [SPARK-7693][Core] Remove "import scala.concurrent.ExecutionContext.Implicits.global"
+  zsxwing <zsxwing@gmail.com>
+  2015-05-17 20:37:19 -0700
+  Commit: 2a42d2d, github.com/apache/spark/pull/6223
+
+  [SQL] [MINOR] use catalyst type converter in ScalaUdf
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-17 16:51:57 -0700
+  Commit: be66d19, github.com/apache/spark/pull/6182
+
+  [SPARK-6514] [SPARK-5960] [SPARK-6656] [SPARK-7679] [STREAMING] [KINESIS] Updates to the Kinesis API
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-17 16:49:07 -0700
+  Commit: e0632ff, github.com/apache/spark/pull/6147
+
+  [SPARK-7491] [SQL] Allow configuration of classloader isolation for hive
+  Michael Armbrust <michael@databricks.com>
+  2015-05-17 12:43:15 -0700
+  Commit: a855608, github.com/apache/spark/pull/6167
+
+  [SPARK-7686] [SQL] DescribeCommand is assigned wrong output attributes in SparkStrategies
+  Josh Rosen <joshrosen@databricks.com>
+  2015-05-17 11:59:28 -0700
+  Commit: 53d6ab5, github.com/apache/spark/pull/6217
+
+  [SPARK-7660] Wrap SnappyOutputStream to work around snappy-java bug
+  Josh Rosen <joshrosen@databricks.com>
+  2015-05-17 09:30:49 -0700
+  Commit: 6df71eb, github.com/apache/spark/pull/6176
+
+  [SPARK-7669] Builds against Hadoop 2.6+ get inconsistent curator depend…
+  Steve Loughran <stevel@hortonworks.com>
+  2015-05-17 17:03:11 +0100
+  Commit: 0feb3de, github.com/apache/spark/pull/6191
+
+  [SPARK-7447] [SQL] Don't re-merge Parquet schema when the relation is deserialized
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-17 15:42:21 +0800
+  Commit: 898be62, github.com/apache/spark/pull/6012
+
+  [MINOR] Add 1.3, 1.3.1 to master branch EC2 scripts
+  Shivaram Venkataraman <shivaram@cs.berkeley.edu>
+  2015-05-17 00:12:20 -0700
+  Commit: 0ed376a, github.com/apache/spark/pull/6215
+
+  [MINOR] [SQL] Removes an unreachable case clause
+  Cheng Lian <lian@databricks.com>
+  2015-05-16 23:20:09 -0700
+  Commit: 671a6bc, github.com/apache/spark/pull/6214
+
+  [SPARK-7654][SQL] Move JDBC into DataFrame's reader/writer interface.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-16 22:01:53 -0700
+  Commit: 17e0786, github.com/apache/spark/pull/6210
+
+  [SPARK-7655][Core] Deserializing value should not hold the TaskSchedulerImpl lock
+  zsxwing <zsxwing@gmail.com>
+  2015-05-16 21:03:22 -0700
+  Commit: 8494910, github.com/apache/spark/pull/6195
+
+  [SPARK-7654][MLlib] Migrate MLlib to the DataFrame reader/writer API.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-16 15:03:57 -0700
+  Commit: bd057f8, github.com/apache/spark/pull/6211
+
+  [BUILD] update jblas dependency version to 1.2.4
+  Matthew Brandyberry <mbrandy@us.ibm.com>
+  2015-05-16 18:17:48 +0100
+  Commit: 8bde352, github.com/apache/spark/pull/6199
+
+  [HOTFIX] [SQL] Fixes DataFrameWriter.mode(String)
+  Cheng Lian <lian@databricks.com>
+  2015-05-16 20:55:10 +0800
+  Commit: 856619d, github.com/apache/spark/pull/6212
+
+  [SPARK-7655][Core][SQL] Remove 'scala.concurrent.ExecutionContext.Implicits.global' in 'ask' and 'BroadcastHashJoin'
+  zsxwing <zsxwing@gmail.com>
+  2015-05-16 00:44:29 -0700
+  Commit: ad5b0b1, github.com/apache/spark/pull/6200
+
+  [SPARK-7672] [CORE] Use int conversion in translating kryoserializer.buffer.mb to kryoserializer.buffer
+  Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
+  2015-05-16 08:24:21 +0100
+  Commit: e7607e5, github.com/apache/spark/pull/6198
+
+  [SPARK-4556] [BUILD] binary distribution assembly can't run in local mode
+  Sean Owen <sowen@cloudera.com>
+  2015-05-16 08:18:41 +0100
+  Commit: 1fc3560, github.com/apache/spark/pull/6186
+
+  [SPARK-7671] Fix wrong URLs in MLlib Data Types Documentation
+  FavioVazquez <favio.vazquezp@gmail.com>
+  2015-05-16 08:07:03 +0100
+  Commit: 7e3f9fe, github.com/apache/spark/pull/6196
+
+  [SPARK-7654][SQL] DataFrameReader and DataFrameWriter for input/output API
+  Reynold Xin <rxin@databricks.com>
+  2015-05-15 22:00:31 -0700
+  Commit: 9da55b5, github.com/apache/spark/pull/6175
+
+  [SPARK-7473] [MLLIB] Add reservoir sample in RandomForest
+  AiHe <ai.he@ussuning.com>
+  2015-05-15 20:42:35 -0700
+  Commit: f41be8f, github.com/apache/spark/pull/5988
+
+  [SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple files
+  Davies Liu <davies@databricks.com>
+  2015-05-15 20:09:15 -0700
+  Commit: 8164fbc, github.com/apache/spark/pull/6201
+
+  [SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in Python
+  Davies Liu <davies@databricks.com>
+  2015-05-15 20:05:26 -0700
+  Commit: 61806f6, github.com/apache/spark/pull/6206
+
+  [SPARK-7575] [ML] [DOC] Example code for OneVsRest
+  Ram Sriharsha <rsriharsha@hw11853.local>
+  2015-05-15 19:33:20 -0700
+  Commit: 04323ba, github.com/apache/spark/pull/6115
+
+  [SPARK-7563] OutputCommitCoordinator.stop() should only run on the driver
+  Josh Rosen <joshrosen@databricks.com>
+  2015-05-15 18:06:01 -0700
+  Commit: ed75cc0, github.com/apache/spark/pull/6197
+
+  [SPARK-7676] Bug fix and cleanup of stage timeline view
+  Kay Ousterhout <kayousterhout@gmail.com>
+  2015-05-15 17:45:14 -0700
+  Commit: 6f78d03, github.com/apache/spark/pull/6202
+
+  [SPARK-7556] [ML] [DOC] Add user guide for spark.ml Binarizer, including Scala, Java and Python examples
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-15 15:05:04 -0700
+  Commit: e847d86, github.com/apache/spark/pull/6116
+
+  [SPARK-7677] [STREAMING] Add Kafka modules to the 2.11 build.
+  Iulian Dragos <jaguarul@gmail.com>
+  2015-05-15 14:57:29 -0700
+  Commit: 31e6404, github.com/apache/spark/pull/6149
+
+  [SPARK-7226] [SPARKR] Support math functions in R DataFrame
+  qhuang <qian.huang@intel.com>
+  2015-05-15 14:06:16 -0700
+  Commit: 9ef6d74, github.com/apache/spark/pull/6170
+
+  [SPARK-7296] Add timeline visualization for stages in the UI.
+  Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+  2015-05-15 13:54:09 -0700
+  Commit: a5f7b3b, github.com/apache/spark/pull/5843
+
+  [SPARK-7504] [YARN] NullPointerException when initializing SparkContext in YARN-cluster mode
+  ehnalis <zoltan.zvara@gmail.com>
+  2015-05-15 12:14:02 -0700
+  Commit: 7dc0ff3, github.com/apache/spark/pull/6083
+
+  [SPARK-7664] [WEBUI] DAG visualization: Fix incorrect link paths of DAG.
+  Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+  2015-05-15 11:54:13 -0700
+  Commit: e319719, github.com/apache/spark/pull/6184
+
+  [SPARK-5412] [DEPLOY] Cannot bind Master to a specific hostname as per the documentation
+  Sean Owen <sowen@cloudera.com>
+  2015-05-15 11:30:19 -0700
+  Commit: fe3c734, github.com/apache/spark/pull/6185
+
+  [CORE] Protect additional test vars from early GC
+  Tim Ellison <t.p.ellison@gmail.com>
+  2015-05-15 11:27:24 -0700
+  Commit: 866e4b5, github.com/apache/spark/pull/6187
+
+  [SPARK-7233] [CORE] Detect REPL mode once
+  Oleksii Kostyliev <etander@gmail.com>, Oleksii Kostyliev <okostyliev@thunderhead.com>
+  2015-05-15 11:19:56 -0700
+  Commit: c58b9c6, github.com/apache/spark/pull/5835
+
+  [SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on bad input
+  FlytxtRnD <meethu.mathew@flytxt.com>
+  2015-05-15 10:43:18 -0700
+  Commit: dfdae58, github.com/apache/spark/pull/6180
+
+  [SPARK-7668] [MLLIB] Preserve isTransposed property for Matrix after calling map function
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-15 10:03:29 -0700
+  Commit: d1f5651, github.com/apache/spark/pull/6188
+
+  [SPARK-7503] [YARN] Resources in .sparkStaging directory can't be cleaned up on error
+  Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+  2015-05-15 11:37:34 +0100
+  Commit: a17a0ee, github.com/apache/spark/pull/6026
+
+  [SPARK-7591] [SQL] Partitioning support API tweaks
+  Cheng Lian <lian@databricks.com>
+  2015-05-15 16:20:49 +0800
+  Commit: bcb2c5d, github.com/apache/spark/pull/6150
+
+  [SPARK-6258] [MLLIB] GaussianMixture Python API parity check
+  Yanbo Liang <ybliang8@gmail.com>
+  2015-05-15 00:18:39 -0700
+  Commit: c0bb974, github.com/apache/spark/pull/6087
+
+  [SPARK-7650] [STREAMING] [WEBUI] Move streaming css and js files to the streaming project
+  zsxwing <zsxwing@gmail.com>
+  2015-05-14 23:51:41 -0700
+  Commit: 0ba99f0, github.com/apache/spark/pull/6160
+
+  [CORE] Remove unreachable Heartbeat message from Worker
+  Kan Zhang <kzhang@apache.org>
+  2015-05-14 23:50:50 -0700
+  Commit: 6742b4e, github.com/apache/spark/pull/6163
+
+  [HOTFIX] Add workaround for SPARK-7660 to fix JavaAPISuite failures.
+  Josh Rosen <joshrosen@databricks.com>
+  2015-05-14 23:17:41 -0700
+  Commit: 1206a55
+
+  [SQL] When creating partitioned table scan, explicitly create UnionRDD.
+  Yin Huai <yhuai@databricks.com>
+  2015-05-15 12:04:26 +0800
+  Commit: 7aa269f, github.com/apache/spark/pull/6162
+
+  [SPARK-7098][SQL] Make the WHERE clause with timestamp show consistent result
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-14 20:49:21 -0700
+  Commit: bac4522, github.com/apache/spark/pull/5682
+
+  [SPARK-7548] [SQL] Add explode function for DataFrames
+  Michael Armbrust <michael@databricks.com>
+  2015-05-14 19:49:44 -0700
+  Commit: 778a054, github.com/apache/spark/pull/6107
+
+  [SPARK-7619] [PYTHON] fix docstring signature
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-14 18:16:22 -0700
+  Commit: a238c23, github.com/apache/spark/pull/6161
+
+  [SPARK-7648] [MLLIB] Add weights and intercept to GLM wrappers in spark.ml
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-14 18:13:58 -0700
+  Commit: f91bb57, github.com/apache/spark/pull/6156
+
+  [SPARK-7645] [STREAMING] [WEBUI] Show milliseconds in the UI if the batch interval < 1 second
+  zsxwing <zsxwing@gmail.com>
+  2015-05-14 16:58:36 -0700
+  Commit: 79983f1, github.com/apache/spark/pull/6154
+
+  [SPARK-7649] [STREAMING] [WEBUI] Use window.localStorage to store the status rather than the url
+  zsxwing <zsxwing@gmail.com>
+  2015-05-14 16:57:33 -0700
+  Commit: 3358485, github.com/apache/spark/pull/6158
+
+  [SPARK-7643] [UI] use the correct size in RDDPage for storage info and partitions
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-14 16:56:32 -0700
+  Commit: 8d8876d, github.com/apache/spark/pull/6157
+
+  [SPARK-7598] [DEPLOY] Add aliveWorkers metrics in Master
+  Rex Xiong <pengx@microsoft.com>
+  2015-05-14 16:55:31 -0700
+  Commit: 894214f, github.com/apache/spark/pull/6117
+
+  Make SPARK prefix a variable
+  tedyu <yuzhihong@gmail.com>
+  2015-05-14 15:26:35 -0700
+  Commit: fceaffc, github.com/apache/spark/pull/6153
+
+  [SPARK-7278] [PySpark] DateType should find datetime.datetime acceptable
+  ksonj <kson@siberie.de>
+  2015-05-14 15:10:58 -0700
+  Commit: a49a145, github.com/apache/spark/pull/6057
+
+  [SQL][minor] rename apply for QueryPlanner
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-14 10:25:18 -0700
+  Commit: aa8a0f9, github.com/apache/spark/pull/6142
+
+  [SPARK-7249] Updated Hadoop dependencies due to inconsistency in the versions
+  FavioVazquez <favio.vazquezp@gmail.com>
+  2015-05-14 15:22:58 +0100
+  Commit: 67ed0aa, github.com/apache/spark/pull/5786
+
+  [SPARK-7568] [ML] ml.LogisticRegression doesn't output the right prediction
+  DB Tsai <dbt@netflix.com>
+  2015-05-14 01:26:08 -0700
+  Commit: 58534b0, github.com/apache/spark/pull/6109
+
+  [SPARK-7407] [MLLIB] use uid + name to identify parameters
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-14 01:22:15 -0700
+  Commit: e45cd9f, github.com/apache/spark/pull/6019
+
+  [SPARK-7595] [SQL] Window will cause resolve failed with self join
+  linweizhong <linweizhong@huawei.com>
+  2015-05-14 00:23:27 -0700
+  Commit: c80e0cf, github.com/apache/spark/pull/6114
+
+  [SPARK-7620] [ML] [MLLIB] Removed calling size, length in while condition to avoid extra JVM call
+  DB Tsai <dbt@netflix.com>
+  2015-05-13 22:23:21 -0700
+  Commit: 9ab4db2, github.com/apache/spark/pull/6137
+
+  [SPARK-7612] [MLLIB] update NB training to use mllib's BLAS
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-13 21:27:17 -0700
+  Commit: 82f387f, github.com/apache/spark/pull/6128
+
+  [HOT FIX #6125] Do not wait for all stages to start rendering
+  Andrew Or <andrew@databricks.com>
+  2015-05-13 21:04:13 -0700
+  Commit: 2d4a961, github.com/apache/spark/pull/6138
+
+  [HOTFIX] Use 'new Job' in fsBasedParquet.scala
+  zsxwing <zsxwing@gmail.com>
+  2015-05-13 17:58:29 -0700
+  Commit: d518c03, github.com/apache/spark/pull/6136
+
+  [SPARK-6752] [STREAMING] [REVISED] Allow StreamingContext to be recreated from checkpoint and existing SparkContext
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-13 17:33:15 -0700
+  Commit: aec8394, github.com/apache/spark/pull/6096
+
+  [SPARK-7601] [SQL] Support Insert into JDBC Datasource
+  Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
+  2015-05-13 17:24:04 -0700
+  Commit: 820aaa6, github.com/apache/spark/pull/6121
+
+  [SPARK-7081] Faster sort-based shuffle path using binary processing cache-aware sort
+  Josh Rosen <joshrosen@databricks.com>
+  2015-05-13 17:07:31 -0700
+  Commit: c53ebea, github.com/apache/spark/pull/5868
+
+  [SPARK-7356] [STREAMING] Fix flakey tests in FlumePollingStreamSuite using SparkSink's batch CountDownLatch.
+  Hari Shreedharan <hshreedharan@apache.org>
+  2015-05-13 16:43:30 -0700
+  Commit: 6c0644a, github.com/apache/spark/pull/5918
+
+  [STREAMING] [MINOR] Keep streaming.UIUtils private
+  Andrew Or <andrew@databricks.com>
+  2015-05-13 16:31:24 -0700
+  Commit: e499a1e, github.com/apache/spark/pull/6134
+
+  [SPARK-7502] DAG visualization: gracefully handle removed stages
+  Andrew Or <andrew@databricks.com>
+  2015-05-13 16:29:52 -0700
+  Commit: 895d46a, github.com/apache/spark/pull/6132
+
+  [SPARK-7464] DAG visualization: highlight the same RDDs on hover
+  Andrew Or <andrew@databricks.com>
+  2015-05-13 16:29:10 -0700
+  Commit: 4b4f10b, github.com/apache/spark/pull/6100
+
+  [SPARK-7399] Spark compilation error for scala 2.11
+  Andrew Or <andrew@databricks.com>
+  2015-05-13 16:28:37 -0700
+  Commit: e6b8cef, github.com/apache/spark/pull/6129
+
+  [SPARK-7608] Clean up old state in RDDOperationGraphListener
+  Andrew Or <andrew@databricks.com>
+  2015-05-13 16:27:48 -0700
+  Commit: ec34230, github.com/apache/spark/pull/6125
+
+  [SQL] Move some classes into packages that are more appropriate.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-13 16:15:31 -0700
+  Commit: acd872b, github.com/apache/spark/pull/6108
+
+  [SPARK-7303] [SQL] push down project if possible when the child is sort
+  scwf <wangfei1@huawei.com>
+  2015-05-13 16:13:48 -0700
+  Commit: d5c52d9, github.com/apache/spark/pull/5838
+
+  [SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classification
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-13 15:13:09 -0700
+  Commit: 51230f2, github.com/apache/spark/pull/6106
+
+  [SPARK-7545] [MLLIB] Added check in Bernoulli Naive Bayes to make sure that both training and predict features have values of 0 or 1
+  leahmcguire <lmcguire@salesforce.com>
+  2015-05-13 14:13:19 -0700
+  Commit: d9fb905, github.com/apache/spark/pull/6073
+
+  [SPARK-7593] [ML] Python Api for ml.feature.Bucketizer
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-13 13:21:36 -0700
+  Commit: 11911b0, github.com/apache/spark/pull/6124
+
+  [SPARK-7551][DataFrame] support backticks for DataFrame attribute resolution
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-13 12:47:48 -0700
+  Commit: 3a60bcb, github.com/apache/spark/pull/6074
+
+  [SPARK-7567] [SQL] Migrating Parquet data source to FSBasedRelation
+  Cheng Lian <lian@databricks.com>
+  2015-05-13 11:04:10 -0700
+  Commit: 90f304b, github.com/apache/spark/pull/6090
+
+  [SPARK-7589] [STREAMING] [WEBUI] Make "Input Rate" in the Streaming page consistent with other pages
+  zsxwing <zsxwing@gmail.com>
+  2015-05-13 10:01:26 -0700
+  Commit: 10007fb, github.com/apache/spark/pull/6102
+
+  [SPARK-6734] [SQL] Add UDTF.close support in Generate
+  Cheng Hao <hao.cheng@intel.com>
+  2015-05-14 00:14:59 +0800
+  Commit: 42cf4a2, github.com/apache/spark/pull/5383
+
+  [MINOR] [SQL] Removes debugging println
+  Cheng Lian <lian@databricks.com>
+  2015-05-13 23:40:13 +0800
+  Commit: d78f0e1, github.com/apache/spark/pull/6123
+
+  [SQL] In InsertIntoFSBasedRelation.insert, log cause before abort job/task.
+  Yin Huai <yhuai@databricks.com>
+  2015-05-13 23:36:19 +0800
+  Commit: 9ca28d9, github.com/apache/spark/pull/6105
+
+  [SPARK-7599] [SQL] Don't restrict customized output committers to be subclasses of FileOutputCommitter
+  Cheng Lian <lian@databricks.com>
+  2015-05-13 07:35:55 -0700
+  Commit: cb1fe81, github.com/apache/spark/pull/6118
+
+  [SPARK-6568] spark-shell.cmd --jars option does not accept the jar that has space in its path
+  Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>, Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+  2015-05-13 09:43:40 +0100
+  Commit: bfdecac, github.com/apache/spark/pull/5447
+
+  [SPARK-7526] [SPARKR] Specify ip of RBackend, MonitorServer and RRDD Socket server
+  linweizhong <linweizhong@huawei.com>
+  2015-05-12 23:55:44 -0700
+  Commit: 7bd5274, github.com/apache/spark/pull/6053
+
+  [SPARK-7482] [SPARKR] Rename some DataFrame API methods in SparkR to match their counterparts in Scala.
+  Sun Rui <rui.sun@intel.com>
+  2015-05-12 23:52:30 -0700
+  Commit: b18f1c6, github.com/apache/spark/pull/6007
+
+  [SPARK-7566][SQL] Add type to HiveContext.analyzer
+  Santiago M. Mola <santi@mola.io>
+  2015-05-12 23:44:21 -0700
+  Commit: 6ff3379, github.com/apache/spark/pull/6086
+
+  [SPARK-7321][SQL] Add Column expression for conditional statements (when/otherwise)
+  Reynold Xin <rxin@databricks.com>, kaka1992 <kaka_1992@163.com>
+  2015-05-12 21:43:34 -0700
+  Commit: 219a904, github.com/apache/spark/pull/6072
+
+  [SPARK-7588] Document all SQL/DataFrame public methods with @since tag
+  Reynold Xin <rxin@databricks.com>
+  2015-05-12 18:37:02 -0700
+  Commit: bdd5db9, github.com/apache/spark/pull/6101
+
+  [HOTFIX] Use the old Job API to support old Hadoop versions
+  zsxwing <zsxwing@gmail.com>
+  2015-05-13 08:33:24 +0800
+  Commit: 2cc3301, github.com/apache/spark/pull/6095
+
+  [SPARK-7572] [MLLIB] do not import Param/Params under pyspark.ml
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-12 17:15:39 -0700
+  Commit: 08ec1af, github.com/apache/spark/pull/6094
+
+  [SPARK-7554] [STREAMING] Throw exception when an active/stopped StreamingContext is used to create DStreams and output operations
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-12 17:07:21 -0700
+  Commit: bb81b15, github.com/apache/spark/pull/6099
+
+  [SPARK-7528] [MLLIB] make RankingMetrics Java-friendly
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-12 16:53:47 -0700
+  Commit: 6c292a2, github.com/apache/spark/pull/6098
+
+  [SPARK-7553] [STREAMING] Added methods to maintain a singleton StreamingContext
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-12 16:44:14 -0700
+  Commit: 91fbd93, github.com/apache/spark/pull/6070
+
+  [SPARK-7573] [ML] OneVsRest cleanups
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-05-12 16:42:30 -0700
+  Commit: 612247f, github.com/apache/spark/pull/6097
+
+  [SPARK-7557] [ML] [DOC] User guide for spark.ml HashingTF, Tokenizer
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-05-12 16:39:56 -0700
+  Commit: d080df1, github.com/apache/spark/pull/6093
+
+  [SPARK-7496] [MLLIB] Update Programming guide with Online LDA
+  Yuhao Yang <hhbyyh@gmail.com>
+  2015-05-12 15:12:29 -0700
+  Commit: fe34a59, github.com/apache/spark/pull/6046
+
+  [SPARK-7406] [STREAMING] [WEBUI] Add tooltips for "Scheduling Delay", "Processing Time" and "Total Delay"
+  zsxwing <zsxwing@gmail.com>
+  2015-05-12 14:41:21 -0700
+  Commit: 221375e, github.com/apache/spark/pull/5952
+
+  [SPARK-7571] [MLLIB] rename Math to math
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-12 14:39:03 -0700
+  Commit: 2555517, github.com/apache/spark/pull/6092
+
+  [SPARK-7484][SQL]Support jdbc connection properties
+  Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
+  2015-05-12 14:37:23 -0700
+  Commit: 32819fc, github.com/apache/spark/pull/6009
+
+  [SPARK-7559] [MLLIB] Bucketizer should include the right most boundary in the last bucket.
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-12 14:24:26 -0700
+  Commit: 98ccd93, github.com/apache/spark/pull/6075
+
+  [SPARK-7569][SQL] Better error for invalid binary expressions
+  Michael Armbrust <michael@databricks.com>
+  2015-05-12 13:36:55 -0700
+  Commit: c68485e, github.com/apache/spark/pull/6089
+
+  [SPARK-7015] [MLLIB] [WIP] Multiclass to Binary Reduction: One Against All
+  Ram Sriharsha <rsriharsha@hw11853.local>
+  2015-05-12 13:35:12 -0700
+  Commit: fd16709, github.com/apache/spark/pull/5830
+
+  [SPARK-2018] [CORE] Upgrade LZF library to fix endian serialization p…
+  Tim Ellison <t.p.ellison@gmail.com>
+  2015-05-12 20:48:26 +0100
+  Commit: eadda92, github.com/apache/spark/pull/6077
+
+  [SPARK-7487] [ML] Feature Parity in PySpark for ml.regression
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-12 12:17:05 -0700
+  Commit: 432694c, github.com/apache/spark/pull/6016
+
+  [HOT FIX #6076] DAG visualization: curve the edges
+  Andrew Or <andrew@databricks.com>
+  2015-05-12 12:06:30 -0700
+  Commit: ce6c400
+
+  [SPARK-7276] [DATAFRAME] speed up DataFrame.select by collapsing Project
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-12 11:51:55 -0700
+  Commit: 8be43f8, github.com/apache/spark/pull/5831
+
+  [SPARK-7500] DAG visualization: move cluster labeling to dagre-d3
+  Andrew Or <andrew@databricks.com>
+  2015-05-12 11:17:59 -0700
+  Commit: a236104, github.com/apache/spark/pull/6076
+
+  [DataFrame][minor] support column in field accessor
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-12 10:37:57 -0700
+  Commit: ec89286, github.com/apache/spark/pull/6080
+
+  [SPARK-3928] [SPARK-5182] [SQL] Partitioning support for the data sources API
+  Cheng Lian <lian@databricks.com>
+  2015-05-13 01:32:28 +0800
+  Commit: d232813, github.com/apache/spark/pull/5526
+
+  [DataFrame][minor] cleanup unapply methods in DataTypes
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-12 10:28:40 -0700
+  Commit: a9d84a9, github.com/apache/spark/pull/6079
+
+  [SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark
+  Daoyuan Wang <daoyuan.wang@intel.com>
+  2015-05-12 10:23:41 -0700
+  Commit: 653db0a, github.com/apache/spark/pull/6003
+
+  [SPARK-7532] [STREAMING] StreamingContext.start() made to logWarning and not throw exception
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-12 08:48:24 -0700
+  Commit: 2bbb685, github.com/apache/spark/pull/6060
+
+  [SPARK-7467] Dag visualization: treat checkpoint as an RDD operation
+  Andrew Or <andrew@databricks.com>
+  2015-05-12 01:40:55 -0700
+  Commit: 5601632, github.com/apache/spark/pull/6004
+
+  [SPARK-7485] [BUILD] Remove pyspark files from assembly.
+  Marcelo Vanzin <vanzin@cloudera.com>
+  2015-05-12 01:39:21 -0700
+  Commit: afe54b7, github.com/apache/spark/pull/6022
+
+  [MINOR] [PYSPARK] Set PYTHONPATH to python/lib/pyspark.zip rather than python/pyspark
+  linweizhong <linweizhong@huawei.com>
+  2015-05-12 01:36:27 -0700
+  Commit: 4092a2e, github.com/apache/spark/pull/6047
+
+  [SPARK-7534] [CORE] [WEBUI] Fix the Stage table when a stage is missing
+  zsxwing <zsxwing@gmail.com>
+  2015-05-12 01:34:33 -0700
+  Commit: af374ed, github.com/apache/spark/pull/6061
+
+  [SPARK-6994][SQL] Update docs for fetching Row fields by name
+  vidmantas zemleris <vidmantas@vinted.com>
+  2015-05-11 22:29:24 -0700
+  Commit: 6523fb8, github.com/apache/spark/pull/6030
+
+  [SQL] Rename Dialect -> ParserDialect.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-11 22:06:56 -0700
+  Commit: c6b8148, github.com/apache/spark/pull/6071
+
+  [SPARK-7435] [SPARKR] Make DataFrame.show() consistent with that of Scala and pySpark
+  Joshi <rekhajoshm@gmail.com>, Rekha Joshi <rekhajoshm@gmail.com>
+  2015-05-11 21:02:34 -0700
+  Commit: 835a770, github.com/apache/spark/pull/5989
+
+  [SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-11 20:04:36 -0700
+  Commit: ed40ab5, github.com/apache/spark/pull/6068
+
+  [SPARK-7437] [SQL] Fold "literal in (item1, item2, ..., literal, ...)" into true or false directly
+  Zhongshuai Pei <799203320@qq.com>, DoingDone9 <799203320@qq.com>
+  2015-05-11 19:22:44 -0700
+  Commit: c30982d, github.com/apache/spark/pull/5972
+
+  [SPARK-7411] [SQL] Support SerDe for HiveQl in CTAS
+  Cheng Hao <hao.cheng@intel.com>
+  2015-05-11 19:21:16 -0700
+  Commit: 1a664a0, github.com/apache/spark/pull/5963
+
+  [SPARK-7324] [SQL] DataFrame.dropDuplicates
+  Reynold Xin <rxin@databricks.com>
+  2015-05-11 19:15:14 -0700
+  Commit: 8a9d234, github.com/apache/spark/pull/6066
+
+  [SPARK-7530] [STREAMING] Added StreamingContext.getState() to expose the current state of the context
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-11 18:53:50 -0700
+  Commit: c16b47f, github.com/apache/spark/pull/6058
+
+  [SPARK-5893] [ML] Add bucketizer
+  Xusen Yin <yinxusen@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
+  2015-05-11 18:41:22 -0700
+  Commit: f188815, github.com/apache/spark/pull/5980
+
+  Updated DataFrame.saveAsTable Hive warning to include SPARK-7550 ticket.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-11 18:10:45 -0700
+  Commit: e1e599d, github.com/apache/spark/pull/6067
+
+  [SPARK-7462][SQL] Update documentation for retaining grouping columns in DataFrames.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-11 18:07:12 -0700
+  Commit: eaa6116, github.com/apache/spark/pull/6062
+
+  [SPARK-7084] improve saveAsTable documentation
+  madhukar <phatak.dev@gmail.com>
+  2015-05-11 17:04:11 -0700
+  Commit: 0dbfe16, github.com/apache/spark/pull/5654
+
+  [SQL] Show better error messages for incorrect join types in DataFrames.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-11 17:02:11 -0700
+  Commit: 0ff34f80, github.com/apache/spark/pull/6064
+
+  Update Documentation: leftsemi instead of semijoin
+  LCY Vincent <lauchunyin@gmail.com>
+  2015-05-11 14:48:10 -0700
+  Commit: 788503a, github.com/apache/spark/pull/5944
+
+  [STREAMING] [MINOR] Close files correctly when iterator is finished in streaming WAL recovery
+  jerryshao <saisai.shao@intel.com>
+  2015-05-11 14:38:58 -0700
+  Commit: 9e226e1, github.com/apache/spark/pull/6050
+
+  [SPARK-7516] [Minor] [DOC] Replace depreciated inferSchema() with createDataFrame()
+  gchen <chenguancheng@gmail.com>
+  2015-05-11 14:37:18 -0700
+  Commit: 1538b10, github.com/apache/spark/pull/6041
+
+  [SPARK-7508] JettyUtils-generated servlets to log & report all errors
+  Steve Loughran <stevel@hortonworks.com>
+  2015-05-11 13:35:06 -0700
+  Commit: 779174a, github.com/apache/spark/pull/6033
+
+  [SPARK-7462] By default retain group by columns in aggregate
+  Reynold Xin <rxin@databricks.com>, Shivaram Venkataraman <shivaram@cs.berkeley.edu>
+  2015-05-11 11:35:16 -0700
+  Commit: 9c35f02, github.com/apache/spark/pull/5996
+
+  [SPARK-7361] [STREAMING] Throw unambiguous exception when attempting to start multiple StreamingContexts in the same JVM
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-11 10:58:56 -0700
+  Commit: 11648fa, github.com/apache/spark/pull/5907
+
+  [SPARK-7522] [EXAMPLES] Removed angle brackets from dataFormat option
+  Bryan Cutler <bjcutler@us.ibm.com>
+  2015-05-11 09:23:47 -0700
+  Commit: c234d78, github.com/apache/spark/pull/6049
+
+  [SPARK-6092] [MLLIB] Add RankingMetrics in PySpark/MLlib
+  Yanbo Liang <ybliang8@gmail.com>
+  2015-05-11 09:14:20 -0700
+  Commit: 017f9fa, github.com/apache/spark/pull/6044
+
+  [SPARK-7326] [STREAMING] Performing window() on a WindowedDStream doesn't work all the time
+  Wesley Miao <wesley.miao@gmail.com>, Wesley <wesley.miao@autodesk.com>
+  2015-05-11 12:20:06 +0100
+  Commit: da1be15, github.com/apache/spark/pull/5871
+
+  [SPARK-7519] [SQL] fix minor bugs in thrift server UI
+  tianyi <tianyi.asiainfo@gmail.com>
+  2015-05-11 14:08:15 +0800
+  Commit: fff3c86, github.com/apache/spark/pull/6048
+
+  [SPARK-7512] [SPARKR] Fix RDD's show method to use getJRDD
+  Shivaram Venkataraman <shivaram@cs.berkeley.edu>
+  2015-05-10 19:49:42 -0700
+  Commit: 5f227fd, github.com/apache/spark/pull/6035
+
+  [SPARK-7427] [PYSPARK] Make sharedParams match in Scala, Python
+  Glenn Weidner <gweidner@us.ibm.com>
+  2015-05-10 19:18:32 -0700
+  Commit: 051864e, github.com/apache/spark/pull/6023
+
+  [SPARK-5521] PCA wrapper for easy transform vectors
+  Kirill A. Korinskiy <catap@catap.ru>, Joseph K. Bradley <joseph@databricks.com>
+  2015-05-10 13:34:00 -0700
+  Commit: 193ff69, github.com/apache/spark/pull/4304
+
+  [SPARK-7431] [ML] [PYTHON] Made CrossValidatorModel call parent init in PySpark
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-05-10 13:29:27 -0700
+  Commit: d49b72c, github.com/apache/spark/pull/5968
+
+  [MINOR] [SQL] Fixes variable name typo
+  Cheng Lian <lian@databricks.com>
+  2015-05-10 21:26:36 +0800
+  Commit: fd87b2a, github.com/apache/spark/pull/6038
+
+  [SPARK-7345][SQL] Spark cannot detect renamed columns using JDBC connector
+  Oleg Sidorkin <oleg.sidorkin@gmail.com>
+  2015-05-10 01:31:34 -0700
+  Commit: 5c40403, github.com/apache/spark/pull/6032
+
+  [SPARK-6091] [MLLIB] Add MulticlassMetrics in PySpark/MLlib
+  Yanbo Liang <ybliang8@gmail.com>
+  2015-05-10 00:57:14 -0700
+  Commit: fe46374, github.com/apache/spark/pull/6011
+
+  [SPARK-7475] [MLLIB] adjust ldaExample for online LDA
+  Yuhao Yang <hhbyyh@gmail.com>
+  2015-05-09 15:40:46 -0700
+  Commit: e96fc86, github.com/apache/spark/pull/6000
+
+  [BUILD] Reference fasterxml.jackson.version in sql/core/pom.xml
+  tedyu <yuzhihong@gmail.com>
+  2015-05-09 13:19:07 -0700
+  Commit: 5110f3e, github.com/apache/spark/pull/6031
+
+  Upgrade version of jackson-databind in sql/core/pom.xml
+  tedyu <yuzhihong@gmail.com>
+  2015-05-09 10:41:30 -0700
+  Commit: 6c5b9ff, github.com/apache/spark/pull/6028
+
+  [STREAMING] [DOCS] Fix wrong url about API docs of StreamingListener
+  dobashim <dobashim@oss.nttdata.co.jp>
+  2015-05-09 10:14:46 +0100
+  Commit: 5dbc7bb, github.com/apache/spark/pull/6024
+
+  [SPARK-7403] [WEBUI] Link URL in objects on Timeline View is wrong in case of running on YARN
+  Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+  2015-05-09 10:10:29 +0100
+  Commit: 869a52d, github.com/apache/spark/pull/5947
+
+  [SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinct
+  Vinod K C <vinod.kc@huawei.com>
+  2015-05-09 10:03:15 +0100
+  Commit: b0460f4, github.com/apache/spark/pull/5974
+
+  [SPARK-7498] [ML] removed varargs annotation from Params.setDefaults
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-05-08 21:55:54 -0700
+  Commit: 25972d3, github.com/apache/spark/pull/6021
+
+  [SPARK-7262] [ML] Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package
+  DB Tsai <dbt@netflix.com>
+  2015-05-08 21:43:05 -0700
+  Commit: 80bbe72, github.com/apache/spark/pull/5967
+
+  [SPARK-7375] [SQL] Avoid row copying in exchange when sort.serializeMapOutputs takes effect
+  Josh Rosen <joshrosen@databricks.com>
+  2015-05-08 22:09:55 -0400
+  Commit: 21212a2, github.com/apache/spark/pull/5948
+
+  [SPARK-7231] [SPARKR] Changes to make SparkR DataFrame dplyr friendly.
+  Shivaram Venkataraman <shivaram@cs.berkeley.edu>
+  2015-05-08 18:29:57 -0700
+  Commit: 448ff33, github.com/apache/spark/pull/6005
+
+  [SPARK-7451] [YARN] Preemption of executors is counted as failure causing Spark job to fail
+  Ashwin Shankar <ashankar@netflix.com>
+  2015-05-08 17:51:00 -0700
+  Commit: 959c7b6, github.com/apache/spark/pull/5993
+
+  [SPARK-7488] [ML] Feature Parity in PySpark for ml.recommendation
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-08 17:24:32 -0700
+  Commit: 85cab34, github.com/apache/spark/pull/6015
+
+  [SPARK-7237] Clean function in several RDD methods
+  tedyu <yuzhihong@gmail.com>
+  2015-05-08 17:16:38 -0700
+  Commit: 45b6215, github.com/apache/spark/pull/5959
+
+  [SPARK-7469] [SQL] DAG visualization: show SQL query operators
+  Andrew Or <andrew@databricks.com>
+  2015-05-08 17:15:10 -0700
+  Commit: cafffd0, github.com/apache/spark/pull/5999
+
+  [SPARK-6955] Perform port retries at NettyBlockTransferService level
+  Aaron Davidson <aaron@databricks.com>
+  2015-05-08 17:13:55 -0700
+  Commit: 1eae476, github.com/apache/spark/pull/5575
+
+  updated ec2 instance types
+  Brendan Collins <bcollins@blueraster.com>
+  2015-05-08 15:59:34 -0700
+  Commit: 6e35cb5, github.com/apache/spark/pull/6014
+
+  [SPARK-5913] [MLLIB] Python API for ChiSqSelector
+  Yanbo Liang <ybliang8@gmail.com>
+  2015-05-08 15:48:39 -0700
+  Commit: ab48df3, github.com/apache/spark/pull/5939
+
+  [SPARK-4699] [SQL] Make caseSensitive configurable in spark sql analyzer
+  Jacky Li <jacky.likun@huawei.com>, wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
+  2015-05-08 15:25:54 -0700
+  Commit: 21bd722, github.com/apache/spark/pull/5806
+
+  [SPARK-7390] [SQL] Only merge other CovarianceCounter when its count is greater than zero
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-08 14:41:16 -0700
+  Commit: 5205eb4, github.com/apache/spark/pull/5931
+
+  [SPARK-7378] [CORE] Handle deep links to unloaded apps.
+  Marcelo Vanzin <vanzin@cloudera.com>
+  2015-05-08 14:12:58 -0700
+  Commit: 3024f6b, github.com/apache/spark/pull/5922
+
+  [MINOR] [CORE] Allow History Server to read kerberos opts from config file.
+  Marcelo Vanzin <vanzin@cloudera.com>
+  2015-05-08 14:10:27 -0700
+  Commit: 3da5f8b, github.com/apache/spark/pull/5998
+
+  [SPARK-7466] DAG visualization: fix orphan nodes
+  Andrew Or <andrew@databricks.com>
+  2015-05-08 14:09:39 -0700
+  Commit: ca2f1c5, github.com/apache/spark/pull/6002
+
+  [MINOR] Defeat early garbage collection of test suite variable
+  Tim Ellison <t.p.ellison@gmail.com>
+  2015-05-08 14:08:52 -0700
+  Commit: f734c58, github.com/apache/spark/pull/6010
+
+  [SPARK-7489] [SPARK SHELL] Spark shell crashes when compiled with scala 2.11
+  vinodkc <vinod.kc.in@gmail.com>
+  2015-05-08 14:07:53 -0700
+  Commit: 3b7fb7a, github.com/apache/spark/pull/6013
+
+  [WEBUI] Remove debug feature for vis.js
+  Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+  2015-05-08 14:06:37 -0700
+  Commit: 1dde3b3, github.com/apache/spark/pull/5994
+
+  [MINOR] Ignore python/lib/pyspark.zip
+  zsxwing <zsxwing@gmail.com>
+  2015-05-08 14:06:02 -0700
+  Commit: ab0caa0, github.com/apache/spark/pull/6017
+
+  [SPARK-7490] [CORE] [Minor] MapOutputTracker.deserializeMapStatuses: close input streams
+  Evan Jones <ejones@twitter.com>
+  2015-05-08 22:00:39 +0100
+  Commit: 6230809, github.com/apache/spark/pull/5982
+
+  [SPARK-6627] Finished rename to ShuffleBlockResolver
+  Kay Ousterhout <kayousterhout@gmail.com>
+  2015-05-08 12:24:06 -0700
+  Commit: 82be68f, github.com/apache/spark/pull/5764
+
+  [SPARK-7133] [SQL] Implement struct, array, and map field accessor
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-08 11:49:38 -0700
+  Commit: f8468c4, github.com/apache/spark/pull/5744
+
+  [SPARK-7298] Harmonize style of new visualizations
+  Matei Zaharia <matei@databricks.com>
+  2015-05-08 14:41:42 -0400
+  Commit: 0b2c252, github.com/apache/spark/pull/5942
+
+  [SPARK-7436] Fixed instantiation of custom recovery mode factory and added tests
+  Jacek Lewandowski <lewandowski.jacek@gmail.com>
+  2015-05-08 11:38:09 -0700
+  Commit: 89d9487, github.com/apache/spark/pull/5976
+
+  [SPARK-6824] Fill the docs for DataFrame API in SparkR
+  hqzizania <qian.huang@intel.com>, qhuang <qian.huang@intel.com>
+  2015-05-08 11:25:04 -0700
+  Commit: 4f01f5b, github.com/apache/spark/pull/5969
+
+  [SPARK-7474] [MLLIB] update ParamGridBuilder doctest
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-08 11:16:04 -0700
+  Commit: 75fed0c, github.com/apache/spark/pull/6001
+
+  [SPARK-7383] [ML] Feature Parity in PySpark for ml.features
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-08 11:14:39 -0700
+  Commit: 85e1154, github.com/apache/spark/pull/5991
+
+  [SPARK-3454] separate json endpoints for data in the UI
+  Imran Rashid <irashid@cloudera.com>
+  2015-05-08 16:54:32 +0100
+  Commit: 532bfda, github.com/apache/spark/pull/5940
+
+  [SPARK-6869] [PYSPARK] Add pyspark archives path to PYTHONPATH
+  Lianhui Wang <lianhuiwang09@gmail.com>
+  2015-05-08 08:44:46 -0500
+  Commit: acf4bc1, github.com/apache/spark/pull/5580
+
+  [SPARK-7392] [CORE] bugfix: Kryo buffer size cannot be larger than 2M
+  Zhang, Liye <liye.zhang@intel.com>
+  2015-05-08 09:10:58 +0100
+  Commit: f5e9678, github.com/apache/spark/pull/5934
+
+  [SPARK-7232] [SQL] Add a Substitution batch for spark sql analyzer
+  wangfei <wangfei1@huawei.com>
+  2015-05-07 22:55:42 -0700
+  Commit: bb5872f, github.com/apache/spark/pull/5776
+
+  [SPARK-7470] [SQL] Spark shell SQLContext crashes without hive
+  Andrew Or <andrew@databricks.com>
+  2015-05-07 22:32:13 -0700
+  Commit: 1a3e9e9, github.com/apache/spark/pull/5997
+
+  [SPARK-6986] [SQL] Use Serializer2 in more cases.
+  Yin Huai <yhuai@databricks.com>
+  2015-05-07 20:59:42 -0700
+  Commit: 9d0d289, github.com/apache/spark/pull/5849
+
+  [SPARK-7452] [MLLIB] fix bug in topBykey and update test
+  Shuo Xiang <shuoxiangpub@gmail.com>
+  2015-05-07 20:55:08 -0700
+  Commit: 28d4238, github.com/apache/spark/pull/5990
+
+  [SPARK-6908] [SQL] Use isolated Hive client
+  Michael Armbrust <michael@databricks.com>
+  2015-05-07 19:36:24 -0700
+  Commit: 05454fd, github.com/apache/spark/pull/5876
+
+  [SPARK-7305] [STREAMING] [WEBUI] Make BatchPage show friendly information when jobs are dropped by SparkListener
+  zsxwing <zsxwing@gmail.com>
+  2015-05-07 17:34:44 -0700
+  Commit: 2e8a141, github.com/apache/spark/pull/5840
+
+  [SPARK-7450] Use UNSAFE.getLong() to speed up BitSetMethods#anySet()
+  tedyu <yuzhihong@gmail.com>
+  2015-05-07 16:53:59 -0700
+  Commit: 99897fe, github.com/apache/spark/pull/5897
+
+  [SPARK-2155] [SQL] [WHEN D THEN E] [ELSE F] add CaseKeyWhen for "CASE a WHEN b THEN c * END"
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-05-07 16:26:49 -0700
+  Commit: 622a0c5, github.com/apache/spark/pull/5979
+
+  [SPARK-5281] [SQL] Registering table on RDD is giving MissingRequirementError
+  Iulian Dragos <jaguarul@gmail.com>
+  2015-05-07 16:24:11 -0700
+  Commit: 9fd25f7, github.com/apache/spark/pull/5981
+
+  [SPARK-7277] [SQL] Throw exception if the property mapred.reduce.tasks is set to -1
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-07 16:22:45 -0700
+  Commit: 7064ea0, github.com/apache/spark/pull/5811
+
+  [SQL] [MINOR] make star and multialias extend NamedExpression
+  scwf <wangfei1@huawei.com>
+  2015-05-07 16:21:24 -0700
+  Commit: 2425e4d, github.com/apache/spark/pull/5928
+
+  [SPARK-6948] [MLLIB] compress vectors in VectorAssembler
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-07 15:45:37 -0700
+  Commit: 475143a, github.com/apache/spark/pull/5985
+
+  [SPARK-5726] [MLLIB] Elementwise (Hadamard) Vector Product Transformer
+  Octavian Geagla <ogeagla@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
+  2015-05-07 14:49:55 -0700
+  Commit: 76e58b5, github.com/apache/spark/pull/4580
+
+  [SPARK-7328] [MLLIB] [PYSPARK] Pyspark.mllib.linalg.Vectors: Missing items
+  MechCoder <manojkumarsivaraj334@gmail.com>
+  2015-05-07 14:02:05 -0700
+  Commit: 4436e26, github.com/apache/spark/pull/5872
+
+  [SPARK-7347] DAG visualization: add tooltips to RDDs
+  Andrew Or <andrew@databricks.com>
+  2015-05-07 12:29:56 -0700
+  Commit: 1b742a4, github.com/apache/spark/pull/5957
+
+  [SPARK-7391] DAG visualization: auto expand if linked from another viz
+  Andrew Or <andrew@databricks.com>
+  2015-05-07 12:29:18 -0700
+  Commit: 800c0fc, github.com/apache/spark/pull/5958
+
+  [SPARK-7373] [MESOS] Add docker support for launching drivers in mesos cluster mode.
+  Timothy Chen <tnachen@gmail.com>
+  2015-05-07 12:23:16 -0700
+  Commit: 226033c, github.com/apache/spark/pull/5917
+
+  [SPARK-7399] [SPARK CORE] Fixed compilation error in scala 2.11
+  Tijo Thomas <tijoparacka@gmail.com>
+  2015-05-07 12:21:09 -0700
+  Commit: d4e31bf, github.com/apache/spark/pull/5966
+
+  [SPARK-5213] [SQL] Remove the duplicated SparkSQLParser
+  Cheng Hao <hao.cheng@intel.com>
+  2015-05-07 12:09:54 -0700
+  Commit: 2b0c423, github.com/apache/spark/pull/5965
+
+  [SPARK-7116] [SQL] [PYSPARK] Remove cache() causing memory leak
+  ksonj <kson@siberie.de>
+  2015-05-07 12:04:19 -0700
+  Commit: 86f141c, github.com/apache/spark/pull/5973
+
+  [SPARK-1442] [SQL] [FOLLOW-UP] Address minor comments in Window Function PR (#5604).
+  Yin Huai <yhuai@databricks.com>
+  2015-05-07 11:46:49 -0700
+  Commit: 9dcf4f7, github.com/apache/spark/pull/5945
+
+  [SPARK-6093] [MLLIB] Add RegressionMetrics in PySpark/MLlib
+  Yanbo Liang <ybliang8@gmail.com>
+  2015-05-07 11:18:32 -0700
+  Commit: ef835dc, github.com/apache/spark/pull/5941
+
+  [SPARK-7118] [Python] Add the coalesce Spark SQL function available in PySpark
+  Olivier Girardot <o.girardot@lateral-thoughts.com>
+  2015-05-07 10:58:35 -0700
+  Commit: 3038b26, github.com/apache/spark/pull/5698
+
+  [SPARK-7388] [SPARK-7383] wrapper for VectorAssembler in Python
+  Burak Yavuz <brkyvz@gmail.com>, Xiangrui Meng <meng@databricks.com>
+  2015-05-07 10:25:41 -0700
+  Commit: 6b9737a, github.com/apache/spark/pull/5930
+
+  [SPARK-7330] [SQL] avoid NPE at jdbc rdd
+  Daoyuan Wang <daoyuan.wang@intel.com>
+  2015-05-07 10:05:01 -0700
+  Commit: 84ee348, github.com/apache/spark/pull/5877
+
+  [SPARK-7429] [ML] Params cleanups
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-05-07 01:28:44 -0700
+  Commit: 91ce131, github.com/apache/spark/pull/5960
+
+  [SPARK-7421] [MLLIB] OnlineLDA cleanups
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-05-07 01:12:14 -0700
+  Commit: a038c51, github.com/apache/spark/pull/5956
+
+  [SPARK-7035] Encourage __getitem__ over __getattr__ on column access in the Python DataFrame API
+  ksonj <kson@siberie.de>
+  2015-05-07 01:02:00 -0700
+  Commit: b929a75, github.com/apache/spark/pull/5971
+
+  [SPARK-7295][SQL]  bitwise operations for DataFrame DSL
+  Shiti <ssaxena.ece@gmail.com>
+  2015-05-07 01:00:29 -0700
+  Commit: 703211b, github.com/apache/spark/pull/5867
+
+  [SPARK-7217] [STREAMING] Add configuration to control the default behavior of StreamingContext.stop() implicitly calling SparkContext.stop()
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-07 00:24:44 -0700
+  Commit: cb13c98, github.com/apache/spark/pull/5929
+
+  [SPARK-7430] [STREAMING] [TEST] General improvements to streaming tests to increase debuggability
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-07 00:21:10 -0700
+  Commit: 065d114, github.com/apache/spark/pull/5961
+
+  [SPARK-5938] [SPARK-5443] [SQL] Improve JsonRDD performance
+  Nathan Howell <nhowell@godaddy.com>
+  2015-05-06 22:56:53 -0700
+  Commit: 2337ccc1, github.com/apache/spark/pull/5801
+
+  [SPARK-6812] [SPARKR] filter() on DataFrame does not work as expected.
+  Sun Rui <rui.sun@intel.com>
+  2015-05-06 22:48:16 -0700
+  Commit: 4948f42, github.com/apache/spark/pull/5938
+
+  [SPARK-7432] [MLLIB] disable cv doctest
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-06 22:29:07 -0700
+  Commit: fb4967b, github.com/apache/spark/pull/5962
+
+  [SPARK-7405] [STREAMING] Fix the bug that ReceiverInputDStream doesn't report InputInfo
+  zsxwing <zsxwing@gmail.com>
+  2015-05-06 18:07:00 -0700
+  Commit: d6e76cb, github.com/apache/spark/pull/5950
+
+  [HOT FIX] For DAG visualization #5954
+  Andrew Or <andrew@databricks.com>
+  2015-05-06 18:02:08 -0700
+  Commit: 85a644b
+
+  [SPARK-7371] [SPARK-7377] [SPARK-7408] DAG visualization addendum (#5729)
+  Andrew Or <andrew@databricks.com>
+  2015-05-06 17:52:34 -0700
+  Commit: 76e8344, github.com/apache/spark/pull/5954
+
+  [SPARK-7396] [STREAMING] [EXAMPLE] Update KafkaWordCountProducer to use new Producer API
+  jerryshao <saisai.shao@intel.com>
+  2015-05-06 17:44:43 -0700
+  Commit: ba24dfa, github.com/apache/spark/pull/5936
+
+  [SPARK-6799] [SPARKR] Remove SparkR RDD examples, add dataframe examples
+  Shivaram Venkataraman <shivaram@cs.berkeley.edu>
+  2015-05-06 17:28:11 -0700
+  Commit: 4b91e18, github.com/apache/spark/pull/5949
+
+  [HOT FIX] [SPARK-7418] Ignore flaky SparkSubmitUtilsSuite test
+  Andrew Or <andrew@databricks.com>
+  2015-05-06 17:08:39 -0700
+  Commit: c0ec20a
+
+  [SPARK-5995] [ML] Make Prediction dev API public
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-05-06 16:15:51 -0700
+  Commit: b681b93, github.com/apache/spark/pull/5913
+
+  [HOT-FIX] Move HiveWindowFunctionQuerySuite.scala to hive compatibility dir.
+  Yin Huai <yhuai@databricks.com>
+  2015-05-06 14:48:25 -0700
+  Commit: 14bcb84, github.com/apache/spark/pull/5951
+
+  Add `Private` annotation.
+  Josh Rosen <joshrosen@databricks.com>
+  2015-05-06 11:03:17 -0700
+  Commit: 2163367
+
+  [SPARK-7311] Introduce internal Serializer API for determining if serializers support object relocation
+  Josh Rosen <joshrosen@databricks.com>
+  2015-05-06 10:52:55 -0700
+  Commit: d651e28, github.com/apache/spark/pull/5924
+
+  [SPARK-1442] [SQL] Window Function Support for Spark SQL
+  Yin Huai <yhuai@databricks.com>
+  2015-05-06 10:43:00 -0700
+  Commit: b521a3b, github.com/apache/spark/pull/5604
+
+  [SPARK-6201] [SQL] promote string and do widen types for IN
+  Daoyuan Wang <daoyuan.wang@intel.com>
+  2015-05-06 10:30:42 -0700
+  Commit: 7212897, github.com/apache/spark/pull/4945
+
+  [SPARK-5456] [SQL] fix decimal compare for jdbc rdd
+  Daoyuan Wang <daoyuan.wang@intel.com>
+  2015-05-06 10:05:10 -0700
+  Commit: f1a5caf, github.com/apache/spark/pull/5803
+
+  [SQL] JavaDoc update for various DataFrame functions.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-06 08:50:56 -0700
+  Commit: 389b755, github.com/apache/spark/pull/5935
+
+  [SPARK-6940] [MLLIB] Add CrossValidator to Python ML pipeline API
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-06 01:28:43 -0700
+  Commit: 3e27a54, github.com/apache/spark/pull/5926
+
+  [SPARK-7384][Core][Tests] Fix flaky tests for distributed mode in BroadcastSuite
+  zsxwing <zsxwing@gmail.com>
+  2015-05-05 23:25:28 -0700
+  Commit: 20f9237, github.com/apache/spark/pull/5925
+
+  [SPARK-6267] [MLLIB] Python API for IsotonicRegression
+  Yanbo Liang <ybliang8@gmail.com>, Xiangrui Meng <meng@databricks.com>
+  2015-05-05 22:57:13 -0700
+  Commit: 384ac3c, github.com/apache/spark/pull/5890
+
+  [SPARK-7358][SQL] Move DataFrame mathfunctions into functions
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-05 22:56:01 -0700
+  Commit: 8aa6681, github.com/apache/spark/pull/5923
+
+  [SPARK-6841] [SPARKR] add support for mean, median, stdev etc.
+  qhuang <qian.huang@intel.com>
+  2015-05-05 20:39:56 -0700
+  Commit: b5cd7dc, github.com/apache/spark/pull/5446
+
+  Revert "[SPARK-3454] separate json endpoints for data in the UI"
+  Reynold Xin <rxin@databricks.com>
+  2015-05-05 19:28:35 -0700
+  Commit: 765f6e1
+
+  [SPARK-6231][SQL/DF] Automatically resolve join condition ambiguity for self-joins.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-05 18:59:46 -0700
+  Commit: e61083c, github.com/apache/spark/pull/5919
+
+  Some minor cleanup after SPARK-4550.
+  Sandy Ryza <sandy@cloudera.com>
+  2015-05-05 18:32:16 -0700
+  Commit: 762ff2e, github.com/apache/spark/pull/5916
+
+  [SPARK-7230] [SPARKR] Make RDD private in SparkR.
+  Shivaram Venkataraman <shivaram@cs.berkeley.edu>
+  2015-05-05 14:40:33 -0700
+  Commit: 4afb578, github.com/apache/spark/pull/5895
+
+  [SQL][Minor] make StringComparison extends ExpectsInputTypes
+  wangfei <wangfei1@huawei.com>
+  2015-05-05 14:24:37 -0700
+  Commit: b6566a2, github.com/apache/spark/pull/5905
+
+  [SPARK-7351] [STREAMING] [DOCS] Add spark.streaming.ui.retainedBatches to docs
+  zsxwing <zsxwing@gmail.com>
+  2015-05-05 13:42:23 -0700
+  Commit: 4c95fe5, github.com/apache/spark/pull/5899
+
+  [SPARK-7294][SQL] ADD BETWEEN
+  云峤 <chensong.cs@alibaba-inc.com>, kaka1992 <kaka_1992@163.com>
+  2015-05-05 13:23:53 -0700
+  Commit: c68d0e2, github.com/apache/spark/pull/5839
+
+  [SPARK-6939] [STREAMING] [WEBUI] Add timeline and histogram graphs for streaming statistics
+  zsxwing <zsxwing@gmail.com>
+  2015-05-05 12:52:16 -0700
+  Commit: 8109c9e, github.com/apache/spark/pull/5533
+
+  [SPARK-5888] [MLLIB] Add OneHotEncoder as a Transformer
+  Sandy Ryza <sandy@cloudera.com>
+  2015-05-05 12:34:02 -0700
+  Commit: 94ac9eb, github.com/apache/spark/pull/5500
+
+  [SPARK-7333] [MLLIB] Add BinaryClassificationEvaluator to PySpark
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-05 11:45:37 -0700
+  Commit: dfb6bfc, github.com/apache/spark/pull/5885
+
+  [SPARK-7243][SQL] Reduce  size for Contingency Tables in DataFrames
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-05 11:01:25 -0700
+  Commit: 598902b, github.com/apache/spark/pull/5900
+
+  [SPARK-7007] [CORE] Add a metric source for ExecutorAllocationManager
+  jerryshao <saisai.shao@intel.com>
+  2015-05-05 09:43:49 -0700
+  Commit: 29350ee, github.com/apache/spark/pull/5589
+
+  [SPARK-7318] [STREAMING] DStream cleans objects that are not closures
+  Andrew Or <andrew@databricks.com>
+  2015-05-05 09:37:49 -0700
+  Commit: acc877a, github.com/apache/spark/pull/5860
+
+  [SPARK-7237] Many user provided closures are not actually cleaned
+  Andrew Or <andrew@databricks.com>
+  2015-05-05 09:37:04 -0700
+  Commit: 01d4022, github.com/apache/spark/pull/5787
+
+  [SPARK-6612] [MLLIB] [PYSPARK] Python KMeans parity
+  Hrishikesh Subramonian <hrishikesh.subramonian@flytxt.com>
+  2015-05-05 07:57:39 -0700
+  Commit: 8b63103, github.com/apache/spark/pull/5647
+
+  [SPARK-7202] [MLLIB] [PYSPARK] Add SparseMatrixPickler to SerDe
+  MechCoder <manojkumarsivaraj334@gmail.com>
+  2015-05-05 07:53:11 -0700
+  Commit: cd55e9a, github.com/apache/spark/pull/5775
+
+  [SPARK-7350] [STREAMING] [WEBUI] Attach the Streaming tab when calling ssc.start()
+  zsxwing <zsxwing@gmail.com>
+  2015-05-05 15:09:58 +0100
+  Commit: 49923f7, github.com/apache/spark/pull/5898
+
+  [SPARK-5074] [CORE] [TESTS] Fix the flakey test 'run shuffle with map stage failure' in DAGSchedulerSuite
+  zsxwing <zsxwing@gmail.com>
+  2015-05-05 15:04:14 +0100
+  Commit: 6f35dac, github.com/apache/spark/pull/5903
+
+  [MINOR] Minor update for document
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-05 14:44:02 +0100
+  Commit: d288322, github.com/apache/spark/pull/5906
+
+  [SPARK-3454] separate json endpoints for data in the UI
+  Imran Rashid <irashid@cloudera.com>
+  2015-05-05 07:25:40 -0500
+  Commit: ff8b449, github.com/apache/spark/pull/4435
+
+  [SPARK-5112] Expose SizeEstimator as a developer api
+  Sandy Ryza <sandy@cloudera.com>
+  2015-05-05 12:38:46 +0100
+  Commit: 0327ca2, github.com/apache/spark/pull/3913
+
+  [SPARK-6653] [YARN] New config to specify port for sparkYarnAM actor system
+  shekhar.bansal <shekhar.bansal@guavus.com>
+  2015-05-05 11:09:51 +0100
+  Commit: 93af96a, github.com/apache/spark/pull/5719
+
+  [SPARK-7341] [STREAMING] [TESTS] Fix the flaky test: org.apache.spark.stre...
+  zsxwing <zsxwing@gmail.com>
+  2015-05-05 02:15:39 -0700
+  Commit: 0634510, github.com/apache/spark/pull/5891
+
+  [SPARK-7113] [STREAMING] Support input information reporting for Direct Kafka stream
+  jerryshao <saisai.shao@intel.com>
+  2015-05-05 02:01:06 -0700
+  Commit: becdb81, github.com/apache/spark/pull/5879
+
+  [HOTFIX] [TEST] Ignoring flaky tests
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-05 01:58:51 -0700
+  Commit: e8f847a, github.com/apache/spark/pull/5901
+
+  [SPARK-7139] [STREAMING] Allow received block metadata to be saved to WAL and recovered on driver failure
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-05 01:45:19 -0700
+  Commit: ae27c0e, github.com/apache/spark/pull/5732
+
+  [MINOR] [BUILD] Declare ivy dependency in root pom.
+  Marcelo Vanzin <vanzin@cloudera.com>
+  2015-05-05 08:56:16 +0100
+  Commit: 5160437, github.com/apache/spark/pull/5893
+
+  [SPARK-7314] [SPARK-3524] [PYSPARK] upgrade Pyrolite to 4.4
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-04 23:52:42 -0700
+  Commit: 21ed108, github.com/apache/spark/pull/5850
+
+  [SPARK-7236] [CORE] Fix to prevent AkkaUtils askWithReply from sleeping on final attempt
+  Bryan Cutler <bjcutler@us.ibm.com>
+  2015-05-04 18:29:22 -0700
+  Commit: 48655d1, github.com/apache/spark/pull/5896
+
+  [SPARK-7266] Add ExpectsInputTypes to expressions when possible.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-04 18:03:07 -0700
+  Commit: 1388a46, github.com/apache/spark/pull/5796
+
+  [SPARK-7243][SQL] Contingency Tables for DataFrames
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-04 17:02:49 -0700
+  Commit: ecf0d8a, github.com/apache/spark/pull/5842
+
+  [SPARK-6943] [SPARK-6944] DAG visualization on SparkUI
+  Andrew Or <andrew@databricks.com>
+  2015-05-04 16:21:36 -0700
+  Commit: 863ec0c, github.com/apache/spark/pull/5729
+
+  [SPARK-7319][SQL] Improve the output from DataFrame.show()
+  云峤 <chensong.cs@alibaba-inc.com>
+  2015-05-04 12:08:38 -0700
+  Commit: 34edaa8, github.com/apache/spark/pull/5865
+
+  [SPARK-5956] [MLLIB] Pipeline components should be copyable.
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-04 11:28:59 -0700
+  Commit: 893b310, github.com/apache/spark/pull/5820
+
+  [SPARK-5100] [SQL] add webui for thriftserver
+  tianyi <tianyi.asiainfo@gmail.com>
+  2015-05-04 16:59:34 +0800
+  Commit: 343d3bf, github.com/apache/spark/pull/5730
+
+  [SPARK-5563] [MLLIB] LDA with online variational inference
+  Yuhao Yang <hhbyyh@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
+  2015-05-04 00:06:25 -0700
+  Commit: 3539cb7, github.com/apache/spark/pull/4419
+
+  [SPARK-7241] Pearson correlation for DataFrames
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-03 21:44:39 -0700
+  Commit: 9646018, github.com/apache/spark/pull/5858
+
+  [SPARK-7329] [MLLIB] simplify ParamGridBuilder impl
+  Xiangrui Meng <meng@databricks.com>
+  2015-05-03 18:06:48 -0700
+  Commit: 1ffa8cb, github.com/apache/spark/pull/5873
+
+  [SPARK-7302] [DOCS] SPARK building documentation still mentions building for yarn 0.23
+  Sean Owen <sowen@cloudera.com>
+  2015-05-03 21:22:31 +0100
+  Commit: 9e25b09, github.com/apache/spark/pull/5863
+
+  [SPARK-6907] [SQL] Isolated client for HiveMetastore
+  Michael Armbrust <michael@databricks.com>
+  2015-05-03 13:12:50 -0700
+  Commit: daa70bf, github.com/apache/spark/pull/5851
+
+  [SPARK-7022] [PYSPARK] [ML] Add ML.Tuning.ParamGridBuilder to PySpark
+  Omede Firouz <ofirouz@palantir.com>, Omede <omedefirouz@gmail.com>
+  2015-05-03 11:42:02 -0700
+  Commit: f4af925, github.com/apache/spark/pull/5601
+
+  [SPARK-7031] [THRIFTSERVER] let thrift server take SPARK_DAEMON_MEMORY and SPARK_DAEMON_JAVA_OPTS
+  WangTaoTheTonic <wangtao111@huawei.com>
+  2015-05-03 00:47:47 +0100
+  Commit: 49549d5, github.com/apache/spark/pull/5609
+
+  [SPARK-7255] [STREAMING] [DOCUMENTATION] Added documentation for spark.streaming.kafka.maxRetries
+  BenFradet <benjamin.fradet@gmail.com>
+  2015-05-02 23:41:14 +0100
+  Commit: ea841ef, github.com/apache/spark/pull/5808
+
+  [SPARK-5213] [SQL] Pluggable SQL Parser Support
+  Cheng Hao <hao.cheng@intel.com>, scwf <wangfei1@huawei.com>
+  2015-05-02 15:20:07 -0700
+  Commit: 5d6b90d, github.com/apache/spark/pull/5827
+
+  [MINOR] [HIVE] Fix QueryPartitionSuite.
+  Marcelo Vanzin <vanzin@cloudera.com>
+  2015-05-02 23:10:35 +0100
+  Commit: 82c8c37, github.com/apache/spark/pull/5854
+
+  [SPARK-6030] [CORE] Using simulated field layout method to compute class shellSize
+  Ye Xianjin <advancedxy@gmail.com>
+  2015-05-02 23:08:09 +0100
+  Commit: bfcd528, github.com/apache/spark/pull/4783
+
+  [SPARK-7323] [SPARK CORE] Use insertAll instead of insert while merging combiners in reducer
+  Mridul Muralidharan <mridulm@yahoo-inc.com>
+  2015-05-02 23:05:51 +0100
+  Commit: da30352, github.com/apache/spark/pull/5862
+
+  [SPARK-3444] Fix typo in Dataframes.py introduced in []
+  Dean Chen <deanchen5@gmail.com>
+  2015-05-02 23:04:13 +0100
+  Commit: 856a571, github.com/apache/spark/pull/5866
+
+  [SPARK-7315] [STREAMING] [TEST] Fix flaky WALBackedBlockRDDSuite
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-05-02 01:53:14 -0700
+  Commit: ecc6eb5, github.com/apache/spark/pull/5853
+
+  [SPARK-7120] [SPARK-7121] Closure cleaner nesting + documentation + tests
+  Andrew Or <andrew@databricks.com>
+  2015-05-01 23:57:58 -0700
+  Commit: 7394e7a, github.com/apache/spark/pull/5685
+
+  [SPARK-7242] added python api for freqItems in DataFrames
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-01 23:43:24 -0700
+  Commit: 2e0f357, github.com/apache/spark/pull/5859
+
+  [SPARK-7317] [Shuffle] Expose shuffle handle
+  Mridul Muralidharan <mridulm@yahoo-inc.com>
+  2015-05-01 21:23:42 -0700
+  Commit: b79aeb9, github.com/apache/spark/pull/5857
+
+  [SPARK-6229] Add SASL encryption to network library.
+  Marcelo Vanzin <vanzin@cloudera.com>
+  2015-05-01 19:01:46 -0700
+  Commit: 38d4e9e, github.com/apache/spark/pull/5377
+
+  [SPARK-2691] [MESOS] Support for Mesos DockerInfo
+  Chris Heller <hellertime@gmail.com>
+  2015-05-01 18:41:22 -0700
+  Commit: 8f50a07, github.com/apache/spark/pull/3074
+
+  [SPARK-6443] [SPARK SUBMIT] Could not submit app in standalone cluster mode when HA is enabled
+  WangTaoTheTonic <wangtao111@huawei.com>
+  2015-05-01 18:38:20 -0700
+  Commit: b4b43df, github.com/apache/spark/pull/5116
+
+  [SPARK-7216] [MESOS] Add driver details page to Mesos cluster UI.
+  Timothy Chen <tnachen@gmail.com>
+  2015-05-01 18:36:42 -0700
+  Commit: 2022193, github.com/apache/spark/pull/5763
+
+  [SPARK-6954] [YARN] ExecutorAllocationManager can end up requesting a negative n...
+  Sandy Ryza <sandy@cloudera.com>
+  2015-05-01 18:32:46 -0700
+  Commit: 099327d, github.com/apache/spark/pull/5704
+
+  [SPARK-3444] Provide an easy way to change log level
+  Holden Karau <holden@pigscanfly.ca>
+  2015-05-01 18:02:10 -0700
+  Commit: ae98eec, github.com/apache/spark/pull/5791
+
+  [SPARK-2808][Streaming][Kafka] update kafka to 0.8.2
+  cody koeninger <cody@koeninger.org>, Helena Edelson <helena.edelson@datastax.com>
+  2015-05-01 17:54:56 -0700
+  Commit: 4786484, github.com/apache/spark/pull/4537
+
+  [SPARK-7112][Streaming][WIP] Add a InputInfoTracker to track all the input streams
+  jerryshao <saisai.shao@intel.com>, Saisai Shao <saisai.shao@intel.com>
+  2015-05-01 17:46:06 -0700
+  Commit: b88c275, github.com/apache/spark/pull/5680
+
+  [SPARK-7309] [CORE] [STREAMING] Shutdown the thread pools in ReceivedBlockHandler and DAGScheduler
+  zsxwing <zsxwing@gmail.com>
+  2015-05-01 17:41:55 -0700
+  Commit: ebc25a4, github.com/apache/spark/pull/5845
+
+  [SPARK-6999] [SQL] Remove the infinite recursive method (useless)
+  Cheng Hao <hao.cheng@intel.com>
+  2015-05-01 19:39:30 -0500
+  Commit: 98e7045, github.com/apache/spark/pull/5804
+
+  [SPARK-7304] [BUILD] Include $@ in call to mvn consistently in make-distribution.sh
+  Rajendra Gokhale (rvgcentos) <rvg@cloudera.com>
+  2015-05-01 17:01:36 -0700
+  Commit: e6fb377, github.com/apache/spark/pull/5846
+
+  [SPARK-7312][SQL] SPARK-6913 broke jdk6 build
+  Yin Huai <yhuai@databricks.com>
+  2015-05-01 16:47:00 -0700
+  Commit: 41c6a44, github.com/apache/spark/pull/5847
+
+  Ignore flakey test in SparkSubmitUtilsSuite
+  Patrick Wendell <patrick@databricks.com>
+  2015-05-01 14:42:58 -0700
+  Commit: 5c1faba
+
+  [SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS
+  Hari Shreedharan <hshreedharan@apache.org>
+  2015-05-01 15:32:09 -0500
+  Commit: b1f4ca8, github.com/apache/spark/pull/5823
+
+  [SPARK-7240][SQL] Single pass covariance calculation for dataframes
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-05-01 13:29:17 -0700
+  Commit: 4dc8d74, github.com/apache/spark/pull/5825
+
+  [SPARK-7281] [YARN] Add option to set AM's lib path in client mode.
+  Marcelo Vanzin <vanzin@cloudera.com>
+  2015-05-01 21:20:46 +0100
+  Commit: 7b5dd3e, github.com/apache/spark/pull/5813
+
+  [SPARK-7213] [YARN] Check for read permissions before copying a Hadoop config file
+  Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
+  2015-05-01 21:14:16 +0100
+  Commit: f53a488, github.com/apache/spark/pull/5760
+
+  Revert "[SPARK-7224] added mock repository generator for --packages tests"
+  Patrick Wendell <patrick@databricks.com>
+  2015-05-01 13:01:43 -0700
+  Commit: c6d9a42
+
+  Revert "[SPARK-7287] enabled fixed test"
+  Patrick Wendell <patrick@databricks.com>
+  2015-05-01 13:01:14 -0700
+  Commit: 58d6584
+
+  [SPARK-7274] [SQL] Create Column expression for array/struct creation.
+  Reynold Xin <rxin@databricks.com>
+  2015-05-01 12:49:02 -0700
+  Commit: 3753776, github.com/apache/spark/pull/5802
+
+  [SPARK-7183] [NETWORK] Fix memory leak of TransportRequestHandler.streamIds
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-01 11:59:12 -0700
+  Commit: 1686032, github.com/apache/spark/pull/5743
+
+  [SPARK-6846] [WEBUI] [HOTFIX] return to GET for kill link in UI since YARN AM won't proxy POST
+  Sean Owen <sowen@cloudera.com>
+  2015-05-01 19:57:37 +0100
+  Commit: 1262e31, github.com/apache/spark/pull/5837
+
+  [SPARK-5854] personalized page rank
+  Dan McClary <dan.mcclary@gmail.com>, dwmclary <dan.mcclary@gmail.com>
+  2015-05-01 11:55:43 -0700
+  Commit: 7d42722, github.com/apache/spark/pull/4774
+
+  changing persistence engine trait to an abstract class
+  niranda <niranda.perera@gmail.com>
+  2015-05-01 11:27:45 -0700
+  Commit: 27de6fe, github.com/apache/spark/pull/5832
+
+  Limit help option regex
+  Chris Biow <chris.biow@10gen.com>
+  2015-05-01 19:26:55 +0100
+  Commit: c8c481d, github.com/apache/spark/pull/5816
+
+  [SPARK-5891] [ML] Add Binarizer ML Transformer
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-05-01 08:31:01 -0700
+  Commit: 7630213, github.com/apache/spark/pull/5699
+
+  [SPARK-3066] [MLLIB] Support recommendAll in matrix factorization model
+  Debasish Das <debasish.das@one.verizon.com>, Xiangrui Meng <meng@databricks.com>
+  2015-05-01 08:27:46 -0700
+  Commit: 3b514af, github.com/apache/spark/pull/3098
+
+  [SPARK-4705] Handle multiple app attempts event logs, history server.
+  Marcelo Vanzin <vanzin@cloudera.com>, twinkle sachdeva <twinkle@kite.ggn.in.guavus.com>, twinkle.sachdeva <twinkle.sachdeva@guavus.com>, twinkle sachdeva <twinkle.sachdeva@guavus.com>
+  2015-05-01 09:50:55 -0500
+  Commit: 3052f49, github.com/apache/spark/pull/5432
+
+  [SPARK-3468] [WEBUI] Timeline-View feature
+  Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+  2015-05-01 01:39:56 -0700
+  Commit: 7fe0f3f, github.com/apache/spark/pull/2342
+
+  [SPARK-6257] [PYSPARK] [MLLIB] MLlib API missing items in Recommendation
+  MechCoder <manojkumarsivaraj334@gmail.com>
+  2015-04-30 23:51:00 -0700
+  Commit: c24aeb6, github.com/apache/spark/pull/5807
+
+  [SPARK-7291] [CORE] Fix a flaky test in AkkaRpcEnvSuite
+  zsxwing <zsxwing@gmail.com>
+  2015-04-30 23:44:33 -0700
+  Commit: 14b3288, github.com/apache/spark/pull/5822
+
+  [SPARK-7287] enabled fixed test
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-30 23:39:58 -0700
+  Commit: 7cf1eb7, github.com/apache/spark/pull/5826
+
+  [SPARK-4550] In sort-based shuffle, store map outputs in serialized form
+  Sandy Ryza <sandy@cloudera.com>
+  2015-04-30 23:14:14 -0700
+  Commit: 0a2b15c, github.com/apache/spark/pull/4450
+
+  HOTFIX: Disable buggy dependency checker
+  Patrick Wendell <patrick@databricks.com>
+  2015-04-30 22:39:58 -0700
+  Commit: a9fc505
+
+  [SPARK-6479] [BLOCK MANAGER] Create off-heap block storage API
+  Zhan Zhang <zhazhan@gmail.com>
+  2015-04-30 22:24:31 -0700
+  Commit: 36a7a68, github.com/apache/spark/pull/5430
+
+  [SPARK-7248] implemented random number generators for DataFrames
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-30 21:56:03 -0700
+  Commit: b5347a4, github.com/apache/spark/pull/5819
+
+  [SPARK-7282] [STREAMING] Fix the race conditions in StreamingListenerSuite
+  zsxwing <zsxwing@gmail.com>
+  2015-04-30 21:32:11 -0700
+  Commit: 69a739c, github.com/apache/spark/pull/5812
+
+  Revert "[SPARK-5213] [SQL] Pluggable SQL Parser Support"
+  Patrick Wendell <patrick@databricks.com>
+  2015-04-30 20:33:36 -0700
+  Commit: beeafcf
+
+  [SPARK-7123] [SQL] support table.star in sqlcontext
+  scwf <wangfei1@huawei.com>
+  2015-04-30 18:50:14 -0700
+  Commit: 473552f, github.com/apache/spark/pull/5690
+
+  [SPARK-5213] [SQL] Pluggable SQL Parser Support
+  Cheng Hao <hao.cheng@intel.com>
+  2015-04-30 18:49:06 -0700
+  Commit: 3ba5aaa, github.com/apache/spark/pull/4015
+
+  [SPARK-6913][SQL] Fixed "java.sql.SQLException: No suitable driver found"
+  Vyacheslav Baranov <slavik.baranov@gmail.com>
+  2015-04-30 18:45:14 -0700
+  Commit: e991255, github.com/apache/spark/pull/5782
+
+  [SPARK-7109] [SQL] Push down left side filter for left semi join
+  wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
+  2015-04-30 18:18:54 -0700
+  Commit: a0d8a61, github.com/apache/spark/pull/5677
+
+  [SPARK-7093] [SQL] Using newPredicate in NestedLoopJoin to enable code generation
+  scwf <wangfei1@huawei.com>
+  2015-04-30 18:15:56 -0700
+  Commit: 0797338, github.com/apache/spark/pull/5665
+
+  [SPARK-7280][SQL] Add "drop" column/s on a data frame
+  rakeshchalasani <vnit.rakesh@gmail.com>
+  2015-04-30 17:42:50 -0700
+  Commit: ee04413, github.com/apache/spark/pull/5818
+
+  [SPARK-7242][SQL][MLLIB] Frequent items for DataFrames
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-30 16:40:32 -0700
+  Commit: 149b3ee, github.com/apache/spark/pull/5799
+
+  [SPARK-7279] Removed diffSum which is theoretical zero in LinearRegression and coding formating
+  DB Tsai <dbt@netflix.com>
+  2015-04-30 16:26:51 -0700
+  Commit: 1c3e402, github.com/apache/spark/pull/5809
+
+  [Build] Enable MiMa checks for SQL
+  Josh Rosen <joshrosen@databricks.com>
+  2015-04-30 16:23:01 -0700
+  Commit: fa01bec, github.com/apache/spark/pull/5727
+
+  [SPARK-7267][SQL]Push down Project when it's child is Limit
+  Zhongshuai Pei <799203320@qq.com>, DoingDone9 <799203320@qq.com>
+  2015-04-30 15:22:13 -0700
+  Commit: 77cc25f, github.com/apache/spark/pull/5797
+
+  [SPARK-7288] Suppress compiler warnings due to use of sun.misc.Unsafe; add facade in front of Unsafe; remove use of Unsafe.setMemory
+  Josh Rosen <joshrosen@databricks.com>
+  2015-04-30 15:21:00 -0700
+  Commit: 07a8620, github.com/apache/spark/pull/5814
+
+  [SPARK-7196][SQL] Support precision and scale of decimal type for JDBC
+  Liang-Chi Hsieh <viirya@gmail.com>
+  2015-04-30 15:13:43 -0700
+  Commit: 6702324, github.com/apache/spark/pull/5777
+
+  Revert "[SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS"
+  Patrick Wendell <patrick@databricks.com>
+  2015-04-30 14:59:20 -0700
+  Commit: e0628f2
+
+  [SPARK-7207] [ML] [BUILD] Added ml.recommendation, ml.regression to SparkBuild
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-04-30 14:39:27 -0700
+  Commit: adbdb19, github.com/apache/spark/pull/5758
+
+  [SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS
+  Hari Shreedharan <hshreedharan@apache.org>
+  2015-04-30 13:03:23 -0500
+  Commit: 6c65da6, github.com/apache/spark/pull/4688
+
+  [SPARK-7224] added mock repository generator for --packages tests
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-30 10:19:08 -0700
+  Commit: 7dacc08, github.com/apache/spark/pull/5790
+
+  [HOTFIX] Disabling flaky test (fix in progress as part of SPARK-7224)
+  Patrick Wendell <patrick@databricks.com>
+  2015-04-30 01:02:33 -0700
+  Commit: 47bf406
+
+  [SPARK-1406] Mllib pmml model export
+  Vincenzo Selvaggio <vselvaggio@hotmail.it>, Xiangrui Meng <meng@databricks.com>, selvinsource <vselvaggio@hotmail.it>
+  2015-04-29 23:21:21 -0700
+  Commit: 254e050, github.com/apache/spark/pull/3062
+
+  [SPARK-7225][SQL] CombineLimits optimizer does not work
+  Zhongshuai Pei <799203320@qq.com>, DoingDone9 <799203320@qq.com>
+  2015-04-29 22:44:14 -0700
+  Commit: 4459514, github.com/apache/spark/pull/5770
+
+  Some code clean up.
+  DB Tsai <dbt@netflix.com>
+  2015-04-29 21:44:41 -0700
+  Commit: ba49eb1, github.com/apache/spark/pull/5794
+
+  [SPARK-7156][SQL] Addressed follow up comments for randomSplit
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-29 19:13:47 -0700
+  Commit: 5553198, github.com/apache/spark/pull/5795
+
+  [SPARK-7234][SQL] Fix DateType mismatch when codegen on.
+  云峤 <chensong.cs@alibaba-inc.com>
+  2015-04-29 18:23:42 -0700
+  Commit: 7143f6e, github.com/apache/spark/pull/5778
+
+  [SPARK-6862] [STREAMING] [WEBUI] Add BatchPage to display details of a batch
+  zsxwing <zsxwing@gmail.com>
+  2015-04-29 18:22:14 -0700
+  Commit: 1b7106b, github.com/apache/spark/pull/5473
+
+  [SPARK-7176] [ML] Add validation functionality to Param
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-04-29 17:26:46 -0700
+  Commit: 114bad6, github.com/apache/spark/pull/5740
+
+  [SQL] [Minor] Print detail query execution info when spark answer is not right
+  wangfei <wangfei1@huawei.com>
+  2015-04-29 17:00:24 -0700
+  Commit: 1fdfdb4, github.com/apache/spark/pull/5774
+
+  [SPARK-7259] [ML] VectorIndexer: do not copy non-ML metadata to output column
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-04-29 16:35:17 -0700
+  Commit: b1ef6a6, github.com/apache/spark/pull/5789
+
+  [SPARK-7229] [SQL] SpecificMutableRow should take integer type as internal representation for Date
+  Cheng Hao <hao.cheng@intel.com>
+  2015-04-29 16:23:34 -0700
+  Commit: f8cbb0a, github.com/apache/spark/pull/5772
+
+  [SPARK-7155] [CORE] Allow newAPIHadoopFile to support comma-separated list of files as input
+  yongtang <yongtang@users.noreply.github.com>
+  2015-04-29 23:55:51 +0100
+  Commit: 3fc6cfd, github.com/apache/spark/pull/5708
+
+  [SPARK-7181] [CORE] fix inifite loop in Externalsorter's mergeWithAggregation
+  Qiping Li <liqiping1991@gmail.com>
+  2015-04-29 23:52:16 +0100
+  Commit: 7f4b583, github.com/apache/spark/pull/5737
+
+  [SPARK-7156][SQL] support RandomSplit in DataFrames
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-29 15:34:05 -0700
+  Commit: d7dbce8, github.com/apache/spark/pull/5761
+
+  [SPARK-6529] [ML] Add Word2Vec transformer
+  Xusen Yin <yinxusen@gmail.com>
+  2015-04-29 14:55:32 -0700
+  Commit: c9d530e, github.com/apache/spark/pull/5596
+
+  [SPARK-7222] [ML] Added mathematical derivation in comment and compressed the model, removed the correction terms in LinearRegression with ElasticNet
+  DB Tsai <dbt@netflix.com>
+  2015-04-29 14:53:37 -0700
+  Commit: 15995c8, github.com/apache/spark/pull/5767
+
+  [SPARK-6629] cancelJobGroup() may not work for jobs whose job groups are inherited from parent threads
+  Josh Rosen <joshrosen@databricks.com>
+  2015-04-29 13:31:52 -0700
+  Commit: 3a180c1, github.com/apache/spark/pull/5288
+
+  [SPARK-6752] [STREAMING] [REOPENED] Allow StreamingContext to be recreated from checkpoint and existing SparkContext
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-04-29 13:10:31 -0700
+  Commit: a9c4e29, github.com/apache/spark/pull/5773
+
+  [SPARK-7056] [STREAMING] Make the Write Ahead Log pluggable
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-04-29 13:06:11 -0700
+  Commit: 1868bd4, github.com/apache/spark/pull/5645
+
+  Fix a typo of "threshold"
+  Xusen Yin <yinxusen@gmail.com>
+  2015-04-29 10:13:48 -0700
+  Commit: c0c0ba6, github.com/apache/spark/pull/5769
+
+  [SQL][Minor] fix java doc for DataFrame.agg
+  Wenchen Fan <cloud0fan@outlook.com>
+  2015-04-29 09:49:24 -0700
+  Commit: 81ea42b, github.com/apache/spark/pull/5712
+
+  Better error message on access to non-existing attribute
+  ksonj <kson@siberie.de>
+  2015-04-29 09:48:47 -0700
+  Commit: 3df9c5d, github.com/apache/spark/pull/5771
+
+  [SPARK-7223] Rename RPC askWithReply -> askWithReply, sendWithReply -> ask.
+  Reynold Xin <rxin@databricks.com>
+  2015-04-29 09:46:37 -0700
+  Commit: 687273d, github.com/apache/spark/pull/5768
+
+  [SPARK-6918] [YARN] Secure HBase support.
+  Dean Chen <deanchen5@gmail.com>
+  2015-04-29 08:58:33 -0500
+  Commit: baed3f2, github.com/apache/spark/pull/5586
+
+  [SPARK-7076][SPARK-7077][SPARK-7080][SQL] Use managed memory for aggregations
+  Josh Rosen <joshrosen@databricks.com>
+  2015-04-29 01:07:26 -0700
+  Commit: f49284b, github.com/apache/spark/pull/5725
+
+  [SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operations
+  Patrick Wendell <patrick@databricks.com>
+  2015-04-29 00:35:08 -0700
+  Commit: 1fd6ed9, github.com/apache/spark/pull/5757
+
+  [SPARK-7188] added python support for math DataFrame functions
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-29 00:09:24 -0700
+  Commit: fe917f5, github.com/apache/spark/pull/5750
+
+  MAINTENANCE: Automated closing of pull requests.
+  Patrick Wendell <patrick@databricks.com>
+  2015-04-28 23:38:59 -0700
+  Commit: 8dee274, github.com/apache/spark/pull/3205
+
+  [SPARK-7205] Support `.ivy2/local` and `.m2/repositories/` in --packages
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-28 23:05:02 -0700
+  Commit: f98773a, github.com/apache/spark/pull/5755
+
+  [SPARK-7215] made coalesce and repartition a part of the query plan
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-28 22:48:04 -0700
+  Commit: 271c4c6, github.com/apache/spark/pull/5762
+
+  [SPARK-6756] [MLLIB] add toSparse, toDense, numActives, numNonzeros, and compressed to Vector
+  Xiangrui Meng <meng@databricks.com>
+  2015-04-28 21:49:53 -0700
+  Commit: 5ef006f, github.com/apache/spark/pull/5756
+
+  [SPARK-7208] [ML] [PYTHON] Added Matrix, SparseMatrix to __all__ list in linalg.py
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-04-28 21:15:47 -0700
+  Commit: a8aeadb, github.com/apache/spark/pull/5759
+
+  [SPARK-7138] [STREAMING] Add method to BlockGenerator to add multiple records to BlockGenerator with single callback
+  Tathagata Das <tathagata.das1565@gmail.com>
+  2015-04-28 19:31:57 -0700
+  Commit: 5c8f4bd, github.com/apache/spark/pull/5695
+
+  [SPARK-6965] [MLLIB] StringIndexer handles numeric input.
+  Xiangrui Meng <meng@databricks.com>
+  2015-04-28 17:41:09 -0700
+  Commit: d36e673, github.com/apache/spark/pull/5753
+
+  Closes #4807 Closes #5055 Closes #3583
+  Xiangrui Meng <meng@databricks.com>
+  2015-04-28 14:21:25 -0700
+  Commit: 555213e
+
+  [SPARK-7201] [MLLIB] move Identifiable to ml.util
+  Xiangrui Meng <meng@databricks.com>
+  2015-04-28 14:07:26 -0700
+  Commit: f0a1f90, github.com/apache/spark/pull/5749
+
+  [MINOR] [CORE] Warn users who try to cache RDDs with dynamic allocation on.
+  Marcelo Vanzin <vanzin@cloudera.com>
+  2015-04-28 13:49:29 -0700
+  Commit: 28b1af7, github.com/apache/spark/pull/5751
+
+  [SPARK-5338] [MESOS] Add cluster mode support for Mesos
+  Timothy Chen <tnachen@gmail.com>, Luc Bourlier <luc.bourlier@typesafe.com>
+  2015-04-28 13:31:08 -0700
+  Commit: 53befac, github.com/apache/spark/pull/5144
+
+  [SPARK-6314] [CORE] handle JsonParseException for history server
+  Zhang, Liye <liye.zhang@intel.com>
+  2015-04-28 12:33:48 -0700
+  Commit: 8009810, github.com/apache/spark/pull/5736
+
+  [SPARK-5932] [CORE] Use consistent naming for size properties
+  Ilya Ganelin <ilya.ganelin@capitalone.com>
+  2015-04-28 12:18:55 -0700
+  Commit: 2d222fb, github.com/apache/spark/pull/5574
+
+  [SPARK-4286] Add an external shuffle service that can be run as a daemon.
+  Iulian Dragos <jaguarul@gmail.com>
+  2015-04-28 12:08:18 -0700
+  Commit: 8aab94d, github.com/apache/spark/pull/4990
+
+  [Core][test][minor] replace try finally block with tryWithSafeFinally
+  Zhang, Liye <liye.zhang@intel.com>
+  2015-04-28 10:24:00 -0700
+  Commit: 52ccf1d, github.com/apache/spark/pull/5739
+
+  [SPARK-7140] [MLLIB] only scan the first 16 entries in Vector.hashCode
+  Xiangrui Meng <meng@databricks.com>
+  2015-04-28 09:59:36 -0700
+  Commit: b14cd23, github.com/apache/spark/pull/5697
+
+  [SPARK-5253] [ML] LinearRegression with L1/L2 (ElasticNet) using OWLQN
+  DB Tsai <dbt@netflix.com>, DB Tsai <dbtsai@alpinenow.com>
+  2015-04-28 09:46:08 -0700
+  Commit: 6a827d5, github.com/apache/spark/pull/4259
+
+  [SPARK-6435] spark-shell --jars option does not add all jars to classpath
+  Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
+  2015-04-28 07:55:21 -0400
+  Commit: 268c419, github.com/apache/spark/pull/5227
+
+  [SPARK-7100] [MLLIB] Fix persisted RDD leak in GradientBoostTrees
+  Jim Carroll <jim@dontcallme.com>
+  2015-04-28 07:51:02 -0400
+  Commit: 75905c5, github.com/apache/spark/pull/5669
+
+  [SPARK-7168] [BUILD] Update plugin versions in Maven build and centralize versions
+  Sean Owen <sowen@cloudera.com>
+  2015-04-28 07:48:34 -0400
+  Commit: 7f3b3b7, github.com/apache/spark/pull/5720
+
+  [SPARK-6352] [SQL] Custom parquet output committer
+  Pei-Lun Lee <pllee@appier.com>
+  2015-04-28 16:50:18 +0800
+  Commit: e13cd86, github.com/apache/spark/pull/5525
+
+  [SPARK-7135][SQL] DataFrame expression for monotonically increasing IDs.
+  Reynold Xin <rxin@databricks.com>
+  2015-04-28 00:39:08 -0700
+  Commit: d94cd1a, github.com/apache/spark/pull/5709
+
+  [SPARK-7187] SerializationDebugger should not crash user code
+  Andrew Or <andrew@databricks.com>
+  2015-04-28 00:38:14 -0700
+  Commit: bf35edd, github.com/apache/spark/pull/5734
+
+  [SPARK-5946] [STREAMING] Add Python API for direct Kafka stream
+  jerryshao <saisai.shao@intel.com>, Saisai Shao <saisai.shao@intel.com>
+  2015-04-27 23:48:02 -0700
+  Commit: 9e4e82b, github.com/apache/spark/pull/4723
+
+  [SPARK-6829] Added math functions for DataFrames
+  Burak Yavuz <brkyvz@gmail.com>
+  2015-04-27 23:10:14 -0700
+  Commit: 29576e7, github.com/apache/spark/pull/5616
+
+  [SPARK-7174][Core] Move calling `TaskScheduler.executorHeartbeatReceived` to another thread
+  zsxwing <zsxwing@gmail.com>
+  2015-04-27 21:45:40 -0700
+  Commit: 874a2ca, github.com/apache/spark/pull/5723
+
+  [SPARK-7090] [MLLIB] Introduce LDAOptimizer to LDA to further improve extensibility
+  Yuhao Yang <hhbyyh@gmail.com>
+  2015-04-27 19:02:51 -0700
+  Commit: 4d9e560, github.com/apache/spark/pull/5661
+
+  [SPARK-7162] [YARN] Launcher error in yarn-client
+  GuoQiang Li <witgo@qq.com>
+  2015-04-27 19:52:41 -0400
+  Commit: 62888a4, github.com/apache/spark/pull/5716
+
+  [SPARK-7145] [CORE] commons-lang (2.x) classes used instead of commons-lang3 (3.x); commons-io used without dependency
+  Sean Owen <sowen@cloudera.com>
+  2015-04-27 19:50:55 -0400
+  Commit: ab5adb7, github.com/apache/spark/pull/5703
+
+  [SPARK-3090] [CORE] Stop SparkContext if user forgets to.
+  Marcelo Vanzin <vanzin@cloudera.com>
+  2015-04-27 19:46:17 -0400
+  Commit: 5d45e1f, github.com/apache/spark/pull/5696
+
+  [SPARK-6738] [CORE] Improve estimate the size of a large array
+  Hong Shen <hongshen@tencent.com>
+  2015-04-27 18:57:31 -0400
+  Commit: 8e1c00d, github.com/apache/spark/pull/5608
+
+  [SPARK-7103] Fix crash with SparkContext.union when RDD has no partitioner
+  Steven She <steven@canopylabs.com>
+  2015-04-27 18:55:02 -0400
+  Commit: b9de9e0, github.com/apache/spark/pull/5679
+
+  [SPARK-6991] [SPARKR] Adds support for zipPartitions.
+  hlin09 <hlin09pu@gmail.com>
+  2015-04-27 15:04:37 -0700
+  Commit: ca9f4eb, github.com/apache/spark/pull/5568
+
+  SPARK-7107 Add parameter for zookeeper.znode.parent to hbase_inputformat...
+  tedyu <yuzhihong@gmail.com>
+  2015-04-27 14:42:40 -0700
+  Commit: ef82bdd, github.com/apache/spark/pull/5673
+
+  [SPARK-6856] [R] Make RDD information more useful in SparkR
+  Jeff Harrison <jeffrharrison@gmail.com>
+  2015-04-27 13:38:25 -0700
+  Commit: 7078f60, github.com/apache/spark/pull/5667
+
+  [SPARK-4925] Publish Spark SQL hive-thriftserver maven artifact
+  Misha Chernetsov <chernetsov@gmail.com>
+  2015-04-27 11:27:56 -0700
+  Commit: 998aac2, github.com/apache/spark/pull/5429
+
+  [SPARK-6505] [SQL] Remove the reflection call in HiveFunctionWrapper
+  baishuo <vc_java@hotmail.com>
+  2015-04-27 14:08:05 +0800
+  Commit: 82bb7fd, github.com/apache/spark/pull/5660
+
+  [SQL][Minor] rename DataTypeParser.apply to DataTypeParser.parse
+  wangfei <wangfei1@huawei.com>
+  2015-04-26 21:08:47 -0700
+  Commit: d188b8b, github.com/apache/spark/pull/5710
+
+  [SPARK-7152][SQL] Add a Column expression for partition ID.
+  Reynold Xin <rxin@databricks.com>
+  2015-04-26 11:46:58 -0700
+  Commit: ca55dc9, github.com/apache/spark/pull/5705
+
+  [MINOR] [MLLIB] Refactor toString method in MLLIB
+  Alain <aihe@usc.edu>
+  2015-04-26 07:14:24 -0400
+  Commit: 9a5bbe0, github.com/apache/spark/pull/5687
+
+  [SPARK-6014] [CORE] [HOTFIX] Add try-catch block around ShutDownHook
+  Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
+  2015-04-25 20:02:23 -0400
+  Commit: f5473c2, github.com/apache/spark/pull/5672
+
+  [SPARK-7092] Update spark scala version to 2.11.6
+  Prashant Sharma <prashant.s@imaginea.com>
+  2015-04-25 18:07:34 -0400
+  Commit: a11c868, github.com/apache/spark/pull/5662
+
+  [SQL] Update SQL readme to include instructions on generating golden answer files based on Hive 0.13.1.
+  Yin Huai <yhuai@databricks.com>
+  2015-04-25 13:43:39 -0700
+  Commit: aa6966f, github.com/apache/spark/pull/5702
+
+  [SPARK-6113] [ML] Tree ensembles for Pipelines API
+  Joseph K. Bradley <joseph@databricks.com>
+  2015-04-25 12:27:19 -0700
+  Commit: a7160c4, github.com/apache/spark/pull/5626
+
+  Revert "[SPARK-6752][Streaming] Allow StreamingContext to be recreated from checkpoint and existing SparkContext"
+  Patrick Wendell <patrick@databricks.com>
+  2015-04-25 10:37:34 -0700
+  Commit: a61d65f
+
+  update the deprecated CountMinSketchMonoid function to TopPctCMS function
+  KeheCAI <caikehe@gmail.com>
+  2015-04-25 08:42:38 -0400
+  Commit: cca9905, github.com/apache/spark/pull/5629
+
+  [SPARK-7136][Docs] Spark SQL and DataFrame Guide fix example file and paths
+  Deborah Siegel <deborah.siegel@gmail.com>, DEBORAH SIEGEL <deborahsiegel@d-140-142-0-49.dhcp4.washington.edu>, DEBORAH SIEGEL <deborahsiegel@DEBORAHs-MacBook-Pro.local>, DEBORAH SIEGEL <deborahsiegel@d-69-91-154-197.dhcp4.washington.edu>
+  2015-04-24 20:25:07 -0700
+  Commit: 59b7cfc, github.com/apache/spark/pull/5693
+
+  [PySpark][Minor] Update sql example, so that can read file correctly
+  linweizhong <linweizhong@huawei.com>
+  2015-04-24 20:23:19 -0700
+  Commit: d874f8b, github.com/apache/spark/pull/5684
+
+  [SPARK-6122] [CORE] Upgrade tachyon-client version to 0.6.3
+  Calvin Jia <jia.calvin@gmail.com>
+  2015-04-24 17:57:41 -0400
+  Commit: 438859e, github.com/apache/spark/pull/5354
+
+  [SPARK-6852] [SPARKR] Accept numeric as numPartitions in SparkR.
+  Sun Rui <rui.sun@intel.com>
+  2015-04-24 12:52:07 -0700
+  Commit: caf0136, github.com/apache/spark/pull/5613
+
+  [SPARK-7033] [SPARKR] Clean usage of split. Use partition instead where applicable.
+  Sun Rui <rui.sun@intel.com>
+  2015-04-24 11:00:19 -0700
+  Commit: ebb77b2, github.com/apache/spark/pull/5628
+
+  [SPARK-6528] [ML] Add IDF transformer
+  Xusen Yin <yinxusen@gmail.com>
+  2015-04-24 08:29:49 -0700
+  Commit: 6e57d57, github.com/apache/spark/pull/5266
+
+  [SPARK-7115] [MLLIB] skip the very first 1 in poly expansion
+  Xiangrui Meng <meng@databricks.com>
+  2015-04-24 08:27:48 -0700
+  Commit: 78b39c7, github.com/apache/spark/pull/5681
+
+  [SPARK-5894] [ML] Add polynomial mapper
+  Xusen Yin <yinxusen@gmail.com>, Xiangrui Meng <meng@databricks.com>
+  2015-04-24 00:39:29 -0700
+  Commit: 8509519, github.com/apache/spark/pull/5245
+
+  Fixed a typo from the previous commit.
+  Reynold Xin <rxin@databricks.com>
+  2015-04-23 22:39:00 -0700
+  Commit: 4c722d7
+
+  [SQL] Fixed expression data type matching.
+  Reynold Xin <rxin@databricks.com>
+  2015-04-23 21:21:03 -0700
+  Commit: d3a302d, github.com/apache/spark/pull/5675
+
+  Update sql-programming-guide.md
+  Ken Geis <geis.ken@gmail.com>
+  2015-04-23 20:45:33 -0700
+  Commit: 67bccbd, github.com/apache/spark/pull/5674
+
+  [SPARK-7060][SQL] Add alias function to python dataframe
+  Yin Huai <yhuai@databricks.com>
+  2015-04-23 18:52:55 -0700
+  Commit: 2d010f7, github.com/apache/spark/pull/5634
+
+  [SPARK-7037] [CORE] Inconsistent behavior for non-spark config properties in spark-shell and spark-submit
+  Cheolsoo Park <cheolsoop@netflix.com>
+  20

<TRUNCATED>

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@spark.apache.org
For additional commands, e-mail: commits-help@spark.apache.org


Mime
View raw message