saluc |
PySpark: breakdown application execution time and fine-tuning |
Sat, 17 Oct, 10:10 |
Bharath Ravi Kumar |
Re: Spark on Mesos / Executor Memory |
Sat, 17 Oct, 12:08 |
Deenar Toraskar |
Re: How to have Single reference of a class in Spark Streaming? |
Sat, 17 Oct, 12:13 |
Bharath Ravi Kumar |
Re: Spark on Mesos / Executor Memory |
Sat, 17 Oct, 12:40 |
Raghavendra Pandey |
Re: Complex transformation on a dataframe column |
Sat, 17 Oct, 12:55 |
Raghavendra Pandey |
Re: repartition vs partitionby |
Sat, 17 Oct, 12:57 |
Raghavendra Pandey |
Re: s3a file system and spark deployment mode |
Sat, 17 Oct, 13:07 |
shahid ashraf |
Re: repartition vs partitionby |
Sat, 17 Oct, 13:14 |
Kali.tumm...@gmail.com |
can I use Spark as alternative for gem fire cache ? |
Sat, 17 Oct, 13:28 |
Ndjido Ardo Bar |
Re: can I use Spark as alternative for gem fire cache ? |
Sat, 17 Oct, 18:05 |
Kali.tumm...@gmail.com |
Output println info in LogMessage Info ? |
Sat, 17 Oct, 18:40 |
Adrian Tanase |
Spark Streaming scheduler delay VS driver.cores |
Sat, 17 Oct, 19:58 |
Adrian Tanase |
Re: repartition vs partitionby |
Sat, 17 Oct, 20:25 |
Gavin Yue |
Should I convert json into parquet? |
Sat, 17 Oct, 21:07 |
Marco Mistroni |
Re: Problem installing Spark on Windows 8 |
Sat, 17 Oct, 21:51 |
jatinganhotra |
Checkpointing calls the job twice? |
Sun, 18 Oct, 03:38 |
unk1102 |
callUdf("percentile_approx",col("mycol"),lit(0.25)) does not compile spark 1.5.1 source but it does work in spark 1.5.1 bin |
Sun, 18 Oct, 07:10 |
tarek.abouzei...@yahoo.com.INVALID |
Re: Spark handling parallel requests |
Sun, 18 Oct, 07:26 |
tarek.abouzei...@yahoo.com.INVALID |
Re: Spark handling parallel requests |
Sun, 18 Oct, 07:28 |
Oded Maimon |
Spark Streaming - use the data in different jobs |
Sun, 18 Oct, 11:49 |
anshu shukla |
REST api to avoid spark context creation |
Sun, 18 Oct, 12:13 |
VJ Anand |
No suitable Constructor found while compiling |
Sun, 18 Oct, 13:39 |
Raghavendra Pandey |
Re: REST api to avoid spark context creation |
Sun, 18 Oct, 14:09 |
Ted Yu |
Re: No suitable Constructor found while compiling |
Sun, 18 Oct, 14:18 |
Ted Yu |
Re: callUdf("percentile_approx",col("mycol"),lit(0.25)) does not compile spark 1.5.1 source but it does work in spark 1.5.1 bin |
Sun, 18 Oct, 15:50 |
igor.berman |
our spark gotchas report while creating batch pipeline |
Sun, 18 Oct, 15:51 |
Umesh Kacha |
Re: callUdf("percentile_approx",col("mycol"),lit(0.25)) does not compile spark 1.5.1 source but it does work in spark 1.5.1 bin |
Sun, 18 Oct, 15:58 |
Ted Yu |
Re: our spark gotchas report while creating batch pipeline |
Sun, 18 Oct, 16:07 |
Kali.tumm...@gmail.com |
Pass spark partition explicitly ? |
Sun, 18 Oct, 17:56 |
Richard Eggert |
Re: Pass spark partition explicitly ? |
Sun, 18 Oct, 18:05 |
sri hari kali charan Tummala |
Re: Pass spark partition explicitly ? |
Sun, 18 Oct, 18:22 |
Jorge Sánchez |
Re: dataframes and numPartitions |
Sun, 18 Oct, 19:46 |
Jorge Sánchez |
Re: How VectorIndexer works in Spark ML pipelines |
Sun, 18 Oct, 19:54 |
Jia Zhan |
Re: In-memory computing and cache() in Spark |
Sun, 18 Oct, 20:28 |
Igor Berman |
Re: our spark gotchas report while creating batch pipeline |
Sun, 18 Oct, 20:29 |
Mustafa Elbehery |
Indexing Support |
Sun, 18 Oct, 21:16 |
Jerry Lam |
Re: Indexing Support |
Sun, 18 Oct, 22:26 |
Russ Weeks |
Re: Indexing Support |
Sun, 18 Oct, 23:10 |
Ted Yu |
Re: callUdf("percentile_approx",col("mycol"),lit(0.25)) does not compile spark 1.5.1 source but it does work in spark 1.5.1 bin |
Mon, 19 Oct, 03:02 |
ReeceRobinson |
Spark SQL Thriftserver and Hive UDF in Production |
Mon, 19 Oct, 03:04 |
Mohammed Guller |
RE: Spark SQL Thriftserver and Hive UDF in Production |
Mon, 19 Oct, 03:42 |
shahid ashraf |
Re: repartition vs partitionby |
Mon, 19 Oct, 05:14 |
Sonal Goyal |
Re: In-memory computing and cache() in Spark |
Mon, 19 Oct, 05:32 |
Jörn Franke |
Re: Should I convert json into parquet? |
Mon, 19 Oct, 05:32 |
fahad shah |
pyspark groupbykey throwing error: unpack requires a string argument of length 4 |
Mon, 19 Oct, 05:42 |
Jeff Zhang |
Re: pyspark groupbykey throwing error: unpack requires a string argument of length 4 |
Mon, 19 Oct, 06:17 |
fahad shah |
Re: pyspark groupbykey throwing error: unpack requires a string argument of length 4 |
Mon, 19 Oct, 06:28 |
Igor Berman |
Re: In-memory computing and cache() in Spark |
Mon, 19 Oct, 06:32 |
Akhil Das |
Re: Spark handling parallel requests |
Mon, 19 Oct, 06:34 |
Chandra Mohan, Ananda Vel Murugan |
RE: Get the previous state string in Spark streaming |
Mon, 19 Oct, 07:12 |
ZhuGe |
master die and worker registration failed with duplicated worker id |
Mon, 19 Oct, 07:43 |
fahad shah |
best way to generate per key auto increment numerals after sorting |
Mon, 19 Oct, 08:11 |
Ewan Leith |
RE: Should I convert json into parquet? |
Mon, 19 Oct, 09:31 |
Ewan Leith |
RE: Spark Streaming - use the data in different jobs |
Mon, 19 Oct, 09:34 |
mas |
Re: Incrementally add/remove vertices in GraphX |
Mon, 19 Oct, 10:36 |
Umesh Kacha |
Re: callUdf("percentile_approx",col("mycol"),lit(0.25)) does not compile spark 1.5.1 source but it does work in spark 1.5.1 bin |
Mon, 19 Oct, 11:00 |
shahid ashraf |
SHUFFLE in PARTITIONBY or shuffle in general |
Mon, 19 Oct, 11:16 |
vaibhavrtk |
Is one batch created by Streaming Context always equal to one RDD? |
Mon, 19 Oct, 11:39 |
YiZhi Liu |
How to take user jars precedence over Spark jars |
Mon, 19 Oct, 12:07 |
Eugene Chepurniy |
Spark executor on Mesos - how to set effective user id? |
Mon, 19 Oct, 12:14 |
varun sharma |
Issue in spark batches |
Mon, 19 Oct, 12:48 |
Eugen Cepoi |
spark streaming failing to replicate blocks |
Mon, 19 Oct, 12:51 |
Jerry Lam |
Re: Spark executor on Mesos - how to set effective user id? |
Mon, 19 Oct, 13:05 |
Adrian Tanase |
Re: Spark Streaming - use the data in different jobs |
Mon, 19 Oct, 13:43 |
SLiZn Liu |
Re: Spark executor on Mesos - how to set effective user id? |
Mon, 19 Oct, 13:45 |
Adrian Tanase |
Re: Should I convert json into parquet? |
Mon, 19 Oct, 13:47 |
shahid |
Re: How does shuffle work in spark ? |
Mon, 19 Oct, 13:54 |
Ted Yu |
Re: How to take user jars precedence over Spark jars |
Mon, 19 Oct, 14:23 |
Zhiliang Zhu |
[Spark MLlib] How to apply spark ml given models for questions with general background |
Mon, 19 Oct, 14:46 |
YiZhi Liu |
Re: How to take user jars precedence over Spark jars |
Mon, 19 Oct, 14:47 |
Ted Yu |
Re: How to take user jars precedence over Spark jars |
Mon, 19 Oct, 14:52 |
nunomrc |
flattening a JSON data structure |
Mon, 19 Oct, 15:08 |
Deenar Toraskar |
Re: Spark SQL Thriftserver and Hive UDF in Production |
Mon, 19 Oct, 15:22 |
Fernando Velasco |
k-prototypes in MLLib? |
Mon, 19 Oct, 15:38 |
Adrian Tanase |
Re: Spark handling parallel requests |
Mon, 19 Oct, 15:49 |
shahid ashraf |
Fwd: SHUFFLE in PARTITIONBY or shuffle in general |
Mon, 19 Oct, 15:57 |
Todd Nist |
Re: Spark SQL Thriftserver and Hive UDF in Production |
Mon, 19 Oct, 16:01 |
peay2 |
pyspark: results differ based on whether persist() has been called |
Mon, 19 Oct, 16:04 |
Jia Zhan |
Re: In-memory computing and cache() in Spark |
Mon, 19 Oct, 17:22 |
Jia Zhan |
Re: In-memory computing and cache() in Spark |
Mon, 19 Oct, 17:25 |
Shepherd |
How to calculate row by now and output results in Spark |
Mon, 19 Oct, 17:35 |
Davies Liu |
Re: pyspark: results differ based on whether persist() has been called |
Mon, 19 Oct, 17:40 |
Davies Liu |
Re: best way to generate per key auto increment numerals after sorting |
Mon, 19 Oct, 17:45 |
fahad shah |
Re: best way to generate per key auto increment numerals after sorting |
Mon, 19 Oct, 17:52 |
Davies Liu |
Re: pyspark groupbykey throwing error: unpack requires a string argument of length 4 |
Mon, 19 Oct, 17:52 |
fahad shah |
Re: pyspark groupbykey throwing error: unpack requires a string argument of length 4 |
Mon, 19 Oct, 18:03 |
tarek.abouzei...@yahoo.com.INVALID |
Re: Spark handling parallel requests |
Mon, 19 Oct, 18:06 |
ahaider3 |
Storing Compressed data in HDFS into Spark |
Mon, 19 Oct, 18:13 |
Alex Nastetsky |
writing avro parquet |
Mon, 19 Oct, 18:14 |
gbop |
new 1.5.1 behavior - exception on executor throws ClassNotFound on driver |
Mon, 19 Oct, 18:15 |
Ted Yu |
Re: How to calculate row by now and output results in Spark |
Mon, 19 Oct, 18:16 |
Ted Yu |
Re: new 1.5.1 behavior - exception on executor throws ClassNotFound on driver |
Mon, 19 Oct, 18:18 |
Lij Tapel |
Re: new 1.5.1 behavior - exception on executor throws ClassNotFound on driver |
Mon, 19 Oct, 18:26 |
Younes Naguib |
RE: Dynamic partition pruning |
Mon, 19 Oct, 18:33 |
franklyn |
Differentiate Spark streaming in event logs |
Mon, 19 Oct, 18:47 |
Ted Yu |
Re: new 1.5.1 behavior - exception on executor throws ClassNotFound on driver |
Mon, 19 Oct, 18:51 |
Adrian Tanase |
Re: How does shuffle work in spark ? |
Mon, 19 Oct, 18:56 |
Adrian Tanase |
Re: Differentiate Spark streaming in event logs |
Mon, 19 Oct, 18:58 |
Lij Tapel |
Re: new 1.5.1 behavior - exception on executor throws ClassNotFound on driver |
Mon, 19 Oct, 19:02 |
Adrian Tanase |
Re: How to calculate row by now and output results in Spark |
Mon, 19 Oct, 19:03 |