spark-issues mailing list archives

From "Ruslan Dautkhanov (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (SPARK-10935) Avito Context Ad Clicks
Date Sat, 06 Feb 2016 23:16:39 GMT

    [ https://issues.apache.org/jira/browse/SPARK-10935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136053#comment-15136053 ]

Ruslan Dautkhanov edited comment on SPARK-10935 at 2/6/16 11:16 PM:
--------------------------------------------------------------------

I noticed outer joins. Spark before 1.6 used a Cartesian product to produce outer joins (SPARK-11111).
That will not work well with larger datasets.


was (Author: tagar):
I noticed outer joins. Spark before 1.5 used cartesian product to produce outer joins - SPARK-11111.
That will not work well with larger datasets. Fixed in 1.6.

> Avito Context Ad Clicks
> -----------------------
>
>                 Key: SPARK-10935
>                 URL: https://issues.apache.org/jira/browse/SPARK-10935
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>            Reporter: Xiangrui Meng
>
> From [~kplazo@gmail.com]:
> I would love to do Avito Context Ad Clicks - https://www.kaggle.com/c/avito-context-ad-clicks
> - but it involves a lot of feature engineering and preprocessing. I would love to split this
> with somebody else if anybody is interested in working on this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


