spark-reviews mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From jkbradley <>
Subject [GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing
Date Wed, 28 Sep 2016 22:18:24 GMT
Github user jkbradley commented on the issue:
    > Our use case is mainly using similarity join to find fraud trips. I think I can change
the NN-search to only single-probing NN search of dataframe if you think it's fine. What do
you think?
    I'd stick with what I said before: I like the current methods provided for approxNearestNeighbors
and approxSimilarityJoin, but we could add a version of approxNearestNeighbors taking a DataFrame
instead of a single key in a future PR.

If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at or file a JIRA ticket
with INFRA.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message