apex-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Thomas Weise <...@apache.org>
Subject Python support
Date Thu, 15 Sep 2016 21:06:21 GMT

Python (not Jython) seems to be a popular language and frequently used for
data analysis, especially where flexibility matters. It has a comprehensive
library and it is generally considered low barrier to entry. I have also
seen Python used in critical back-end components, although that's probably
not very common?

I think Python support could potentially expand the user base for Apex.
There are 2 main areas that can be considered:

1) Support to execute Python code through an operator
2) A client API that lets users construct pipelines in Python

The former can exist without the latter. And it would enable users to
leverage existing code that otherwise would have to be rewritten in a JVM
language. The engine could ship scripts/packages so they are automatically
distributed on the cluster.

A useful client API probably requires back-end support for lambda functions
and more complex UDFs.

Would be great to get some feedback, especially from those that have
experience with Python, on how an integration could potentially open up new
use cases for Apex.


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message