drill-dev mailing list archives

From Shadi Khalifa <khal...@cs.queensu.ca>
Subject Re: [HANGOUT] Topics for 10/04/16
Date Tue, 04 Oct 2016 04:26:06 GMT
I have been working on integrating WEKA into Drill to support building and scoring classification
models. I have succeeded in supporting all WEKA classifiers and running them in a
distributed fashion over Drill 1.2. Classifier accuracy is not affected by running in
a distributed fashion, and training and scoring times get a large boost from Drill.
A paper on this was published at the IEEE symposium on Big Data in June 2016 [available: http://cs.queensu.ca/~khalifa/qdrill/QDrill_20160212IEEE_CameraReady.pdf],
and we are now in the process of publishing another paper in which QDrill supports all WEKA
algorithms. FYI, this can easily be extended to support clustering and other types of WEKA
algorithms. The architecture also allows supporting other data mining libraries.

The QDrill project website is http://cs.queensu.ca/~khalifa/qdrill. The downloadable
version there is a little bit old, but I'm planning to upload an updated stable version within
the next 10 days. I'm also using an SVN repository and am planning to move the project to GitHub
to make it easier to pick up the latest Drill versions and maybe to integrate with Drill at some
point.

Unfortunately, I have another meeting tomorrow at the same time as the hangout, but I would
love to know your opinion and to discuss the process of evaluating this extension and maybe
integrating it with Drill at some point.
Shadi Khalifa
PhD Candidate
School of Computing
Queen's University
Canada
I'm just a neuron in the society collective brain

01001001 00100000 01101100 01101111 01110110 01100101 00100000 01000101 01100111 01111001
01110000 01110100 
Please consider your environmental responsibility before printing this e-mail


    On Monday, October 3, 2016 10:52 PM, Laurent Goujon <laurent@dremio.com> wrote:


I'm currently working on improving metadata support for both the JDBC
driver and the C++ connector, more specifically the following JIRAs:

DRILL-4853: Update C++ protobuf source files
DRILL-4420: Server-side metadata and prepared-statement support for C++
DRILL-4880: Support JDBC driver registration using ServiceLoader
DRILL-4925: Add tableType filter to GetTables metadata query
DRILL-4730: Update JDBC DatabaseMetaData implementation to use new Metadata

I have already opened multiple pull requests for those (the list is available
at https://github.com/apache/drill/pulls/laurentgo)

I'm planning to join tomorrow's hangout in case people have questions about



On Mon, Oct 3, 2016 at 10:28 AM, Subbu Srinivasan <ssrinivasan@zscaler.com>
wrote:

> Can we close on https://github.com/apache/drill/pull/518 ?
> On Mon, Oct 3, 2016 at 10:27 AM, Sudheesh Katkam <sudheesh@apache.org>
> wrote:
> > Hi drillers,
> >
> > Our bi-weekly hangout is tomorrow (10/04/16, 10 AM PT). If you have any
> > suggestions for hangout topics, you can add them to this thread. We will
> > also ask around at the beginning of the hangout for topics.
> >
> > Thank you,
> > Sudheesh
> >
