tajo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Min Zhou (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TAJO-472) Umbrella ticket for accelerating query speed through memory cached table
Date Fri, 03 Jan 2014 00:16:51 GMT
Min Zhou created TAJO-472:
-----------------------------

             Summary: Umbrella ticket for accelerating query speed through memory cached table
                 Key: TAJO-472
                 URL: https://issues.apache.org/jira/browse/TAJO-472
             Project: Tajo
          Issue Type: New Feature
          Components: distributed query plan, physical operator
            Reporter: Min Zhou
            Assignee: Min Zhou


Previously, I was involved as a technical expert into an in-memory database for on-line businesses
in Alibaba group. That's  an internal project, which can do group by aggregation on billions
of rows in less than 1 second.  

I'd like to apply this technology into tajo, make it much faster than it is. From some benchmark,
 we believe that spark&shark currently is the fastest solution among all the open source
interactive query system , such as impala, presto, tajo.  The main reason is that it benefit
from in-memory data. 

I will take memory cached table as my first step to  accelerate query speed of tajo. Actually
, this is the reason why I concerned at table partition during Xmas and new year holidays.


Will submit a proposal soon.



  



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message