carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jacky Li (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CARBONDATA-1014) Refactor on data loading and open interface for format extension
Date Wed, 03 May 2017 08:39:04 GMT
Jacky Li created CARBONDATA-1014:
------------------------------------

             Summary: Refactor on data loading and open interface for format extension
                 Key: CARBONDATA-1014
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-1014
             Project: CarbonData
          Issue Type: Improvement
            Reporter: Jacky Li
             Fix For: 1.2.0


Refactor on current data loading flow to make it:
1. Use vectorized processing as early as possible
2. Make index build (sorting) CPU cache efficient, by using rowId and key column vector to
sort
3. Open interface for format extension, including column encoding, compression, statistics.

Design doc will be posted in this JIRA soon.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message