carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jacky Li (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (CARBONDATA-318) Implement an ExternalSorter that makes maximum usage of memory while sorting
Date Sat, 15 Oct 2016 05:40:20 GMT

     [ https://issues.apache.org/jira/browse/CARBONDATA-318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jacky Li updated CARBONDATA-318:
--------------------------------
    Description: 
External Sorter should sort in memory until it reach configured size, then spill to disk.
It should provide following interface:
1. insertRow/insertRowBatch: insert rows into the sorter
2. getIterator: return an iterator that iterate on sorted rows

External Sorter depends on FileWriterFactory to get a FileWriter to spill data into files.
FileWriterFactory should be provided by configuration. Multiple implementations are possible,
like writing into one folder or multiple folders

  was:
External Sorter should sort in memory until it reach configured size, then spill to disk.
It should provide following interface:
1. insertRow/insertRowBatch: insert rows into the sorter
2. getIterator: return an iterator that iterate on sorted rows

External Sorter depends on FileWriterFactory to get a FileWriter to spill data into files.
FileWriterFactory should be provided by user. Multiple implementations are possible, like
writing into one folder or multiple folders


> Implement an ExternalSorter that makes maximum usage of memory while sorting
> ----------------------------------------------------------------------------
>
>                 Key: CARBONDATA-318
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-318
>             Project: CarbonData
>          Issue Type: Sub-task
>            Reporter: Jacky Li
>             Fix For: 0.2.0-incubating
>
>
> External Sorter should sort in memory until it reach configured size, then spill to disk.
It should provide following interface:
> 1. insertRow/insertRowBatch: insert rows into the sorter
> 2. getIterator: return an iterator that iterate on sorted rows
> External Sorter depends on FileWriterFactory to get a FileWriter to spill data into files.
FileWriterFactory should be provided by configuration. Multiple implementations are possible,
like writing into one folder or multiple folders



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message