apex-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chandni Singh (JIRA)" <j...@apache.org>
Subject [jira] [Created] (APEXMALHAR-2063) Integrate WAL to FS WindowDataManager
Date Fri, 22 Apr 2016 17:27:12 GMT
Chandni Singh created APEXMALHAR-2063:
-----------------------------------------

             Summary: Integrate WAL to FS WindowDataManager
                 Key: APEXMALHAR-2063
                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2063
             Project: Apache Apex Malhar
          Issue Type: Improvement
            Reporter: Chandni Singh
            Assignee: Chandni Singh


FS Window Data Manager is used to save meta-data that helps in replaying tuples every completed
application window after failure. For this it saves meta-data in a file per window. Having
multiple small size files on hdfs cause issues as highlighted here:
http://blog.cloudera.com/blog/2009/02/the-small-files-problem/

Instead FS Window Data Manager can utilize the WAL to write data and maintain a mapping of
how much data was flushed to WAL each window.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message