cassandra-commits mailing list archives

From "Johan Oskarsson (JIRA)" <j...@apache.org>
Subject [jira] Created: (CASSANDRA-890) Get Hadoop input format sub splits in parallel
Date Sat, 13 Mar 2010 16:14:27 GMT
Get Hadoop input format sub splits in parallel
----------------------------------------------

                 Key: CASSANDRA-890
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-890
             Project: Cassandra
          Issue Type: Improvement
          Components: Contrib
            Reporter: Johan Oskarsson


To improve Hadoop job startup time, we can multithread parts of the input format. Specifically,
the fetching of "sub splits" from many nodes can be run in parallel.
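
As a rough illustration of the idea, and not the actual patch, the fetches could be submitted to a thread pool and collected in order. Everything named here is an assumption made for the sketch: the class ParallelSubSplits, the method fetchAll, and the fetcher function, which stands in for whatever call asks one node for the sub splits of one range. Ranges and splits are plain strings to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

// Hypothetical sketch: none of these names come from the Cassandra code base.
final class ParallelSubSplits {

    // Fetches the sub splits for every range concurrently and returns them
    // concatenated in the same order as the input ranges.
    static List<String> fetchAll(List<String> ranges,
                                 Function<String, List<String>> fetcher,
                                 int threads) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            // Submit one task per range; each task would talk to one node.
            List<Future<List<String>>> futures = new ArrayList<>();
            for (String range : ranges) {
                Callable<List<String>> task = () -> fetcher.apply(range);
                futures.add(pool.submit(task));
            }
            // Wait for each result in submission order so the output is deterministic.
            List<String> splits = new ArrayList<>();
            for (Future<List<String>> future : futures) {
                splits.addAll(future.get());
            }
            return splits;
        } finally {
            // Release the worker threads whether or not a fetch failed.
            pool.shutdownNow();
        }
    }
}
```

With this shape, total startup time would be bounded by the slowest node rather than the sum over all nodes, as long as the pool has enough threads; a failure on any one fetch surfaces as an exception from future.get().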

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

