hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joydeep Sen Sarma (JIRA)" <j...@apache.org>
Subject [jira] Created: (HIVE-1486) optimize estimation of number of reducers and local mode
Date Sun, 25 Jul 2010 15:56:51 GMT
optimize estimation of number of reducers and local mode
--------------------------------------------------------

                 Key: HIVE-1486
                 URL: https://issues.apache.org/jira/browse/HIVE-1486
             Project: Hadoop Hive
          Issue Type: Improvement
          Components: Query Processor
            Reporter: Joydeep Sen Sarma


Hive uses file system metadata to estimate number of reducers and to determine if jobs can
be executed locally. This currently looks up file system metadata about each path serially
and can take a long time in case number of files is very high.

instead we can lookup part of the input space and try to approximate the size etc. summaries


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message