hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hive/Roadmap" by JeffHammerbacher
Date Thu, 30 Oct 2008 06:43:22 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by JeffHammerbacher:
http://wiki.apache.org/hadoop/Hive/Roadmap

------------------------------------------------------------------------------
  
  = 10/27/08 Roadmap Update =
  
- 1. Integrating Dynamic SerDe with the DDL. (Zheng/Pete) - This allows the users to create
typed tables along with list and map types from the DDL
+ # Integrating Dynamic SerDe with the DDL. (Zheng/Pete) - This allows the users to create
typed tables along with list and map types from the DDL
- 2. Support for Statistics. (Ashish) - These stats are needed to make optimization decisions
+ # Support for Statistics. (Ashish) - These stats are needed to make optimization decisions
- 3. Join Optimizations. (Prasad) - Mapside joins, semi join techniques etc to do the join
faster
+ # Join Optimizations. (Prasad) - Mapside joins, semi join techniques etc to do the join
faster
- 4. Predicate Pushdown Optimizations. (Namit) - pushing predicates just above the table scan
for certain situations in joins as well as ensuring that only required columns are sent across
map/reduce boundaries
+ # Predicate Pushdown Optimizations. (Namit) - pushing predicates just above the table scan
for certain situations in joins as well as ensuring that only required columns are sent across
map/reduce boundaries
- 5. Group By Optimizations. (Joydeep) - various optimizations to make group by faster
+ # Group By Optimizations. (Joydeep) - various optimizations to make group by faster
- 6. Optimizations to reduce the number of map files created by filter operations. (Dhrubha)
- Filters with a large number of mappers produces a lot of files which slows down the following
operations. This tries to address problems with that.
+ # Optimizations to reduce the number of map files created by filter operations. (Dhrubha)
- Filters with a large number of mappers produces a lot of files which slows down the following
operations. This tries to address problems with that.
- 7. Transformations in LOAD. (Joydeep) - LOAD currently does not transform the input data
if it is not in the format expected by the destination table.
+ # Transformations in LOAD. (Joydeep) - LOAD currently does not transform the input data
if it is not in the format expected by the destination table.
- 8. Schemaless map/reduce. (Zheng) - TRANSFORM needs schema while map/reduce is schema less.
+ # Schemaless map/reduce. (Zheng) - TRANSFORM needs schema while map/reduce is schema less.
- 9. Improvements to TRANSFORM. (Zheng) - Make this more intuitive to map/reduce developers
- evaluate some other keywords etc..
+ # Improvements to TRANSFORM. (Zheng) - Make this more intuitive to map/reduce developers
- evaluate some other keywords etc..
- 10. Error Reporting Improvements. (Pete) - Make error reporting for parse errors better
+ # Error Reporting Improvements. (Pete) - Make error reporting for parse errors better
- 11. Help on CLI. (Joydeep) - add help to the CLI
+ # Help on CLI. (Joydeep) - add help to the CLI
- 12. Explode and Collect Operators. (Zheng) - Explode and collect operators to convert collections
to individual items and vice versa.
+ # Explode and Collect Operators. (Zheng) - Explode and collect operators to convert collections
to individual items and vice versa.
- 13. Propagating sort properties to destination tables. (Prasad) - If the query produces
sorted we want to capture that in the destination table's metadata so that downstream optimizations
can be enabled.
+ # Propagating sort properties to destination tables. (Prasad) - If the query produces sorted
we want to capture that in the destination table's metadata so that downstream optimizations
can be enabled.
  
  Other contributions from outside FB ...
- 1. JDBC driver (Michi Mutsuzaki @ stanford.edu, Raghu @ stanford.edu)
+ # JDBC driver (Michi Mutsuzaki @ stanford.edu, Raghu @ stanford.edu)
- 2. Fixes to CLI driver (Jeremy Huylebroeck)
+ # Fixes to CLI driver (Jeremy Huylebroeck)
- 3. Web interface...
+ # Web interface...
  
  = Roadmap/call to add more features =
  The following is the list of useful features that are on the Hive Roadmap:

Mime
View raw message