hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Arun C Murthy (JIRA)" <j...@apache.org>
Subject [jira] Updated: (MAPREDUCE-775) Add input/output formatters for Vertica clustered ADBMS.
Date Mon, 03 Aug 2009 16:39:15 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Arun C Murthy updated MAPREDUCE-775:

    Status: Open  (was: Patch Available)

Most likely one of the newer files in the patch is missing the Apache License header...

> Add input/output formatters for Vertica clustered ADBMS.
> --------------------------------------------------------
>                 Key: MAPREDUCE-775
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-775
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>            Reporter: Omer Trajman
>             Fix For: 0.21.0
>         Attachments: MAPREDUCE-775.patch
> Add native support for Vertica as an input or output format taking advantage of parallel
read and write properties of the DBMS.
> On the input side allow for parametrized queries (a la prepared statements) and create
a split for each combination of parameters.  Also support the parameter list to be generated
from a sql statement.  For example - return metrics for all dimensions that meet criteria
X with one input split for each dimension.  Divide the read among any number of hosts in the
Vertica cluster.
> On the output side, support Vertica streaming load to any number of hosts in the Vertica
cluster.  Output may be to a different cluster than input.
> Also includes Input and Output formatters that support streaming interface.
> Code has been tested and run on live systems under 19 and 20.  Patch for 21 with new
API will be ready end of this week.  

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message