hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Omer Trajman (JIRA)" <j...@apache.org>
Subject [jira] Updated: (MAPREDUCE-775) Add input/output formatters for Vertica clustered ADBMS.
Date Tue, 04 Aug 2009 20:54:15 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Omer Trajman updated MAPREDUCE-775:
-----------------------------------

    Status: Patch Available  (was: Open)

take 2

> Add input/output formatters for Vertica clustered ADBMS.
> --------------------------------------------------------
>
>                 Key: MAPREDUCE-775
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-775
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>            Reporter: Omer Trajman
>             Fix For: 0.21.0
>
>         Attachments: MAPREDUCE-775.patch
>
>
> Add native support for Vertica as an input or output format taking advantage of parallel
read and write properties of the DBMS.
>  
> On the input side allow for parametrized queries (a la prepared statements) and create
a split for each combination of parameters.  Also support the parameter list to be generated
from a sql statement.  For example - return metrics for all dimensions that meet criteria
X with one input split for each dimension.  Divide the read among any number of hosts in the
Vertica cluster.
>  
> On the output side, support Vertica streaming load to any number of hosts in the Vertica
cluster.  Output may be to a different cluster than input.
>  
> Also includes Input and Output formatters that support streaming interface.
> Code has been tested and run on live systems under 19 and 20.  Patch for 21 with new
API will be ready end of this week.  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message