cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Paul Pak (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-6927) Create a CQL3 based bulk OutputFormat
Date Thu, 27 Mar 2014 14:41:16 GMT


Paul Pak commented on CASSANDRA-6927:

One aspect that doesn't line up perfectly is that CQLSSTableWriter.addRow() and .rawAddRow()
methods simply take the column values or name-value pairs as parameters into the stored procedure,
while Hadoop's RecordWriter.write() method separates its parameters by keys and values.  My
plan is to have the new writer typed with <List<ByteBuffer>, List<ByteBuffer>>,
and when the .write(List<ByteBuffer>, List<ByteBuffer>) method internally calls
CQLSSTableWriter.rawAddRow(List<ByteBuffer>), just append the values list to the keys

> Create a CQL3 based bulk OutputFormat
> -------------------------------------
>                 Key: CASSANDRA-6927
>                 URL:
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Hadoop
>            Reporter: Paul Pak
>            Priority: Minor
>              Labels: cql3, hadoop
> This is the CQL compatible version of BulkOutputFormat.  CqlOutputFormat exists, but
doesn't write SSTables directly, similar to ColumnFamilyOutputFormat for thrift.

This message was sent by Atlassian JIRA

View raw message