hive-dev mailing list archives

From "Eugene Koifman (JIRA)" <>
Subject [jira] [Created] (HIVE-17215) Streaming Ingest API writing unbucketed tables
Date Mon, 31 Jul 2017 17:48:00 GMT
Eugene Koifman created HIVE-17215:

             Summary: Streaming Ingest API writing unbucketed tables
                 Key: HIVE-17215
             Project: Hive
          Issue Type: Sub-task
          Components: Transactions
            Reporter: Eugene Koifman
            Assignee: Eugene Koifman

Currently the Streaming Ingest API expects the target table to be bucketed.
It creates one writer per bucket per connection/partition.
The simplest change is to let the API create a single writer for unbucketed tables.
If that doesn't provide enough write throughput, the client can open another connection.

We could add a parameter to the API to specify writer parallelism for unbucketed tables.  If
it's set to 2, for example, the writers would write delta_x_y_0000 and delta_x_y_0001, using
statementId as the suffix.  Maybe as a follow-up.
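
To make the naming idea concrete, here is a minimal sketch of how a parallelism setting could map to delta directory names. It is not part of the Streaming Ingest API: the class DeltaNameSketch and the method deltaDirs are hypothetical, x and y stand in for the transaction range exactly as in the description above, and the four-digit zero-padded statementId suffix is an assumption based on the delta_x_y_0000 example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not a Hive class.
class DeltaNameSketch {

    // Returns one delta directory name per writer, using the writer's
    // index as the statementId suffix (assumed to be 4 digits, zero-padded).
    static List<String> deltaDirs(String minTxn, String maxTxn, int parallelism) {
        if (parallelism < 1) {
            throw new IllegalArgumentException("parallelism must be >= 1");
        }
        List<String> names = new ArrayList<>();
        for (int statementId = 0; statementId < parallelism; statementId++) {
            names.add(String.format("delta_%s_%s_%04d", minTxn, maxTxn, statementId));
        }
        return names;
    }

    public static void main(String[] args) {
        // With parallelism 2 this prints delta_x_y_0000 and delta_x_y_0001.
        for (String name : deltaDirs("x", "y", 2)) {
            System.out.println(name);
        }
    }
}
```

With parallelism 1 the sketch yields only delta_x_y_0000, which corresponds to the single-writer case proposed as the simplest option.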

