cassandra-commits mailing list archives

From "Adam Holmberg (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-9304) COPY TO improvements
Date Thu, 05 Nov 2015 15:46:27 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-9304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991849#comment-14991849 ]

Adam Holmberg commented on CASSANDRA-9304:
------------------------------------------

bq. If this is not enough and you want to debug some more ... you can use the 2.1 patch attached
I do think we should debug more.

bq. I am receiving str objects rather than bytearray objects for blob db values.
I'm sorry I didn't key in on this earlier, but the driver never returns bytearray for blob
types. It returns the binary type of the runtime ({{str}} for Python 2, {{bytes}} for Python
3). I don't think this should change.
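
A minimal sketch of code that handles a blob value without caring which binary type it arrives as. The helper name {{blob_to_hex}} is hypothetical and is not taken from cqlsh or the driver; it only illustrates that {{str}} (Python 2), {{bytes}} (Python 3), and {{bytearray}} can all be normalized the same way.

```python
import binascii


def blob_to_hex(value):
    """Render a blob value as a hex literal of the form 0x...

    Hypothetical helper for illustration only. It accepts the runtime's
    native binary type (str on Python 2, bytes on Python 3) as well as
    bytearray, so the caller does not depend on which one it receives.
    """
    # bytes(...) copies a bytearray into the native binary type and
    # leaves an existing bytes/str value unchanged in content.
    raw = bytes(value)
    return "0x" + binascii.hexlify(raw).decode("ascii")


if __name__ == "__main__":
    print(blob_to_hex(b"\x00\xff"))
    print(blob_to_hex(bytearray(b"\x00\xff")))
```

Normalizing at the point of use like this avoids depending on a patched deserializer being active in every process.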

I suspect that you're losing this patch when the child processes are created:
https://github.com/apache/cassandra/blob/f62d6e407d859c0c8da5283902a5153e4bd67b5c/bin/cqlsh#L678-L680

I'm sure you're aware, but the {{multiprocessing}} module has some [Windows-related idiosyncrasies|https://docs.python.org/2/library/multiprocessing.html?highlight=windows].
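
A rough sketch of one way to keep a runtime patch alive in child processes, assuming the patch is a monkey patch applied after import. {{BlobCodec}}, {{apply_patch}}, {{decoded_type_name}}, and {{run_workers}} are all hypothetical stand-ins, not cqlsh or driver code. On Windows, {{multiprocessing}} starts each child in a fresh interpreter rather than forking, so state set at runtime in the parent is not inherited; passing the patch function as the pool {{initializer}} re-applies it in every worker.

```python
import multiprocessing


class BlobCodec(object):
    """Hypothetical stand-in for a type whose deserializer gets patched."""

    @staticmethod
    def deserialize(raw):
        # Unpatched behaviour: return the runtime's native binary type.
        return bytes(raw)


def apply_patch():
    """Runtime monkey patch (assumed shape): return bytearray instead."""
    BlobCodec.deserialize = staticmethod(lambda raw: bytearray(raw))


def decoded_type_name(raw):
    """Runs in a worker and reports which type the deserializer produced."""
    return type(BlobCodec.deserialize(raw)).__name__


def run_workers(patch_children):
    """Map decoded_type_name over two sample blobs in a small pool.

    When patch_children is true, apply_patch runs once in each worker
    as it starts, regardless of how the worker process was created.
    """
    initializer = apply_patch if patch_children else None
    pool = multiprocessing.Pool(processes=2, initializer=initializer)
    try:
        return pool.map(decoded_type_name, [b"\x00\x01", b"\x02"])
    finally:
        pool.close()
        pool.join()
```

On Windows, a call such as {{run_workers(True)}} has to sit under an {{if __name__ == '__main__':}} guard, because the child re-imports the main module on startup.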

> COPY TO improvements
> --------------------
>
>                 Key: CASSANDRA-9304
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-9304
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Jonathan Ellis
>            Assignee: Stefania
>            Priority: Minor
>              Labels: cqlsh
>             Fix For: 3.x, 2.1.x, 2.2.x
>
>
> COPY FROM has gotten a lot of love.  COPY TO not so much.  One obvious improvement could
> be to parallelize reading and writing (write one page of data while fetching the next).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
