sqoop-dev mailing list archives

From "SATOSHI KONDO (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SQOOP-360) Duplication of the record in the case of using import mode
Date Wed, 02 Nov 2011 12:03:32 GMT

     [ https://issues.apache.org/jira/browse/SQOOP-360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel


    Comment: was deleted

(was: I made a patch for PostgreSQL.
It uses ctid, which is the physical location of the row version within its table.

The Sqoop command is as follows:

$ sqoop import --connect jdbc:postgresql://dbip:port/dbname --username username --password password \
    --table test --num-mappers 100 --split-by ctid --pgctid
> Duplication of the record in the case of using import mode 
> -----------------------------------------------------------
>                 Key: SQOOP-360
>                 URL: https://issues.apache.org/jira/browse/SQOOP-360
>             Project: Sqoop
>          Issue Type: Bug
>          Components: connectors
>    Affects Versions: 1.3.0
>            Reporter: SATOSHI KONDO
>              Labels: patch
>         Attachments: SQOOP-360.patch
> When I use Sqoop's import mode,
> I get duplicate records.
> This occurs under the following conditions:
> 1. Use Sqoop's import mode.
> 2. Set the "split-by" parameter to a character-type column.
> 3. Set the "num-mappers" parameter to a big value.
> "Big" here means large relative to the total number of records.

> For example,
> when the total number of records is 10,
> I set the "num-mappers" parameter to 100.
> I expect to get 10 records,
> but I get 10 or more lines.
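
One way to confirm the symptom described above is to count repeated lines in the imported output. The following is a minimal Python sketch, not part of Sqoop or of the attached patch; the function name `find_duplicates` and the sample records are hypothetical and stand in for the 10-row table from the example.

```python
from collections import Counter


def find_duplicates(records):
    """Return {record: count} for every record that appears more than once."""
    counts = Counter(records)
    return {rec: n for rec, n in counts.items() if n > 1}


# Hypothetical data: 10 source rows, two of which were imported twice.
expected = [f"{i},name{i}" for i in range(1, 11)]
imported = expected + ["3,name3", "7,name7"]

dups = find_duplicates(imported)
print(len(imported), "lines imported;", len(expected), "expected")
for rec, n in sorted(dups.items()):
    print(f"duplicate: {rec!r} x{n}")
```

In practice the `imported` list would be read from the files the import wrote, assuming one record per line.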
