[ https://issues.apache.org/jira/browse/SQOOP-360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
SATOSHI KONDO updated SQOOP-360:
--------------------------------
Attachment: SQOOP-360.patch
I maked patch for PostgreSQL.
I used ctid.
ctid is the physical location of the row version within its table.
The sqoop command is the following.
$sqoop import --connect jdbc:postgresql://dbip:port/dbname --username username --password
password --table test --num-mappers 100 --split-by ctid --pgctid
> Duplication of the record in the case of using import mode
> -----------------------------------------------------------
>
> Key: SQOOP-360
> URL: https://issues.apache.org/jira/browse/SQOOP-360
> Project: Sqoop
> Issue Type: Bug
> Components: connectors
> Affects Versions: 1.3.0
> Reporter: SATOSHI KONDO
> Labels: patch
> Attachments: SQOOP-360.patch
>
>
> When I use Import mode of Sqoop,
> I get duplicate records.
> This occurs on condition of the following.
> 1.Use Import mode of Sqoop
> 2.Set a character type to "split-by" parameter
> 3.Set a big value to "num-mappers" parameter
> The big value is a relatively large thing as compared with the total number of records.
> For example,
> when total number of records is 10,
> I set 100 to "num-mappers" parameter.
> I expect to get 10 records.
> But I get 10 or more lines.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
|