Dmitry Zagorulkin created SQOOP-3123:
----------------------------------------
Summary: Import from oracle using oraoop with map-column-java to avro fails if
special characters encounter in table name or column name
Key: SQOOP-3123
URL: https://issues.apache.org/jira/browse/SQOOP-3123
Project: Sqoop
Issue Type: Bug
Components: connectors/oracle
Affects Versions: 1.4.6, 1.4.7
Reporter: Dmitry Zagorulkin
I'm trying to import data from Oracle to Avro using OraOop.
My table:
{code}
CREATE TABLE "IBS"."BRITISH#CATS"
( "ID" NUMBER,
"C_CODE" VARCHAR2(10),
"C_USE_START#DATE" DATE,
"C_USE_USE#NEXT_DAY" VARCHAR2(1),
"C_LIM_MIN#DAT" DATE,
"C_LIM_MIN#TIME" TIMESTAMP,
"C_LIM_MIN#SUM" NUMBER,
"C_OWNCODE" VARCHAR2(1),
"C_LIMIT#SUM_LIMIT" NUMBER(17,2),
"C_L@M" NUMBER(17,2),
"C_1_THROW" NUMBER NOT NULL ENABLE,
"C_#_LIMITS" NUMBER NOT NULL ENABLE
) SEGMENT CREATION IMMEDIATE
PCTFREE 70 PCTUSED 40 INITRANS 2 MAXTRANS 255
NOCOMPRESS LOGGING
STORAGE(INITIAL 2097152 NEXT 524288 MINEXTENTS 1 MAXEXTENTS 2147483645
PCTINCREASE 0 FREELISTS 1 FREELIST GROUPS 1
BUFFER_POOL DEFAULT FLASH_CACHE DEFAULT CELL_FLASH_CACHE DEFAULT)
TABLESPACE "WORK" ;
{code}
My first script is:
{code}
./sqoop import \
-Doraoop.timestamp.string=false \
--direct \
--connect jdbc:oracle:thin:@localhost:49161:XE \
--username system \
--password oracle \
--table IBS.BRITISH#CATS \
--target-dir /Users/Dmitry/Developer/Java/sqoop/bin/imported \
--as-avrodatafile \
--map-column-java ID=String,C_CODE=String,C_USE_START#DATE=String,C_USE_USE#NEXT_DAY=String,C_LIM_MIN#DAT=String,C_LIM_MIN#TIME=String,C_LIM_MIN#SUM=String,C_OWNCODE=String,C_LIMIT#SUM_LIMIT=String,C_L_M=String,C_1_THROW=String,C_#_LIMITS=String
{code}
It fails with:
{code}
2017-01-13 16:11:21,348 ERROR [main] tool.ImportTool (ImportTool.java:run(625)) - Import failed:
No column by the name C_LIMIT#SUM_LIMITfound while importing data; expecting one of [C_LIMIT_SUM_LIMIT,
C_OWNCODE, C_L_M, C___LIMITS, C_LIM_MIN_DAT, C_1_THROW, C_CODE, C_USE_START_DATE, C_LIM_MIN_SUM,
ID, C_LIM_MIN_TIME, C_USE_USE_NEXT_DAY]
{code}
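Comparing the names in the error above with the table definition suggests that every character other than an ASCII letter, digit, or underscore is turned into an underscore. Below is a minimal Python sketch of that mapping. The rule is inferred from the log output only and is not taken from Sqoop's source; `sanitize_identifier` and `build_map_column_java` are hypothetical helper names used for illustration.

```python
import re

def sanitize_identifier(name):
    # Assumption inferred from the error message: anything outside
    # [A-Za-z0-9_] is replaced with a single underscore.
    return re.sub(r"[^A-Za-z0-9_]", "_", name)

def build_map_column_java(columns, java_type="String"):
    # Build the value for --map-column-java from the original Oracle names.
    return ",".join(f"{sanitize_identifier(c)}={java_type}" for c in columns)

if __name__ == "__main__":
    columns = [
        "ID", "C_CODE", "C_USE_START#DATE", "C_USE_USE#NEXT_DAY",
        "C_LIM_MIN#DAT", "C_LIM_MIN#TIME", "C_LIM_MIN#SUM", "C_OWNCODE",
        "C_LIMIT#SUM_LIMIT", "C_L@M", "C_1_THROW", "C_#_LIMITS",
    ]
    print(build_map_column_java(columns))
```

For the columns of "IBS"."BRITISH#CATS" this yields the same names that the error message lists as expected, for example C_L_M and C___LIMITS.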
After that, I found that Sqoop replaces all special characters with underscores. My second
script is:
{code}
./sqoop import \
-D oraoop.timestamp.string=false \
--direct \
--connect jdbc:oracle:thin:@localhost:49161:XE \
--username system \
--password oracle \
--table IBS.BRITISH#CATS \
--target-dir /Users/Dmitry/Developer/Java/sqoop/bin/imported \
--as-avrodatafile \
--map-column-java ID=String,C_CODE=String,C_USE_START_DATE=String,C_USE_USE_NEXT_DAY=String,C_LIM_MIN_DAT=String,C_LIM_MIN_TIME=String,C_LIM_MIN_SUM=String,C_OWNCODE=String,C_LIMIT_SUM_LIMIT=String,C_L_M=String,C_1_THROW=String,C___LIMITS=String \
--verbose
{code}
This one fails with org.apache.avro.UnresolvedUnionException: Not in union ["null","long"]:
2017-01-13 11:22:53.0
{code}
2017-01-13 16:14:54,687 WARN [Thread-26] mapred.LocalJobRunner (LocalJobRunner.java:run(560)) - job_local1372531461_0001
java.lang.Exception: org.apache.avro.file.DataFileWriter$AppendWriteException: org.apache.avro.UnresolvedUnionException: Not in union ["null","long"]: 2017-01-13 11:22:53.0
at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:522)
Caused by: org.apache.avro.file.DataFileWriter$AppendWriteException: org.apache.avro.UnresolvedUnionException: Not in union ["null","long"]: 2017-01-13 11:22:53.0
at org.apache.avro.file.DataFileWriter.append(DataFileWriter.java:308)
at org.apache.sqoop.mapreduce.AvroOutputFormat$1.write(AvroOutputFormat.java:112)
at org.apache.sqoop.mapreduce.AvroOutputFormat$1.write(AvroOutputFormat.java:108)
at org.apache.hadoop.mapred.MapTask$NewDirectOutputCollector.write(MapTask.java:655)
at org.apache.hadoop.mapreduce.task.TaskInputOutputContextImpl.write(TaskInputOutputContextImpl.java:89)
at org.apache.hadoop.mapreduce.lib.map.WrappedMapper$Context.write(WrappedMapper.java:112)
at org.apache.sqoop.mapreduce.AvroImportMapper.map(AvroImportMapper.java:73)
at org.apache.sqoop.mapreduce.AvroImportMapper.map(AvroImportMapper.java:39)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:145)
at org.apache.sqoop.mapreduce.AutoProgressMapper.run(AutoProgressMapper.java:64)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:784)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:243)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.avro.UnresolvedUnionException: Not in union ["null","long"]: 2017-01-13 11:22:53.0
at org.apache.avro.generic.GenericData.resolveUnion(GenericData.java:709)
at org.apache.avro.generic.GenericDatumWriter.resolveUnion(GenericDatumWriter.java:192)
at org.apache.avro.generic.GenericDatumWriter.writeWithoutConversion(GenericDatumWriter.java:110)
at org.apache.avro.generic.GenericDatumWriter.write(GenericDatumWriter.java:73)
at org.apache.avro.reflect.ReflectDatumWriter.write(ReflectDatumWriter.java:150)
at org.apache.avro.generic.GenericDatumWriter.writeField(GenericDatumWriter.java:153)
at org.apache.avro.specific.SpecificDatumWriter.writeField(SpecificDatumWriter.java:90)
at org.apache.avro.reflect.ReflectDatumWriter.writeField(ReflectDatumWriter.java:182)
at org.apache.avro.generic.GenericDatumWriter.writeRecord(GenericDatumWriter.java:143)
at org.apache.avro.generic.GenericDatumWriter.writeWithoutConversion(GenericDatumWriter.java:105)
at org.apache.avro.generic.GenericDatumWriter.write(GenericDatumWriter.java:73)
at org.apache.avro.reflect.ReflectDatumWriter.write(ReflectDatumWriter.java:150)
at org.apache.avro.generic.GenericDatumWriter.write(GenericDatumWriter.java:60)
at org.apache.avro.file.DataFileWriter.append(DataFileWriter.java:302)
... 17 more
{code}
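The exception reads as if the Avro schema declares the timestamp field as the union ["null","long"] while the record still carries the string "2017-01-13 11:22:53.0". The sketch below is a simplified Python model of that mismatch, not Avro's actual resolution code. `resolve_union` and `to_epoch_millis` are hypothetical helpers, and treating the timestamp as UTC is an assumption made only for the example.

```python
from datetime import datetime, timezone

def matches(branch, value):
    # Very small subset of Avro primitive types, enough for this case.
    if branch == "null":
        return value is None
    if branch == "long":
        return isinstance(value, int) and not isinstance(value, bool)
    if branch == "string":
        return isinstance(value, str)
    return False

def resolve_union(branches, value):
    # Return the index of the first branch that accepts the value.
    for index, branch in enumerate(branches):
        if matches(branch, value):
            return index
    raise ValueError(f"Not in union {branches}: {value}")

def to_epoch_millis(text):
    # Assumption: the timestamp string is interpreted as UTC.
    parsed = datetime.strptime(text, "%Y-%m-%d %H:%M:%S.%f")
    return int(parsed.replace(tzinfo=timezone.utc).timestamp() * 1000)

if __name__ == "__main__":
    union = ["null", "long"]
    value = "2017-01-13 11:22:53.0"
    try:
        resolve_union(union, value)
    except ValueError as error:
        print(error)  # the string matches neither "null" nor "long"
    # A long (epoch milliseconds) does fit the second branch.
    print(resolve_union(union, to_epoch_millis(value)))
```

In this model the write succeeds only if the value is converted to a long before it is written, or if the schema for that field is a string union instead.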
I've found that this is an old problem and that setting "oraoop.timestamp.string=false" is supposed
to solve it, but it does not.
What do you think?
Also, please assign this issue to me.