hudi-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From GitBox <...@apache.org>
Subject [GitHub] [incubator-hudi] RonBarabash commented on issue #1202: [SUPPORT] com.uber.hoodie.exception.HoodieIOException: IOException when reading log file + Corrupted Log File
Date Tue, 14 Jan 2020 13:11:12 GMT
RonBarabash commented on issue #1202: [SUPPORT] com.uber.hoodie.exception.HoodieIOException:
IOException when reading log file + Corrupted Log File
URL: https://github.com/apache/incubator-hudi/issues/1202#issuecomment-574166330
 
 
   Input -> kafka, CDC events logs generated by debezium
   Processing -> spark structured streaming, we do some spark sql on the events
   Output -> Writing using spark to s3 with Hudi MergeOnRead
   this is are the hudi config:
   ``` "options": {
           "hoodie.compaction.strategy": "com.uber.hoodie.io.compact.strategy.UnBoundedCompactionStrategy",
           "hoodie.fail.on.timeline.archiving": "false",
           "hoodie.cleaner.commits.retained": "1",
           "hoodie.datasource.hive_sync.enable": "false",
           "hoodie.copyonwrite.record.size.estimate": "60",
           "hoodie.copyonwrite.insert.auto.split": "true",
           "hoodie.parquet.compression.codec": "snappy",
           "hoodie.index.bloom.num_entries": "1000000",
           "hoodie.compact.inline.max.delta.commits": "1",
         }```

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

Mime
View raw message