hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xiao Chen (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (HDFS-12383) Re-encryption updater should handle canceled tasks better
Date Fri, 01 Sep 2017 03:45:00 GMT

    [ https://issues.apache.org/jira/browse/HDFS-12383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16149993#comment-16149993
] 

Xiao Chen edited comment on HDFS-12383 at 9/1/17 3:44 AM:
----------------------------------------------------------

Test failure unrelated, just triggered another run in case.
Also found TestCryptoAdminCLI is flaky, will fix separately.


was (Author: xiaochen):
Test failure unrelated, just triggered another run in case.

> Re-encryption updater should handle canceled tasks better
> ---------------------------------------------------------
>
>                 Key: HDFS-12383
>                 URL: https://issues.apache.org/jira/browse/HDFS-12383
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: encryption
>    Affects Versions: 3.0.0-beta1
>            Reporter: Xiao Chen
>            Assignee: Xiao Chen
>         Attachments: HDFS-12383.01.patch, HDFS-12383.02.patch
>
>
> Seen an instance where the re-encryption updater exited due to an exception, and later
tasks no longer executes. Logs below:
> {noformat}
> 2017-08-31 09:54:08,104 INFO org.apache.hadoop.hdfs.server.namenode.EncryptionZoneManager:
Zone /tmp/encryption-zone-3(16819) is submitted for re-encryption.
> 2017-08-31 09:54:08,104 INFO org.apache.hadoop.hdfs.server.namenode.ReencryptionHandler:
Executing re-encrypt commands on zone 16819. Current zones:[zone:16787 state:Completed lastProcessed:null
filesReencrypted:1 fileReencryptionFailures:0][zone:16813 state:Completed lastProcessed:null
filesReencrypted:1 fileReencryptionFailures:0][zone:16819 state:Submitted lastProcessed:null
filesReencrypted:0 fileReencryptionFailures:0]
> 2017-08-31 09:54:08,105 INFO org.apache.hadoop.hdfs.protocol.ReencryptionStatus: Zone
16819 starts re-encryption processing
> 2017-08-31 09:54:08,105 INFO org.apache.hadoop.hdfs.server.namenode.ReencryptionHandler:
Re-encrypting zone /tmp/encryption-zone-3(id=16819)
> 2017-08-31 09:54:08,105 INFO org.apache.hadoop.hdfs.server.namenode.ReencryptionHandler:
Submitted batch (start:/tmp/encryption-zone-3/data1, size:1) of zone 16819 to re-encrypt.
> 2017-08-31 09:54:08,105 INFO org.apache.hadoop.hdfs.server.namenode.ReencryptionHandler:
Submission completed of zone 16819 for re-encryption.
> 2017-08-31 09:54:08,105 INFO org.apache.hadoop.hdfs.server.namenode.ReencryptionHandler:
Processing batched re-encryption for zone 16819, batch size 1, start:/tmp/encryption-zone-3/data1
> 2017-08-31 09:54:08,979 INFO BlockStateChange: BLOCK* BlockManager: ask 172.26.1.71:20002
to delete [blk_1073742291_1467]
> 2017-08-31 09:54:18,295 INFO org.apache.hadoop.hdfs.server.namenode.ReencryptionUpdater:
Cancelling 1 re-encryption tasks
> 2017-08-31 09:54:18,295 INFO org.apache.hadoop.hdfs.server.namenode.EncryptionZoneManager:
Cancelled zone /tmp/encryption-zone-3(16819) for re-encryption.
> 2017-08-31 09:54:18,295 INFO org.apache.hadoop.hdfs.protocol.ReencryptionStatus: Zone
16819 completed re-encryption.
> 2017-08-31 09:54:18,296 INFO org.apache.hadoop.hdfs.server.namenode.ReencryptionHandler:
Completed re-encrypting one batch of 1 edeks from KMS, time consumed: 10.19 s, start: /tmp/encryption-zone-3/data1.
> 2017-08-31 09:54:18,296 ERROR org.apache.hadoop.hdfs.server.namenode.ReencryptionUpdater:
Re-encryption updater thread exiting.
> java.util.concurrent.CancellationException
>         at java.util.concurrent.FutureTask.report(FutureTask.java:121)
>         at java.util.concurrent.FutureTask.get(FutureTask.java:192)
>         at org.apache.hadoop.hdfs.server.namenode.ReencryptionUpdater.takeAndProcessTasks(ReencryptionUpdater.java:404)
>         at org.apache.hadoop.hdfs.server.namenode.ReencryptionUpdater.run(ReencryptionUpdater.java:250)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>         at java.lang.Thread.run(Thread.java:745)
> {noformat}
> Updater should be fixed to handle canceled tasks better.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org


Mime
View raw message