hadoop-hdfs-issues mailing list archives

From "Rakesh R (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-11338) [SPS]: Fix timeout issue in unit tests caused by longger NN down time
Date Mon, 10 Apr 2017 04:05:41 GMT

    [ https://issues.apache.org/jira/browse/HDFS-11338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15962408#comment-15962408 ]

Rakesh R commented on HDFS-11338:
---------------------------------

Thanks [~umamaheswararao] for the offline discussions.

IMHO, instead of increasing the per-test-case timeout, how about reducing the impact
of including the SPS module? The attached patch is an attempt to change the interrupt and
thread-join sequence to avoid the {{>3secs}} extra waiting period on every SPS stop operation.
I can see that the {{TestDFSStripedOutputStreamWithFailure#runTestWithMultipleFailure}} test
logic iterates approximately 16 times. During each iteration it calls the {{setup}} and
{{teardown}} functions (which internally start and stop the NN server). IIUC, this adds an extra
timed waiting period of 16 * 3secs = 48secs and causes some of these test failures.
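
To illustrate the ordering point, here is a minimal, self-contained sketch (not the attached patch). It assumes the extra wait comes from joining the worker thread before it has been interrupted; {{StopOrderSketch}}, {{startWorker}} and {{timeStop}} are hypothetical names used only for this illustration.

```java
public class StopOrderSketch {

  /** Hypothetical stand-in for the SPS worker: blocks until interrupted. */
  static Thread startWorker() {
    Thread t = new Thread(() -> {
      try {
        while (true) {
          Thread.sleep(60_000); // simulate waiting for work
        }
      } catch (InterruptedException e) {
        // Interrupted: fall through and let the thread exit.
      }
    }, "sps-sketch-worker");
    t.setDaemon(true);
    t.start();
    return t;
  }

  /** Join without a checked exception; restores the interrupt flag if hit. */
  static void joinQuietly(Thread t, long millis) {
    try {
      t.join(millis); // millis == 0 means wait until the thread exits
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
    }
  }

  /** Starts a fresh worker, stops it, and returns the stop time in ms. */
  static long timeStop(boolean interruptFirst, long joinTimeoutMs) {
    Thread worker = startWorker();
    long start = System.nanoTime();
    if (interruptFirst) {
      worker.interrupt();                 // wake the worker first ...
      joinQuietly(worker, joinTimeoutMs); // ... so join returns quickly
    } else {
      joinQuietly(worker, joinTimeoutMs); // waits out the whole timeout
      worker.interrupt();
      joinQuietly(worker, 0);
    }
    return (System.nanoTime() - start) / 1_000_000;
  }

  public static void main(String[] args) {
    System.out.println("join, then interrupt: " + timeStop(false, 3000) + " ms");
    System.out.println("interrupt, then join: " + timeStop(true, 3000) + " ms");
    // 16 setup/teardown iterations, each paying the 3-second join timeout.
    System.out.println("extra wait over 16 iterations: " + (16 * 3) + " secs");
  }
}
```

In this sketch the join-first order pays the full join timeout on every stop, while the interrupt-first order returns as soon as the worker notices the interrupt.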

Let's see whether the test cases improve in the Jenkins run.

> [SPS]: Fix timeout issue in unit tests caused by longger NN down time
> ---------------------------------------------------------------------
>
>                 Key: HDFS-11338
>                 URL: https://issues.apache.org/jira/browse/HDFS-11338
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: datanode, namenode
>            Reporter: Wei Zhou
>            Assignee: Wei Zhou
>         Attachments: HDFS-11338-HDFS-10285.00.patch, HDFS-11338-HDFS-10285.01.patch, HDFS-11338-HDFS-10285-02.patch
>
>
> As discussed in HDFS-11186, it takes longer to stop NN:
> {code}
> try {
>   storagePolicySatisfierThread.join(3000);
> } catch (InterruptedException ie) {
> }
> {code}
> So some tests take longer to finish, which leads to the timeout failures.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org

