couchdb-dev mailing list archives

From "Jan Lehnardt (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (COUCHDB-1986) 04-replication-large-atts.t times out
Date Mon, 16 Dec 2013 16:19:10 GMT

    [ https://issues.apache.org/jira/browse/COUCHDB-1986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13849272#comment-13849272 ]

Jan Lehnardt commented on COUCHDB-1986:
---------------------------------------

Some more testing today: bumping the test's timeout for waiting on replication to finish from
300000 to 600000 makes this pass again.

Looking at the request log, I can see that PUTting docs to the remote db takes disproportionately
long (1 minute plus a few seconds each):

[Mon, 16 Dec 2013 12:23:27 GMT] [info] [<0.321.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/_local/8831f8738b3333d5d17e00da71581b23 201
[Mon, 16 Dec 2013 12:24:30 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc5?new_edits=false 201
[Mon, 16 Dec 2013 12:25:38 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc4?new_edits=false 201
[Mon, 16 Dec 2013 12:26:51 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc3?new_edits=false 201
[Mon, 16 Dec 2013 12:27:58 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc2?new_edits=false 201
[Mon, 16 Dec 2013 12:29:06 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc11?new_edits=false 201
[Mon, 16 Dec 2013 12:30:13 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc10?new_edits=false 201
[Mon, 16 Dec 2013 12:31:20 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc1?new_edits=false 201
[Mon, 16 Dec 2013 12:31:20 GMT] [info] [<0.322.0>] 127.0.0.1 - - POST /couch_test_rep_db_b/_ensure_full_commit 201

After the last one all other requests come in at expected speeds.
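
To put numbers on the gaps, here is a rough sketch in Python that parses the log lines above and prints the seconds elapsed between consecutive requests. The regular expression and the helper names (`parse`, `gaps`) are made up for this illustration and assume the line layout shown in the excerpt; they are not part of CouchDB or its test suite.

```python
import re
from email.utils import parsedate_to_datetime

# Log lines copied from the excerpt in this message.
LOG = """\
[Mon, 16 Dec 2013 12:23:27 GMT] [info] [<0.321.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/_local/8831f8738b3333d5d17e00da71581b23 201
[Mon, 16 Dec 2013 12:24:30 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc5?new_edits=false 201
[Mon, 16 Dec 2013 12:25:38 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc4?new_edits=false 201
[Mon, 16 Dec 2013 12:26:51 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc3?new_edits=false 201
[Mon, 16 Dec 2013 12:27:58 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc2?new_edits=false 201
[Mon, 16 Dec 2013 12:29:06 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc11?new_edits=false 201
[Mon, 16 Dec 2013 12:30:13 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc10?new_edits=false 201
[Mon, 16 Dec 2013 12:31:20 GMT] [info] [<0.322.0>] 127.0.0.1 - - PUT /couch_test_rep_db_b/doc1?new_edits=false 201
[Mon, 16 Dec 2013 12:31:20 GMT] [info] [<0.322.0>] 127.0.0.1 - - POST /couch_test_rep_db_b/_ensure_full_commit 201
"""

# Hypothetical pattern for the layout above: [timestamp] [level] [pid] ip - - METHOD path status
LINE_RE = re.compile(
    r"^\[(?P<ts>[^\]]+)\] \[\w+\] \[[^\]]+\] \S+ - - "
    r"(?P<method>\S+) (?P<path>\S+) (?P<status>\d{3})$"
)


def parse(log):
    """Return (timestamp, method, path) for every line that matches LINE_RE."""
    entries = []
    for line in log.splitlines():
        m = LINE_RE.match(line.strip())
        if m:
            # parsedate_to_datetime handles RFC 2822 dates independent of locale.
            entries.append((parsedate_to_datetime(m["ts"]), m["method"], m["path"]))
    return entries


def gaps(entries):
    """Return (path, seconds since the previous logged request) pairs."""
    return [
        (path, int((cur - prev).total_seconds()))
        for (prev, _, _), (cur, _, path) in zip(entries, entries[1:])
    ]


entries = parse(LOG)
deltas = gaps(entries)
for path, seconds in deltas:
    print(f"{seconds:4d}s  {path}")
print("total:", sum(seconds for _, seconds in deltas), "s")
```

On the lines above, each document PUT lands 63 to 73 seconds after the previous request, 473 seconds in total. If the test timeout is in milliseconds (an assumption, the unit is not stated here), that would fall between the 300000 and 600000 values.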

I’m not too familiar with this part of the replicator code, so I couldn’t quickly find out
either why the requests take that long or why the replicator waits a minute before it continues.

Another thing I tried was seeing whether the new R16B scheduling behaviour introduced this,
but running the test with

> ERL_FLAGS="+sws legacy" ./test/etap/run -v src/couch_replicator/test/04-replication-large-atts.t

and the original timeout still times out.


> 04-replication-large-atts.t times out
> -------------------------------------
>
>                 Key: COUCHDB-1986
>                 URL: https://issues.apache.org/jira/browse/COUCHDB-1986
>             Project: CouchDB
>          Issue Type: Bug
>          Components: Replication
>    Affects Versions: 1.5.0
>            Reporter: Jan Lehnardt
>
> 04-replication-large-atts.t gets stuck around 558, sometimes a little earlier or later,
> but it times out eventually, regardless of the timeout. I tried doubling and such.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)
