hive-issues mailing list archives

From "Hive QA (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-13014) RetryingMetaStoreClient is retrying too aggressively
Date Fri, 20 Jan 2017 20:38:26 GMT

    [ https://issues.apache.org/jira/browse/HIVE-13014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15832372#comment-15832372 ]

Hive QA commented on HIVE-13014:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12848366/HIVE-13014.07.patch

{color:green}SUCCESS:{color} +1 due to 3 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 10 failed/errored test(s), 10968 tests executed
*Failed tests:*
{noformat}
TestDerbyConnector - did not produce a TEST-*.xml file (likely timed out) (batchId=235)
org.apache.hadoop.hive.cli.TestEncryptedHDFSCliDriver.testCliDriver[encryption_join_with_different_encryption_keys] (batchId=159)
org.apache.hadoop.hive.cli.TestHBaseNegativeCliDriver.testCliDriver[cascade_dbdrop] (batchId=226)
org.apache.hadoop.hive.cli.TestHBaseNegativeCliDriver.testCliDriver[generatehfiles_require_family_path] (batchId=226)
org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[orc_ppd_basic] (batchId=135)
org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[orc_ppd_schema_evol_3a] (batchId=136)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[escape1] (batchId=139)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[escape2] (batchId=154)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[limit_pushdown3] (batchId=144)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_text_vec_part] (batchId=149)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/3073/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/3073/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-3073/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 10 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12848366 - PreCommit-HIVE-Build

> RetryingMetaStoreClient is retrying too aggressively
> ----------------------------------------------------
>
>                 Key: HIVE-13014
>                 URL: https://issues.apache.org/jira/browse/HIVE-13014
>             Project: Hive
>          Issue Type: Bug
>          Components: Metastore, Transactions
>    Affects Versions: 1.0.0
>            Reporter: Eugene Koifman
>            Assignee: Eugene Koifman
>            Priority: Critical
>         Attachments: HIVE-13014.01.patch, HIVE-13014.02.patch, HIVE-13014.03.patch, HIVE-13014.04.patch, HIVE-13014.05.patch, HIVE-13014.06.patch, HIVE-13014.07.patch
>
>
> Not all metastore operations are idempotent.  For example, commit_txn() consists of 
> 1. request from client to server
> 2. server action
> 3. ack to client
> If the network connection is broken after (or during) step 2 but before step 3 happens,
> RetryingMetaStoreClient will retry the operation, causing an attempt to commit the same
> txn twice (sometimes concurrently).
> The 2nd attempt is guaranteed to fail and thus returns an error to the caller (which doesn't
> know the operation is being retried), while the first attempt has actually succeeded.  Thus
> the caller thinks the commit failed and will likely attempt to redo the transaction, which is
> not what we want in most cases.
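
The failure mode in the quoted description can be reproduced with a small simulation. The sketch below is illustrative only: TxnServer, FlakyChannel, AckLost and call_with_retry are hypothetical names, not Hive classes, and the idempotent flag is an assumed mitigation rather than a description of what the attached patches do. It shows a blind retry turning a successful commit into an error, and a retry wrapper that refuses to replay a non-idempotent call surfacing the lost ack instead.

```python
class TxnServer:
    """Toy stand-in for the metastore; commit_txn is not idempotent."""

    def __init__(self):
        self.open_txns = {1}
        self.committed = set()

    def commit_txn(self, txn_id):
        if txn_id not in self.open_txns:
            raise ValueError(f"txn {txn_id} is not open")
        self.open_txns.remove(txn_id)
        self.committed.add(txn_id)


class AckLost(Exception):
    """Connection dropped after the server acted but before the ack arrived."""


class FlakyChannel:
    """Delivers every request to the server but loses the first `drops` acks."""

    def __init__(self, server, drops=1):
        self.server = server
        self.drops = drops

    def call(self, method, *args):
        getattr(self.server, method)(*args)  # step 2: server action
        if self.drops > 0:
            self.drops -= 1
            raise AckLost(method)            # step 3 (ack) never reaches the client


def call_with_retry(channel, method, *args, idempotent, attempts=3):
    """Retry on a lost ack only when the caller declares the call idempotent."""
    for attempt in range(attempts):
        try:
            return channel.call(method, *args)
        except AckLost:
            if not idempotent or attempt == attempts - 1:
                raise  # outcome unknown: let the caller decide what to do


def demo(idempotent):
    """Return (what the caller observed, whether the server committed txn 1)."""
    server = TxnServer()
    channel = FlakyChannel(server)
    try:
        call_with_retry(channel, "commit_txn", 1, idempotent=idempotent)
        outcome = "ok"
    except Exception as exc:
        outcome = type(exc).__name__
    return outcome, 1 in server.committed


if __name__ == "__main__":
    print("blind retry:   ", demo(idempotent=True))
    print("no blind retry:", demo(idempotent=False))
```

In both runs the server has committed the transaction. With the blind retry the caller sees the second attempt's "not open" error and concludes the commit failed; without it the caller sees a connection-level error and can check the transaction's state before redoing any work.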



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
