Date: Wed, 21 Aug 2019 16:59:00 +0000 (UTC)
From: "Michael Han (Jira)"
To: issues@zookeeper.apache.org
Reply-To: dev@zookeeper.apache.org
Subject: [jira] [Commented] (ZOOKEEPER-3513) Zookeeper upgrade fails due to missing snapshots

    [ https://issues.apache.org/jira/browse/ZOOKEEPER-3513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16912520#comment-16912520 ]

Michael Han commented on
ZOOKEEPER-3513:
----------------------------------------

There is a workaround described in ZOOKEEPER-3056 that uses an empty snapshot file. It is not ideal, but it should unblock any upgrade that failed due to a missing snapshot.

> Zookeeper upgrade fails due to missing snapshots
> ------------------------------------------------
>
>                 Key: ZOOKEEPER-3513
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3513
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 3.5.4, 3.6.0
>            Reporter: Stephan Huttenhuis
>            Priority: Major
>
> In ZOOKEEPER-2325 a check was added that requires a snapshot when loading data. We have been running 3-node ensembles on Zookeeper 3.4.13 for about 5 months for use with Solr Cloud. During this time some ensembles created a few snapshots but others didn't generate any. Because of this, upgrading to e.g. 3.5.5 fails.
> Either it is perfectly possible for Zookeeper data to have no snapshots or something is going wrong with generating snapshots. The ensembles are straightforward.
> - The following stack occurs:
> {noformat}
> java.io.IOException: No snapshot found, but there are log entries. Something is broken!
> 	at org.apache.zookeeper.server.persistence.FileTxnSnapLog.restore(FileTxnSnapLog.java:211)
> 	at org.apache.zookeeper.server.ZKDatabase.loadDataBase(ZKDatabase.java:240)
> 	at org.apache.zookeeper.server.ZooKeeperServer.loadData(ZooKeeperServer.java:290)
> 	at org.apache.zookeeper.server.ZooKeeperServer.startdata(ZooKeeperServer.java:450)
> 	at org.apache.zookeeper.server.NIOServerCnxnFactory.startup(NIOServerCnxnFactory.java:764)
> 	at org.apache.zookeeper.server.ServerCnxnFactory.startup(ServerCnxnFactory.java:98)
> 	at org.apache.zookeeper.server.ZooKeeperServerMain.runFromConfig(ZooKeeperServerMain.java:144)
> 	at org.apache.zookeeper.server.ZooKeeperServerMain.initializeAndRun(ZooKeeperServerMain.java:106)
> 	at org.apache.zookeeper.server.ZooKeeperServerMain.main(ZooKeeperServerMain.java:64)
> 	at org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:128)
> 	at org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:82)
> {noformat}
> - The zoo.cfg
> {noformat}
> # The number of milliseconds of each tick
> tickTime=2000
> # The number of ticks that the initial
> # synchronization phase can take
> initLimit=10
> # The number of ticks that can pass between
> # sending a request and getting an acknowledgement
> syncLimit=5
> # the directory where the snapshot is stored.
> # do not use /tmp for storage, /tmp here is just
> # example sakes.
> dataDir=/data/zookeeper/data
> # the port at which the clients will connect
> clientPort=2181
> server.1=myserver1:2888:3888
> server.2=myserver2:2888:3888
> server.3=myserver3:2888:3888
> {noformat}
>
> - The contents of /data/zookeeper/data/version-2
> {noformat}
> -rw-r--r-- 1 zookeeper zookeeper    1 Aug  7 21:50 acceptedEpoch
> -rw-r--r-- 1 zookeeper zookeeper    1 Aug  8 20:38 currentEpoch
> -rw-r--r-- 1 zookeeper zookeeper  65M Apr  1 14:44 log.1
> -rw-r--r-- 1 zookeeper zookeeper  65M May 15 23:30 log.100000001
> -rw-r--r-- 1 zookeeper zookeeper  65M Jul  3 23:21 log.100001645
> -rw-r--r-- 1 zookeeper zookeeper  65M Aug  8 20:37 log.300000802
> -rw-r--r-- 1 zookeeper zookeeper  65M Aug 20 13:58 log.70000062a
> -rw-r--r-- 1 zookeeper zookeeper  65M Apr  4 21:22 log.f0
> {noformat}

--
This message was sent by Atlassian Jira
(v8.3.2#803003)
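
The condition reported in this issue (transaction logs present, no snapshot) can be spotted before an upgrade by looking at the file names in the version-2 directory. The following is only a rough sketch in Python, not ZooKeeper's own check: the function name missing_snapshot is hypothetical, and it assumes the "log.<hex zxid>" and "snapshot.<hex zxid>" naming visible in the listing quoted in this message. It looks at names only and does not validate file contents.

```python
import re
from typing import Iterable

# Assumed naming convention: "log.<hex zxid>" for transaction logs and
# "snapshot.<hex zxid>" for snapshots (the log names match the quoted listing).
_LOG_NAME = re.compile(r"^log\.[0-9a-fA-F]+$")
_SNAPSHOT_NAME = re.compile(r"^snapshot\.[0-9a-fA-F]+$")


def missing_snapshot(names: Iterable[str]) -> bool:
    """Return True if the names contain transaction logs but no snapshot.

    Hypothetical helper: it inspects file names only, so a corrupt or
    empty snapshot file would still count as "present".
    """
    names = list(names)
    has_log = any(_LOG_NAME.match(n) for n in names)
    has_snapshot = any(_SNAPSHOT_NAME.match(n) for n in names)
    return has_log and not has_snapshot


# File names taken from the directory listing in the report.
reported_listing = [
    "acceptedEpoch",
    "currentEpoch",
    "log.1",
    "log.100000001",
    "log.100001645",
    "log.300000802",
    "log.70000062a",
    "log.f0",
]

print(missing_snapshot(reported_listing))  # True
```

On a live server the names could come from os.listdir() on the version-2 directory under dataDir. A True result suggests the ensemble member would hit the "No snapshot found" error on upgrade and may need the ZOOKEEPER-3056 workaround.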