Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7A99718887 for ; Mon, 5 Oct 2015 18:16:37 +0000 (UTC) Received: (qmail 7141 invoked by uid 500); 5 Oct 2015 18:16:34 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 7073 invoked by uid 500); 5 Oct 2015 18:16:34 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 7052 invoked by uid 99); 5 Oct 2015 18:16:33 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 05 Oct 2015 18:16:33 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id DF727C0952 for ; Mon, 5 Oct 2015 18:16:32 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.997 X-Spam-Level: ** X-Spam-Status: No, score=2.997 tagged_above=-999 required=6.31 tests=[KAM_BADIPHTTP=2, KAM_LAZY_DOMAIN_SECURITY=1, MIME_QP_LONG_LINE=0.001, RP_MATCHES_RCVD=-0.006, URIBL_BLOCKED=0.001, WEIRD_PORT=0.001] autolearn=disabled Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id IQ73aGsEQFfD for ; Mon, 5 Oct 2015 18:16:18 +0000 (UTC) Received: from mail.openindex.io (mail.openindex.io [178.21.113.82]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTP id 57A09204DD for ; Mon, 5 Oct 2015 18:16:18 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by mail.openindex.io (Postfix) with ESMTP id 09275382010 for ; Mon, 5 Oct 2015 18:16:17 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at mail.openindex.io Received: from mail.openindex.io ([127.0.0.1]) by localhost (mail.openindex.io [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id N6QKWHYtnPeY for ; Mon, 5 Oct 2015 18:16:16 +0000 (UTC) Received: from mail.openindex.io (localhost [127.0.0.1]) by mail.openindex.io (Postfix) with ESMTP id 25E0238200C for ; Mon, 5 Oct 2015 18:16:16 +0000 (UTC) Subject: RE: Implementing AbstractFullDistribZkTestBase From: =?utf-8?Q?Markus_Jelsma?= To: =?utf-8?Q?solr-user=40lucene=2Eapache=2Eorg?= Date: Mon, 5 Oct 2015 18:16:16 +0000 Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Priority: 3 (Normal) X-Mailer: Zarafa 7.1.10-44973 Message-Id: Hi - no, i don't think so, it doesn't happen all the time, but too frequently. The machine running the tests has a high powered CPU, plenty of cores and RAM. Markus =20 =20 -----Original message----- > From:Mark Miller > Sent: Monday 5th October 2015 19:52 > To: solr-user@lucene.apache.org > Subject: Re: Implementing AbstractFullDistribZkTestBase >=20 > If it's always when using https as in your examples, perhaps it's SOLR-5776. >=20 > - mark >=20 > On Mon, Oct 5, 2015 at 10:36 AM Markus Jelsma > wrote: >=20 > > Hmmm, i tried that just now but i sometimes get tons of Connection reset > > errors. The tests then end with "There are still nodes recoverying - waited > > for 30 seconds". > > > > [RecoveryThread-collection1] ERROR org.apache.solr.cloud.RecoveryStrategy > > - Error while trying to recover.:java.util.concurrent.ExecutionException: > > org.apache.solr.client.solrj.SolrServerException: IOException occured when > > talking to server at: https://127.0.0.1:49146 > > at java.util.concurrent.FutureTask.report(FutureTask.java:122) > > at java.util.concurrent.FutureTask.get(FutureTask.java:192) > > at > > org.apache.solr.cloud.RecoveryStrategy.sendPrepRecoveryCmd(RecoveryStrategy.java:598) > > at > > org.apache.solr.cloud.RecoveryStrategy.doRecovery(RecoveryStrategy.java:361) > > at > > org.apache.solr.cloud.RecoveryStrategy.run(RecoveryStrategy.java:227) > > Caused by: org.apache.solr.client.solrj.SolrServerException: IOException > > occured when talking to server at: https://127.0.0.1:49146 > > at > > org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:574) > > at > > org.apache.solr.client.solrj.impl.HttpSolrClient$1.call(HttpSolrClient.java:270) > > at > > org.apache.solr.client.solrj.impl.HttpSolrClient$1.call(HttpSolrClient.java:266) > > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > > at > > org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor$1.run(ExecutorUtil.java:210) > > at > > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > > at > > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > > at java.lang.Thread.run(Thread.java:745) > > Caused by: java.net.SocketException: Connection reset > > at java.net.SocketInputStream.read(SocketInputStream.java:209) > > at java.net.SocketInputStream.read(SocketInputStream.java:141) > > at sun.security.ssl.InputRecord.readFully(InputRecord.java:465) > > at sun.security.ssl.InputRecord.read(InputRecord.java:503) > > at > > sun.security.ssl.SSLSocketImpl.readRecord(SSLSocketImpl.java:954) > > at > > sun.security.ssl.SSLSocketImpl.performInitialHandshake(SSLSocketImpl.java:1343) > > at > > sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1371) > > at > > sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1355) > > at > > org.apache.http.conn.ssl.SSLSocketFactory.connectSocket(SSLSocketFactory.java:543) > > at > > org.apache.http.conn.ssl.SSLSocketFactory.connectSocket(SSLSocketFactory.java:409) > > at > > org.apache.http.impl.conn.DefaultClientConnectionOperator.openConnection(DefaultClientConnectionOperator.java:177) > > at > > org.apache.http.impl.conn.ManagedClientConnectionImpl.open(ManagedClientConnectionImpl.java:304) > > at > > org.apache.http.impl.client.DefaultRequestDirector.tryConnect(DefaultRequestDirector.java:611) > > at > > org.apache.http.impl.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:446) > > at > > org.apache.http.impl.client.AbstractHttpClient.doExecute(AbstractHttpClient.java:882) > > at > > org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:82) > > at > > org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:107) > > at > > org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:55) > > at > > org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:465) > > ... 7 more > > > > [RecoveryThread-collection1] ERROR org.apache.solr.cloud.RecoveryStrategy > > - Recovery failed - trying again... (1) > > [RecoveryThread-collection1] INFO org.apache.solr.cloud.RecoveryStrategy - > > Wait 4.0 seconds before trying to recover again (2) > > > > > > > > -----Original message----- > > > From:Erick Erickson > > > Sent: Monday 5th October 2015 15:59 > > > To: solr-user@lucene.apache.org > > > Subject: Re: Implementing AbstractFullDistribZkTestBase > > > > > > Right, I'm assuming you're creating a cluster somewhere. > > > Try calling (from memory) waitForRecoveriesToFinish in > > > AbstractDistribZkTestBase after creating the collection > > > to insure that the nodes are up and running before you > > > index to them. > > > > > > Shot in the dark > > > Erick > > > > > > On Mon, Oct 5, 2015 at 1:36 AM, Markus Jelsma > > > wrote: > > > > Hello, > > > > > > > > I have several implementations of AbstractFullDistribZkTestBase of > > Solr 5.3.0. Sometimes a test fails with either "There are still nodes > > recoverying - waited for 30 seconds" or "IOException occured when talking > > to server at: https://127.0.0.1:44474/collection1", so usually at least > > one of all test fails. These are very simple implementations such as : > > > > > > > > @Test > > > > @ShardsFixed(num =3D 2) > > > > public void testSomeStuff() throws Exception { > > > > waitForThingsToLevelOut(30); > > > > > > > > del("*:*"); > > > > indexDoc("input 1", "user_1"); > > > > indexDoc("input 1", "user_2"); > > > > indexDoc("input 2", "user_1"); > > > > indexDoc("input 2", "user_2"); > > > > commit(); > > > > > > > > Any idea on how to get rid of the nodes recovering issue=3F > > > > Markus > > > > > > --=20 > - Mark > about.me/markrmiller >=20