hc-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sebastiano Vigna (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HTTPCLIENT-1829) Apparently normal SSL site generates a handshake failure
Date Fri, 03 Mar 2017 10:29:45 GMT
Sebastiano Vigna created HTTPCLIENT-1829:
--------------------------------------------

             Summary: Apparently normal SSL site generates a handshake failure
                 Key: HTTPCLIENT-1829
                 URL: https://issues.apache.org/jira/browse/HTTPCLIENT-1829
             Project: HttpComponents HttpClient
          Issue Type: Bug
          Components: HttpClient (classic)
    Affects Versions: 4.5.3
         Environment: Linux localhost.localdomain 4.9.10-200.fc25.x86_64 #1 SMP Wed Feb 15
23:28:59 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

java version "1.8.0_121"
Java(TM) SE Runtime Environment (build 1.8.0_121-b13)
Java HotSpot(TM) 64-Bit Server VM (build 25.121-b13, mixed mode)

            Reporter: Sebastiano Vigna
            Priority: Minor


When our crawler (BUbiNG) tries to fetch this robots.txt file

http://isizulu.net/robots.txt

HTTP client finds a redirect to

https://isizulu.net/robots.txt

But than it dies with the exception below. There is no problem with Chrome etc. or wget. We
configure the client as

		robotsRequestConfig = RequestConfig.custom()
				.setRedirectsEnabled( true )
				.setMaxRedirects( 5 ) 
				.build();


For your amusement, this is the "bug report" we got (the sender is omitted for mercy):

-----------
Subject: Bubing borken by design?

Hi,

lately I've been seeing your Bubing crawler trying to retrieve
http://isizulu.net/robots.txt but it doesn't seem to be capable of
handling the redirect to https://isizulu.net/robots.txt so I am
wondering what you have actually been doing during "the last ten years
of research" as it says on your site (a minimum finding could have been
that Java is just crap).

-----------------------
javax.net.ssl.SSLHandshakeException: Received fatal alert: handshake_failure
        at sun.security.ssl.Alerts.getSSLException(Alerts.java:192)
        at sun.security.ssl.Alerts.getSSLException(Alerts.java:154)
        at sun.security.ssl.SSLSocketImpl.recvAlert(SSLSocketImpl.java:2023)
        at sun.security.ssl.SSLSocketImpl.readRecord(SSLSocketImpl.java:1125)
        at sun.security.ssl.SSLSocketImpl.performInitialHandshake(SSLSocketImpl.java:1375)
        at sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1403)
        at sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1387)
        at org.apache.http.conn.ssl.SSLConnectionSocketFactory.createLayeredSocket(SSLConnectionSocketFactory.java:396)
        at org.apache.http.conn.ssl.SSLConnectionSocketFactory.connectSocket(SSLConnectionSocketFactory.java:355)
        at org.apache.http.impl.conn.DefaultHttpClientConnectionOperator.connect(DefaultHttpClientConnectionOperator.java:142)
        at org.apache.http.impl.conn.BasicHttpClientConnectionManager.connect(BasicHttpClientConnectionManager.java:323)
        at org.apache.http.impl.execchain.MainClientExec.establishRoute(MainClientExec.java:381)
        at org.apache.http.impl.execchain.MainClientExec.execute(MainClientExec.java:237)
        at org.apache.http.impl.execchain.ProtocolExec.execute(ProtocolExec.java:185)
        at org.apache.http.impl.execchain.RetryExec.execute(RetryExec.java:89)
        at org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:111)
        at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:185)
        at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:72)
        at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:221)
        at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:191)
        at it.unimi.di.law.bubing.util.FetchData.fetch(FetchData.java:322)
        at it.unimi.di.law.bubing.frontier.FetchingThread.run(FetchingThread.java:239)




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@hc.apache.org
For additional commands, e-mail: dev-help@hc.apache.org


Mime
View raw message