hc-dev mailing list archives

From "Sebastiano Vigna (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HTTPCLIENT-1498) "java.lang.IllegalArgumentException: Host name may not be blank" thrown during redirect (regression?)
Date Mon, 15 Sep 2014 13:36:33 GMT

    [ https://issues.apache.org/jira/browse/HTTPCLIENT-1498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14133889#comment-14133889 ]

Sebastiano Vigna commented on HTTPCLIENT-1498:
----------------------------------------------

I just ran a crawl with 4.4 Alpha 1, but a similar exception is still thrown during redirect handling:

2014-09-12 20:27:06,679 28916608 ERROR [FetchingThread-7] i.u.d.l.b.f.FetchingThread - Unexpected exception during fetch of http://9to5liberation.com/robots.txt
java.lang.IllegalArgumentException: Host name may not contain blanks
        at org.apache.http.util.Args.containsNoBlanks(Args.java:84) ~[httpcore.jar:4.4-alpha1]
        at org.apache.http.HttpHost.<init>(HttpHost.java:80) ~[httpcore.jar:4.4-alpha1]
        at org.apache.http.client.utils.URIUtils.extractHost(URIUtils.java:370) ~[httpclient.jar:4.4-alpha1]
        at org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:133) ~[httpclient.jar:4.4-alpha1]
        at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:184) ~[httpclient.jar:4.4-alpha1]
        at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:73) ~[httpclient.jar:4.4-alpha1]
        at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:222) ~[httpclient.jar:4.4-alpha1]
        at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:192) ~[httpclient.jar:4.4-alpha1]
        at it.unimi.di.law.bubing.util.FetchData.fetch(FetchData.java:322) ~[bubing-0.9.5.jar:na]
        at it.unimi.di.law.bubing.frontier.FetchingThread.run(FetchingThread.java:267) ~[bubing-0.9.5.jar:na]
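
Both traces fail inside the HttpHost constructor while the redirect target is being turned into a host. Until the library rejects such targets gracefully, a crawler that follows redirects itself could screen the Location value before using it. The following is only a sketch under that assumption: the class RedirectTargetCheck and the method hasUsableHost are hypothetical names, not part of HttpClient or BUbiNG, and the check relies solely on java.net.URI. It is deliberately conservative, so it also rejects authorities that java.net.URI cannot parse as a server name (for example, a host containing an underscore).

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper for illustration; not part of HttpClient or BUbiNG.
final class RedirectTargetCheck {

    private RedirectTargetCheck() {}

    /**
     * Returns true only if the given Location value parses as an absolute
     * URI whose host component is present and non-blank.
     */
    static boolean hasUsableHost(String location) {
        if (location == null) {
            return false;
        }
        final URI uri;
        try {
            uri = new URI(location.trim());
        } catch (URISyntaxException e) {
            // For example, a literal space inside the authority.
            return false;
        }
        if (!uri.isAbsolute()) {
            // Relative targets must be resolved against the request URI first.
            return false;
        }
        // getHost() is null when the authority is not a valid server name.
        String host = uri.getHost();
        return host != null && !host.trim().isEmpty();
    }

    public static void main(String[] args) {
        String[] samples = {
            "http://example.com/robots.txt",
            "http:// /robots.txt",
            "http://%20/robots.txt",
            "/robots.txt"
        };
        for (String s : samples) {
            System.out.println(s + " -> " + hasUsableHost(s));
        }
    }
}
```

A caller would skip the redirect and log the offending URL when hasUsableHost returns false, instead of letting an IllegalArgumentException escape from the fetching thread.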


> "java.lang.IllegalArgumentException: Host name may not be blank" thrown during redirect (regression?)
> -----------------------------------------------------------------------------------------------------
>
>                 Key: HTTPCLIENT-1498
>                 URL: https://issues.apache.org/jira/browse/HTTPCLIENT-1498
>             Project: HttpComponents HttpClient
>          Issue Type: Bug
>          Components: HttpClient
>    Affects Versions: 4.3.3
>            Reporter: Sebastiano Vigna
>             Fix For: 4.3.4, 4.4 Alpha1
>
>
> The bug we reported some time ago about null hosts in redirects seems to have regressed, although the old problem was with "null" and the new problem is with "blank":
> 2014-04-20 04:20:09,169 19319369 ERROR [FetchingThread-197] i.u.d.l.b.f.FetchingThread - Unexpected exception
> java.lang.IllegalArgumentException: Host name may not be blank
>         at org.apache.http.util.Args.notBlank(Args.java:68) ~[httpcore.jar:4.3.2]
>         at org.apache.http.HttpHost.<init>(HttpHost.java:81) ~[httpcore.jar:4.3.2]
>         at org.apache.http.client.utils.URIUtils.extractHost(URIUtils.java:370) ~[httpclient.jar:4.3.3]
>         at org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:132) ~[httpclient.jar:4.3.3]
>         at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:186) ~[httpclient.jar:4.3.3]
>         at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:72) ~[httpclient.jar:4.3.3]
>         at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:214) ~[httpclient.jar:4.3.3]
>         at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:185) ~[httpclient.jar:4.3.3]
>         at it.unimi.di.law.bubing.util.FetchData.fetch(FetchData.java:322) ~[bubing-0.9.3.jar:na]
> This is caused by this site:
> > wget --max-redirect=0 http://www.thegamersedge.co.uk/robots.txt
> --2014-04-20 20:47:43--  http://www.thegamersedge.co.uk/robots.txt
> Resolving www.thegamersedge.co.uk (www.thegamersedge.co.uk)... 72.1.201.156, 72.1.201.152
> Connecting to www.thegamersedge.co.uk (www.thegamersedge.co.uk)|72.1.201.156|:80... connected.
> HTTP request sent, awaiting response... 302 Moved Temporarily
> Location: http://robots.txt [following]
> 0 redirections exceeded.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@hc.apache.org
For additional commands, e-mail: dev-help@hc.apache.org

