Return-Path: Delivered-To: apmail-hc-dev-archive@www.apache.org Received: (qmail 954 invoked from network); 17 May 2009 17:32:15 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 17 May 2009 17:32:15 -0000 Received: (qmail 32685 invoked by uid 500); 17 May 2009 17:32:14 -0000 Delivered-To: apmail-hc-dev-archive@hc.apache.org Received: (qmail 32615 invoked by uid 500); 17 May 2009 17:32:14 -0000 Mailing-List: contact dev-help@hc.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "HttpComponents Project" Delivered-To: mailing list dev@hc.apache.org Received: (qmail 32605 invoked by uid 99); 17 May 2009 17:32:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 17 May 2009 17:32:14 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 17 May 2009 17:32:05 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 8D06B29A0011 for ; Sun, 17 May 2009 10:31:45 -0700 (PDT) Message-ID: <409228709.1242581505563.JavaMail.jira@brutus> Date: Sun, 17 May 2009 10:31:45 -0700 (PDT) From: "Oleg Kalnichevski (JIRA)" To: dev@hc.apache.org Subject: [jira] Updated: (HTTPCORE-195) Make it easier to override the default behavior of ChunkDecoder through subclassing In-Reply-To: <1898615537.1241797186048.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HTTPCORE-195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg Kalnichevski updated HTTPCORE-195: --------------------------------------- Priority: Minor (was: Critical) Fix Version/s: 4.1 Issue Type: Improvement (was: Bug) Summary: Make it easier to override the default behavior of ChunkDecoder through subclassing (was: ChunkDecoder is overly sensitive to truncated chucks) > Make it easier to override the default behavior of ChunkDecoder through subclassing > ----------------------------------------------------------------------------------- > > Key: HTTPCORE-195 > URL: https://issues.apache.org/jira/browse/HTTPCORE-195 > Project: HttpComponents HttpCore > Issue Type: Improvement > Components: HttpCore NIO > Affects Versions: 4.0 > Reporter: Patrick Moore > Priority: Minor > Fix For: 4.1 > > > Our server is webcrawling. > We are frequently encountering this issue. We think this might be related to something on the server that we are scanning. But that doesn't matter. We need to handle such cases without exceptions. (From my perspective, such things should generate a debug message -- certainly not an exception that ends processing and throws away the retrieved content! ) > http://stuftpizza.com/ seems to reliably result in this problem > May be TransferEncoding? http://www.w3.org/Protocols/rfc2616/rfc2616-sec3.html#sec3.6 > Either way we need to be able to deal with issues on the other servers. > {{{ > Date Mon, 20 Apr 2009 03:56:45 GMT > Server Apache/2.2.3 (Red Hat) > Accept-Ranges bytes > Connection close > Transfer-Encoding chunked > Content-Type text/html > '''Request Headers''' > Host stuftpizza.com > User-Agent Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.0.8) Gecko/2009032608 Firefox/3.0.8 > Accept text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8 > Accept-Language en-us,en;q=0.5 > Accept-Encoding gzip,deflate > Accept-Charset ISO-8859-1,utf-8;q=0.7,*;q=0.7 > Keep-Alive 300 > Connection keep-alive > Cookie > __utma=47358053.1237981682.1240199754.1240199754.1240199754.1; __utmb=47358053; __utmc=47358053; __utmz > =47358053.1240199754.1.1.utmccn=(direct)|utmcsr=(direct)|utmcmd=(none) > Cache-Control max-age=0 > }}} > {{{ > 20:51:08,768 INFO [nioEventListener] Request http://stuftpizza.com/ failed with exception. > org.apache.http.MalformedChunkCodingException: Truncated chunk > at org.apache.http.impl.nio.codecs.ChunkDecoder.read(ChunkDecoder.java:203) > at org.apache.http.nio.util.SimpleInputBuffer.consumeContent(SimpleInputBuffer.java:60) > at org.apache.http.nio.entity.BufferingNHttpEntity.consumeContent(BufferingNHttpEntity.java:72) > at org.apache.http.nio.protocol.AsyncNHttpClientHandler.inputReady(AsyncNHttpClientHandler.java:236) > at org.apache.http.nio.protocol.BufferingHttpClientHandler.inputReady(BufferingHttpClientHandler.java:118) > at org.apache.http.impl.nio.DefaultNHttpClientConnection.consumeInput(DefaultNHttpClientConnection.java:178) > at org.apache.http.impl.nio.DefaultClientIOEventDispatch.inputReady(DefaultClientIOEventDispatch.java:146) > at com.amplafi.iomanagement.http.UniversalIOEventDispatch.inputReady(UniversalIOEventDispatch.java:133) > at $IOEventDispatch_120c19cd1c7.inputReady($IOEventDispatch_120c19cd1c7.java) > at org.apache.http.impl.nio.reactor.BaseIOReactor.readable(BaseIOReactor.java:153) > at org.apache.http.impl.nio.reactor.AbstractIOReactor.processEvent(AbstractIOReactor.java:314) > at org.apache.http.impl.nio.reactor.AbstractIOReactor.processEvents(AbstractIOReactor.java:294) > at org.apache.http.impl.nio.reactor.AbstractIOReactor.execute(AbstractIOReactor.java:256) > at org.apache.http.impl.nio.reactor.BaseIOReactor.execute(BaseIOReactor.java:96) > at org.apache.http.impl.nio.reactor.AbstractMultiworkerIOReactor$Worker.run(AbstractMultiworkerIOReactor.java:556) > at java.lang.Thread.run(Thread.java:637) > }}} -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@hc.apache.org For additional commands, e-mail: dev-help@hc.apache.org