commons-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Serge Knystautas <sknystau...@gmail.com>
Subject Re: HttpClient and best way to parse an Html file
Date Thu, 19 May 2005 19:55:24 GMT
On 5/17/05, Rajat Sharma <rsharma@airvananet.com> wrote:
> Hi Folks,
> 
> I am implementing a http client using httpclient package. I need to parse the html file
to get the valid "name" fields, so I could fill them up with some "values" on the client side
and then post the form.
> 
> What is the best way to parse the html file or the only way is to have my own raw parser.

I highly recommend Jericho (http://jerichohtml.sourceforge.net/). 
Gives you DOM access, easy way to swap in new html, and is very good
at handling bad html.

-- 
Serge Knystautas
Lokitech >> software . strategy . design >> http://www.lokitech.com
p. 301.656.5501
e. sergek@lokitech.com

---------------------------------------------------------------------
To unsubscribe, e-mail: commons-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-user-help@jakarta.apache.org


Mime
View raw message