commons-user mailing list archives

From Serge Knystautas <>
Subject Re: HttpClient and best way to parse an Html file
Date Thu, 19 May 2005 19:55:24 GMT
On 5/17/05, Rajat Sharma <> wrote:
> Hi Folks,
> I am implementing an HTTP client using the HttpClient package. I need to parse the HTML file
> to get the valid "name" fields, so that I can fill them in with "values" on the client side
> and then post the form.
> What is the best way to parse the HTML file? Or is the only way to write my own raw parser?

I highly recommend Jericho ( 
It gives you DOM access and an easy way to swap in new HTML, and it is
very good at handling bad HTML.
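
To make the "collect the name fields" step concrete, here is a rough sketch. Note that it does not use Jericho: so that it runs on a plain JDK, it swaps in the lenient HTML parser that ships with Swing (ParserDelegator). The class name FormFieldNames, the helper extractNames, and the sample form are all made up for illustration. With a real parser library the same idea applies: walk the form controls and gather their name attributes before building the POST.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper, not part of HttpClient or Jericho.
public class FormFieldNames {

    // Returns the distinct "name" attributes of input, select and textarea
    // tags, in the order they are first seen in the document.
    static List<String> extractNames(String html) throws IOException {
        Set<String> names = new LinkedHashSet<>();

        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleSimpleTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                collect(tag, attrs); // empty elements such as <input>
            }

            @Override
            public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                collect(tag, attrs); // container elements such as <select>, <textarea>
            }

            private void collect(HTML.Tag tag, MutableAttributeSet attrs) {
                if (tag == HTML.Tag.INPUT || tag == HTML.Tag.SELECT || tag == HTML.Tag.TEXTAREA) {
                    Object name = attrs.getAttribute(HTML.Attribute.NAME);
                    if (name != null) {
                        names.add(name.toString());
                    }
                }
            }
        };

        // The third argument tells the parser to ignore any charset declared in the page.
        new ParserDelegator().parse(new StringReader(html), callback, true);
        return new ArrayList<>(names);
    }

    public static void main(String[] args) throws IOException {
        // Made-up sample form; the submit button has no name, so it is skipped.
        String html = "<html><body><form action=\"/login\" method=\"post\">"
                + "<input type=\"text\" name=\"user\">"
                + "<input type=\"password\" name=\"pass\">"
                + "<input type=\"submit\" value=\"Go\">"
                + "</form></body></html>";
        System.out.println(extractNames(html));
    }
}
```

Once you have the names, you can pair each one with a value and send them as the parameters of the form post.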

Serge Knystautas
Lokitech >> software . strategy . design >>
p. 301.656.5501
