From: Lee Francis Wilhelmsen
To: Jakarta Commons Users List
Date: Thu, 19 May 2005 12:32:15 +0200
Subject: Re: HttpClient and best way to parse an Html file

Hi

You might want to check out http://htmlparser.sourceforge.net/
It's pretty good, but documentation is lacking.
However, there are some examples to get you started.

Regards
Lee Francis Wilhelmsen

> Subject: HttpClient and best way to parse an Html file
> From: "Rajat Sharma"
> Date: Tue, 17 May 2005 09:51:51 -0400
>
> Hi Folks,
>
> I am implementing an HTTP client using the HttpClient package. I need to parse the HTML file to get the valid "name" fields, so I can fill them in with some "values" on the client side and then post the form.
>
> What is the best way to parse the HTML file? Or is the only way to write my own raw parser?
>
> Any suggestions on how I can go about it?
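
For the form-field question quoted above, here is a minimal sketch of one possible approach. Note that it does not use the htmlparser library recommended in the reply (its API is not shown in this thread, so it is not guessed at here); instead it swaps in the HTML parser that ships with the JDK (javax.swing.text.html.parser.ParserDelegator). The class name FormFieldNames and the method extractNames are made up for illustration, and the choice to look only at input, select and textarea tags is an assumption about which fields matter.

```java
import java.io.IOException;
import java.io.Reader;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper: collects the "name" attribute of form controls.
class FormFieldNames {

    static List<String> extractNames(Reader html) throws IOException {
        final List<String> names = new ArrayList<>();

        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            // <input> has no end tag, so the parser reports it as a simple tag.
            @Override
            public void handleSimpleTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                collect(tag, attrs);
            }

            // <select> and <textarea> are reported as start tags.
            @Override
            public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                collect(tag, attrs);
            }

            private void collect(HTML.Tag tag, MutableAttributeSet attrs) {
                // Assumption: only these three control types are of interest.
                if (tag == HTML.Tag.INPUT || tag == HTML.Tag.SELECT || tag == HTML.Tag.TEXTAREA) {
                    Object name = attrs.getAttribute(HTML.Attribute.NAME);
                    if (name != null) {
                        names.add(name.toString());
                    }
                }
            }
        };

        // true = ignore any charset declared inside the document.
        new ParserDelegator().parse(html, callback, true);
        return names;
    }
}
```

The Reader passed in could wrap the response body fetched with HttpClient. With Commons HttpClient 3.x, each collected name could then be paired with a value via PostMethod.addParameter(name, value) before the form is posted. This sketch ignores multiple forms on one page, hidden-field values and disabled controls, which a real client would probably need to handle.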