httpd-modules-dev mailing list archives

From "Pitchaimani Muthuveeran" <pm...@msys-tech.com>
Subject Re: Extract Url from response page
Date Sat, 18 Aug 2007 06:00:55 GMT
I am trying to solve a similar problem: I want to extract only the
<a href=""> values from the response content. I am not interested in
parsing other tags to get their URI information.
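
To make the narrower task concrete, here is a minimal sketch of collecting only the href values of <a> tags. It is not an APR or httpd API; it uses Python's standard html.parser purely to illustrate the idea, and the names AnchorHrefCollector and extract_anchor_hrefs are hypothetical. Inside a module the same logic would presumably sit in an output filter on top of a C parser such as libxml, which is an assumption rather than something confirmed in this thread.

```python
from html.parser import HTMLParser


class AnchorHrefCollector(HTMLParser):
    """Collect href values from <a> tags only; every other tag is ignored."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names before calling us,
        # so <A HREF="..."> is matched as well.
        if tag != "a":
            return
        for name, value in attrs:
            # Skip anchors with a missing or empty href (e.g. <a name="top">).
            if name == "href" and value:
                self.hrefs.append(value)


def extract_anchor_hrefs(html_text):
    """Return the href values of all <a> tags in html_text, in document order."""
    collector = AnchorHrefCollector()
    collector.feed(html_text)
    collector.close()
    return collector.hrefs


if __name__ == "__main__":
    # Hypothetical response body: only the two anchors should be reported,
    # not the <link> or the <img>.
    page = (
        '<html><head><link rel="stylesheet" href="/style.css"></head>'
        '<body><a href="http://example.com/one">one</a>'
        '<img src="/logo.png"><A HREF="/two">two</A></body></html>'
    )
    for href in extract_anchor_hrefs(page):
        print(href)
```

This only sees anchors present in the markup as served; links that a page adds later through script would not be found this way.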

-Mani

----- Original Message ----- 
From: "Tim Bray" <Tim.Bray@Sun.COM>
To: <modules-dev@httpd.apache.org>
Sent: Friday, August 17, 2007 10:17 PM
Subject: Re: Extract Url from response page


> In general it's a hard problem. How much do you know about the possible
> format of the response page? Does it use AJAX magic to pull in pieces via
> XMLHttpRequest? Do you only care about URIs in <a href="">, or are you
> interested in <link> or even URIs embedded in running text?  -Tim
>
> On Aug 17, 2007, at 7:00 AM, prasanna wrote:
>
>> I am looking for something from APR or the core Apache libraries. If
>> APR doesn't have one, then we will go with libxml.
>>
>>
>> Ray Morris wrote:
>>>    See libxml.
>>> -- 
>>> Ray B. Morris
>>> support@bettercgi.com
>>>
>>> Strongbox - The next generation in site security:
>>> http://www.bettercgi.com/strongbox/
>>>
>>>
>>>
>>> On 08/17/2007 04:03:24 AM, prasana wrote:
>>>> Hi,
>>>>
>>>> Are there any API functions to extract URLs from a response page? I
>>>> need to log all the URLs in the response page.
>>>>
>>>> Thanks in Advance.
>>>>
>>>>
>>>
>>>
>>
>
> 

