cocoon-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Vadim Gritsenko <vadim.gritse...@verizon.net>
Subject Re: Converting html documents to plain text
Date Fri, 06 Sep 2002 13:48:49 GMT
Piroumian Konstantin wrote:

> Cocoon allows to setup a pipeline that will retrieve remote HTML 
> documents, transform them and write to the HD.
> I think you'll have to run Cocoon from command line and provide the 
> list of all the documents you need.
>  
> Though, I'm not so sure that Cocoon's command line interface will work 
> with remote documents.


It will.

Vadim


>  
> -- 
> Konstantin Piroumian 
>
>     -----Original Message-----
>     *From:* Francesco Marchioni [mailto:marchioni.francesco@libero.it]
>     *Sent:* Thursday, September 05, 2002 7:00 PM
>     *To:* cocoon-users@xml.apache.org
>     *Subject:* Converting html documents to plain text
>
>     Hi all,
>     I have to task to convert lots of html documents
>     downloaded with the URLConnection class....
>     URL url = new URL("http://host/document.htm");
>     URLConnection urlConnection = url.openConnection();
>     ...into plain text document to be written on my hard disk.
>     I wonder if I can do it easily with Cocoon....I don't know it
>     deeply......could you give some feedback ??
>     Thanks
>     Francesco
>      
>
>
>
>     _______________________________________________________________________________
>     <http://www.incredimail.com/redir.asp?ad_id=316&lang=16> 
>     /IncrediMail/ - *il mondo della posta elettronica si รจ finalmente
>     evoluto* - *_Clicca Qui_*
>     <http://www.incredimail.com/redir.asp?ad_id=316&lang=16> 
>



Mime
  • Unnamed multipart/related (inline, None, 0 bytes)
View raw message