Return-Path: Delivered-To: apmail-cocoon-dev-archive@www.apache.org Received: (qmail 82320 invoked from network); 17 Oct 2008 16:07:14 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 17 Oct 2008 16:07:14 -0000 Received: (qmail 93692 invoked by uid 500); 17 Oct 2008 16:07:16 -0000 Delivered-To: apmail-cocoon-dev-archive@cocoon.apache.org Received: (qmail 93310 invoked by uid 500); 17 Oct 2008 16:07:15 -0000 Mailing-List: contact dev-help@cocoon.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: List-Post: Reply-To: dev@cocoon.apache.org List-Id: Delivered-To: mailing list dev@cocoon.apache.org Received: (qmail 93299 invoked by uid 99); 17 Oct 2008 16:07:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 17 Oct 2008 09:07:14 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 17 Oct 2008 16:06:06 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 57A2F234C234 for ; Fri, 17 Oct 2008 09:06:44 -0700 (PDT) Message-ID: <1455293735.1224259604357.JavaMail.jira@brutus> Date: Fri, 17 Oct 2008 09:06:44 -0700 (PDT) From: "Reinhard Poetz (JIRA)" To: dev@cocoon.apache.org Subject: [jira] Closed: (COCOON3-5) Add an HTML2XHTML converter as Starter In-Reply-To: <1349778677.1223992364331.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/COCOON3-5?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reinhard Poetz closed COCOON3-5. -------------------------------- Resolution: Fixed Thanks Simone! We applied your patch with some minor modification so that it runs with sitemaps too. > Add an HTML2XHTML converter as Starter > -------------------------------------- > > Key: COCOON3-5 > URL: https://issues.apache.org/jira/browse/COCOON3-5 > Project: Cocoon 3 > Issue Type: Improvement > Components: cocoon-optional > Affects Versions: 3.0.0-alpha-2 > Reporter: Simone Tripodi > Assignee: Cocoon Developers Team > Priority: Minor > Fix For: 3.0.0-alpha-2 > > Attachments: NekoGenerator.patch > > > This starter component for the pipeline is a component that transform an HTML content, taken by the specified URL, and transform it in XHTML or, at least, a well-formed XML document. > So now the original document can be processed in the pipeline in various ways: > * following links; > * implementing crwalers; > * easy transforming the original document in other various formats; > * etc... > I want to explain the need of this component with a testcase; last week I had to face a singular problem, realizing a simple service that takes in input an HTML page's URL, and transform it , through the Optimus' XSLT (http://microformatique.com/optimus - http://code.google.com/p/mf-optimus/source/browse/#svn/trunk/xsl) in an XML document that contains the original doc's Microformats, in an easier and more parsable formats. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.