Return-Path: Delivered-To: apmail-cocoon-dev-archive@www.apache.org Received: (qmail 58922 invoked from network); 9 Sep 2004 06:38:29 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 9 Sep 2004 06:38:29 -0000 Received: (qmail 29772 invoked by uid 500); 9 Sep 2004 06:38:16 -0000 Delivered-To: apmail-cocoon-dev-archive@cocoon.apache.org Received: (qmail 29677 invoked by uid 500); 9 Sep 2004 06:38:11 -0000 Mailing-List: contact dev-help@cocoon.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: list-post: Reply-To: dev@cocoon.apache.org Delivered-To: mailing list dev@cocoon.apache.org Received: (qmail 29656 invoked by uid 99); 9 Sep 2004 06:38:10 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=RCVD_BY_IP,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (hermes.apache.org: domain of antonio.fiol@gmail.com designates 64.233.170.192 as permitted sender) Received: from [64.233.170.192] (HELO mproxy.gmail.com) (64.233.170.192) by apache.org (qpsmtpd/0.28) with ESMTP; Wed, 08 Sep 2004 23:38:09 -0700 Received: by mproxy.gmail.com with SMTP id 80so251520rnl for ; Wed, 08 Sep 2004 23:38:07 -0700 (PDT) Received: by 10.38.11.80 with SMTP id 80mr4080rnk; Wed, 08 Sep 2004 23:38:07 -0700 (PDT) Received: by 10.38.8.28 with HTTP; Wed, 8 Sep 2004 23:38:07 -0700 (PDT) Message-ID: Date: Thu, 9 Sep 2004 08:38:07 +0200 From: =?ISO-8859-1?Q?Antonio_Fiol_Bonn=EDn?= Reply-To: =?ISO-8859-1?Q?Antonio_Fiol_Bonn=EDn?= To: dev@cocoon.apache.org Subject: Custom extensions - to be made available if possible Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Hello, We have started developing two extensions for cocoon, and we would like to know if the core team would be interested in getting them into the trunk, and optionally in maintaining them in the future. The extensions are: - A transformer that connects via HTTP POST and sends its XML input to the server, and returns the XML returned from the server to the pipeline. This is similar to the SOAP thing, but without the envelope, and with a predefined (configured in the sitemap) URL. - An extension to the Cocoon Lucene searching system (or something different, yet pending design), so that non-XML content can also be indexed. In particular, we are interested on PDF, but we are designing it as generic as possible. BTW, your opinion may be very valueble for the design. Let me explain the two approaches we have thought of: a) Refactoring SimpleLuceneXMLIndexerImpl so that its private method indexDocument is not private, and taking it to an external component. b) Creating a PDFGenerator (in the cocoon sense of generator, of course). Option (a) seems to be giving us more headaches than pleasure, and option (b) seems cleaner to a certain point. Option (b) would allow to follow links in the PDF file, if developed to that point. However, option (b) implies choosing a format for its output (which?), and also poses some problems wrt. the sitemap. Until now, we have a pipeline using a reader to read pdf files (static, from disk). And we would need a generator to be invoked instead for the content and links views. How can we do that? Maybe with a selector? But that does not seem very clean. Any hints there? Any other options? Any general comments? What about making these into the trunk once they are tested? Yours sincerely, Antonio Fiol