Message-ID: <61c9bc470707121619m67061653u2eb09a7446477ae8@mail.gmail.com>
Date: Fri, 13 Jul 2007 00:19:35 +0100
From: "Ross Gardler"
Sender: ross.gardler@googlemail.com
To: dev@forrest.apache.org, sina@khakbaz.com
Subject: Re: Type Detection for RDF/XML files
In-Reply-To: <36815.192.168.1.35.1184067327.webmail@192.168.1.35>

On 10/07/07, Sina K. Heshmati wrote:
> I'm currently using the following pattern to catch the target files for the SKOS plugin:
>
>
> ...
>
>
> As the pattern suggests, files whose name ends with 'skos.xml' will be processed as SKOS files. Fine, but how bad is it?

It means that all requests need a fixed URL space. Forrest should not define a user's URL space. However, keeping things moving is important at this stage.

> The issue FOR-1005 [1] suggests that the 'sourcetypeaction' mechanism should be used
> to determine the type of a file and set actions accordingly. We see in FOR-1018 [2]
> that we need to extend our existing implementation of the sourcetypeaction mechanism
> to be able to support RDF/XML formats.
>
> One might take an approach similar to bootstrapping, that is:
>
> 0. Try to determine the file type by parsing the filename; if failed,
> 1. Fallback to sourcetypeaction mechanism with a minimum number of tests; if failed,
> 2. Type left undetermined, that is no actions.
>
> If we decide to parse the filename to determine the type of a file, then we'll need to have conventions set for each format.
>
> What do you think should be done to determine the type of RDF/XML formats?

In an ideal world we would always avoid using the filename. However, there is a need to progress with your work, and a good enough solution is, well, good enough, until someone has the desire to fix it.

I would say your 0. and 1. steps should be reversed.

Ross
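
As a rough illustration of the reversed order suggested above (inspect the content first, fall back to the filename, otherwise leave the type undetermined), here is a minimal Python sketch. It is not Forrest's sourcetypeaction implementation. The function names, the use of Python, and the content test (an rdf:RDF root element containing at least one element in the SKOS core namespace) are all assumptions made for the example; only the 'skos.xml' filename suffix comes from the thread.

```python
import xml.etree.ElementTree as ET
from typing import Optional

# Well-known namespace URIs for RDF and SKOS core.
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
SKOS_NS = "http://www.w3.org/2004/02/skos/core#"


def detect_by_content(xml_text: str) -> Optional[str]:
    """Hypothetical content test: rdf:RDF root with at least one SKOS element."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None  # not well-formed XML, so content detection fails
    if root.tag != f"{{{RDF_NS}}}RDF":
        return None
    skos_prefix = f"{{{SKOS_NS}}}"
    for elem in root.iter():
        if isinstance(elem.tag, str) and elem.tag.startswith(skos_prefix):
            return "skos"
    return None


def detect_by_filename(filename: str) -> Optional[str]:
    """Filename convention from the thread: names ending in 'skos.xml'."""
    return "skos" if filename.endswith("skos.xml") else None


def detect_type(filename: str, xml_text: str) -> Optional[str]:
    """Content first, filename second, otherwise undetermined (None)."""
    return detect_by_content(xml_text) or detect_by_filename(filename)


if __name__ == "__main__":
    sample = (
        f'<rdf:RDF xmlns:rdf="{RDF_NS}" xmlns:skos="{SKOS_NS}">'
        '<skos:Concept rdf:about="http://example.org/c1"/>'
        "</rdf:RDF>"
    )
    print(detect_type("thesaurus.xml", sample))            # detected by content
    print(detect_type("animals-skos.xml", "<document/>"))  # filename fallback
    print(detect_type("index.xml", "<document/>"))         # undetermined
```

With this ordering, a SKOS document is recognised whatever it is called, and the filename convention only matters for files whose content gives no answer.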