Return-Path: Delivered-To: apmail-cocoon-users-archive@www.apache.org Received: (qmail 38281 invoked from network); 8 Sep 2009 07:01:44 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 8 Sep 2009 07:01:44 -0000 Received: (qmail 18428 invoked by uid 500); 8 Sep 2009 07:01:43 -0000 Delivered-To: apmail-cocoon-users-archive@cocoon.apache.org Received: (qmail 18350 invoked by uid 500); 8 Sep 2009 07:01:43 -0000 Mailing-List: contact users-help@cocoon.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: List-Post: Reply-To: users@cocoon.apache.org List-Id: Delivered-To: mailing list users@cocoon.apache.org Received: (qmail 18342 invoked by uid 99); 8 Sep 2009 07:01:43 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Sep 2009 07:01:43 +0000 X-ASF-Spam-Status: No, hits=-2.8 required=10.0 tests=RCVD_IN_DNSWL_MED,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [64.18.2.165] (HELO exprod7og106.obsmtp.com) (64.18.2.165) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 08 Sep 2009 07:01:32 +0000 Received: from source ([209.85.218.210]) by exprod7ob106.postini.com ([64.18.6.12]) with SMTP ID DSNKSqYBNoD3r6bgiGtzHcXBAF9GgHAc3KMT@postini.com; Tue, 08 Sep 2009 00:01:12 PDT Received: by mail-bw0-f210.google.com with SMTP id 6so2457767bwz.36 for ; Tue, 08 Sep 2009 00:01:10 -0700 (PDT) Received: by 10.103.126.33 with SMTP id d33mr6474251mun.109.1252393270345; Tue, 08 Sep 2009 00:01:10 -0700 (PDT) Received: from ?192.168.1.21? ([212.241.50.201]) by mx.google.com with ESMTPS id i5sm2021543mue.46.2009.09.08.00.01.08 (version=TLSv1/SSLv3 cipher=RC4-MD5); Tue, 08 Sep 2009 00:01:08 -0700 (PDT) Message-ID: <4AA60133.7030708@onehippo.com> Date: Tue, 08 Sep 2009 09:01:07 +0200 From: Jeroen Reijn Organization: Hippo User-Agent: Thunderbird 2.0.0.23 (X11/20090817) MIME-Version: 1.0 To: users@cocoon.apache.org Subject: Re: how-to query an xml repository efficiently References: <7C655C04B6F59643A1EF66056C0E095E02A3B6C9@eusex01.sweden.ecsoft> In-Reply-To: <7C655C04B6F59643A1EF66056C0E095E02A3B6C9@eusex01.sweden.ecsoft> Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org Hi Robby, do you perhaps have any more specs on what kind of XML database it is? At our company we have experience with an Apache Slide backed database, which we used for storing XML files and let Slide indexed them with Lucene. Then based on DASL queries we could search the repository really quickly. Next to DASK I know there are also XML databases that can use XQueries to perform fast searches on their XML database. Regards, Jeroen Robby Pelssers wrote: > Hi all, > > > > I have following use case. The customer has an xml repository which is > nothing more then a directory on filesystem which contains > subdirectories containing one or more xml files. They now want to query > those xml files on some predefined criteria which might change over time� > > > > I�m looking for a solution which results in high performance search and > some things that came to my mind was > > � extracting information and storing them in a database (e.g. > HSQLDB) > > � using lucene > > > > Is there somewhere detailed documentation available on using these? And > what would you recommend for my use case? > > > > I already found some stuff but no real quick-start material. > > http://cocoon.apache.org/2.1/userdocs/concepts/xmlsearching.html > > http://cocoon.apache.org/2.2/blocks/hsqldb-client/1.0/ > > http://cocoon.apache.org/2.2/blocks/hsqldb-server/1.0/ > > > > Thx in advance, > > Robby Pelssers > > > > > --------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscribe@cocoon.apache.org For additional commands, e-mail: users-help@cocoon.apache.org