Subject: AW: Workspace.copy() Question ...
Date: Thu, 13 Nov 2008 11:07:02 +0100
From: KÖLL Claus

Hi Thomas,

>Instead of returning an InputStream, Jackrabbit would return a
>DataStoreInputStream with the additional method getDataIdentifier().
>Then the module can read the identifier, check if the item is already
>processed, and avoid reading the data itself if this identifier is
>already processed.

What exactly does this mean? Would you store the data identifier in the
index, and therefore in every module? And what would you do when a node is
copied internally? The data store should know that it does not need to read
the binary, so that an extra read and write to the data store is avoided.

>For text extraction, a separate
>file may make sense, but probably not for 'virus scan' because that's
>only a flag (you don't need the data). Thumbnails: for better
>performance you want to keep them together, and not save them
>separately (that is, in the data store).

Can you explain this a little more? I don't see what virus scanning and
thumbnails have to do with this problem. I'm afraid I can't follow your
reasoning here.

Greets,
Claus
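
To make the quoted proposal concrete, here is a minimal Java sketch of how a module could use such a stream. Everything in it is an assumption for illustration, not existing Jackrabbit API: DataStoreInputStream and getDataIdentifier() are only the names proposed in the thread, the identifier is modelled as a plain String, and ScanModule is a hypothetical stand-in for an indexer or virus scanner. It also assumes that an internal copy of a binary yields the same identifier as the original.

```java
import java.io.ByteArrayInputStream;
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.HashSet;
import java.util.Set;

// Hypothetical stream from the proposal: an ordinary InputStream that also
// exposes the identifier of the data store record behind it.
class DataStoreInputStream extends FilterInputStream {
    private final String dataIdentifier;
    private long bytesRead = 0; // demo only: shows whether the data was touched

    DataStoreInputStream(InputStream in, String dataIdentifier) {
        super(in);
        this.dataIdentifier = dataIdentifier;
    }

    String getDataIdentifier() {
        return dataIdentifier;
    }

    long getBytesRead() {
        return bytesRead;
    }

    @Override
    public int read() throws IOException {
        int b = super.read();
        if (b >= 0) {
            bytesRead++;
        }
        return b;
    }

    @Override
    public int read(byte[] buf, int off, int len) throws IOException {
        int n = super.read(buf, off, len);
        if (n > 0) {
            bytesRead += n;
        }
        return n;
    }
}

// Hypothetical module (indexer, virus scanner, ...) that remembers which
// identifiers it has already handled.
class ScanModule {
    private final Set<String> processed = new HashSet<>();

    // Returns true if the content was read, false if it was skipped.
    boolean process(InputStream in) throws IOException {
        String id = null;
        if (in instanceof DataStoreInputStream) {
            id = ((DataStoreInputStream) in).getDataIdentifier();
            if (processed.contains(id)) {
                return false; // known identifier: do not read the data again
            }
        }
        byte[] buf = new byte[8192];
        while (in.read(buf) != -1) {
            // a real module would scan or index the bytes here
        }
        if (id != null) {
            processed.add(id); // remember only after a successful read
        }
        return true;
    }
}

class DataIdentifierDemo {
    public static void main(String[] args) throws IOException {
        byte[] data = "some binary content".getBytes(StandardCharsets.UTF_8);
        ScanModule module = new ScanModule();

        // Original node and its copy are assumed to share one identifier.
        DataStoreInputStream original =
                new DataStoreInputStream(new ByteArrayInputStream(data), "id-1");
        DataStoreInputStream copy =
                new DataStoreInputStream(new ByteArrayInputStream(data), "id-1");

        System.out.println("original read: " + module.process(original));
        System.out.println("copy read: " + module.process(copy));
        System.out.println("bytes read from copy: " + copy.getBytesRead());
    }
}
```

In this sketch each module keeps its own set of processed identifiers, which is one possible answer to the question of where the identifier would be stored. Whether that state lives in the index, in the module, or elsewhere is left open in the thread.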