Return-Path: X-Original-To: apmail-jackrabbit-users-archive@minotaur.apache.org Delivered-To: apmail-jackrabbit-users-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1D2DDE9E8 for ; Mon, 11 Feb 2013 13:41:12 +0000 (UTC) Received: (qmail 40450 invoked by uid 500); 11 Feb 2013 13:41:11 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 40304 invoked by uid 500); 11 Feb 2013 13:41:11 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 40294 invoked by uid 99); 11 Feb 2013 13:41:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Feb 2013 13:41:11 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of laeubi@googlemail.com designates 209.85.215.178 as permitted sender) Received: from [209.85.215.178] (HELO mail-ea0-f178.google.com) (209.85.215.178) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Feb 2013 13:41:05 +0000 Received: by mail-ea0-f178.google.com with SMTP id a14so2826771eaa.9 for ; Mon, 11 Feb 2013 05:40:44 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=20120113; h=x-received:message-id:date:from:user-agent:mime-version:to:subject :references:in-reply-to:content-type:content-transfer-encoding; bh=ZdOm/YJCyL/1HZ2xnD1ibifozPmvNT9rlYHFD3010fM=; b=E+pK67R7xq59z02lDYO7/CDclk22g/JaGdjEtgMdYDDTqbDoXt19uv+oCwOfCTr/Dk 3YzmJuF6doEQA97KV9e8Qgqe/5QbeTbOe6+J8KJ7rKe3FNuNXhmZeOQMvdnMevszYl3K 4hAY6h6TU9KNQC2LLd0eBl3l3ouj45EKO07vtIC38Wf5yW5tLAELwtTgKU2rfriDk6e/ JArwtgMfa4x3p/p/WHvHrhD7SwlxPq3eHm60wHIOzVPiMrNWeKPu9oTC2+wa1pZCT86q zEydWMiXH9nNMwPNcVYj4u6pppgAsFIcXn6XLg3E7CAymMPaG2pt79J5m0sJR63qz3+w 75DQ== X-Received: by 10.14.3.133 with SMTP id 5mr17014654eeh.43.1360590043986; Mon, 11 Feb 2013 05:40:43 -0800 (PST) Received: from [192.168.178.42] (p5794F8CC.dip.t-dialin.net. [87.148.248.204]) by mx.google.com with ESMTPS id d47sm3788321eem.9.2013.02.11.05.40.42 (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Mon, 11 Feb 2013 05:40:43 -0800 (PST) Message-ID: <5118F4DA.30701@googlemail.com> Date: Mon, 11 Feb 2013 14:40:42 +0100 From: =?UTF-8?B?Q2hyaXN0b3BoIEzDpHVicmljaA==?= User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.16) Gecko/20121215 Iceowl/1.0b1 Icedove/3.0.11 MIME-Version: 1.0 To: users@jackrabbit.apache.org Subject: Re: Is Jackrabbit suitable for storing lots of large files References: <5118E8F5.3030407@googlemail.com> <7628B7424DEF784CA2ECB07668F69CF4BE2E1280@S-HQMX8.pmbelz.de> In-Reply-To: <7628B7424DEF784CA2ECB07668F69CF4BE2E1280@S-HQMX8.pmbelz.de> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org Hi Robert, thanks for your reply, I think it is suitable in my usecase to disable the index at all since full-text-search will not be required. I'll try to get the Datastore up and running, I'm only do not want to create my application and find out that above some limit of overall storage Jackrabbit stop working or something because JCR and/or Jackrabbit is the wrong technology at all to be used as a meta-filestore. Regards Christoph Am 11.02.2013 14:28, schrieb Seidel. Robert: > Hi, > > storing is not the problem, cause this is all done by streaming. But you can encounter problems if you want to index such data, because Lucene holds all tokens for a file in memory (no streaming here). > The default configuration stores 10K tokens max. per property (see maxFieldLength in http://wiki.apache.org/jackrabbit/Search). > But this can be real frustrating if the 10001. token is searched - it is also not very transparent for the user. > If you increase this value, you need more memory. > > Imho you have to decide to index all tokens (with enough memory) or nothing for this data. > > Regards, Robert > > -----Ursprüngliche Nachricht----- > Von: Bertrand Delacretaz [mailto:bdelacretaz@apache.org] > Gesendet: Montag, 11. Februar 2013 13:59 > An: users@jackrabbit.apache.org > Betreff: Re: Is Jackrabbit suitable for storing lots of large files > > Hi, > > On Mon, Feb 11, 2013 at 1:49 PM, Christoph Läubrich wrote: > >> I read the performance doc here >> http://wiki.apache.org/jackrabbit/Performance but did not find an answer: >> Is Jackrabbit suitable for storing lots of files (around 100GB) with >> each file around 2-200MB? >> > As usual with performance you'll need to do your own tests, but that shouldn't be a problem if you use the datastore [1] to store the binary content. > > -Bertrand > > [1] http://wiki.apache.org/jackrabbit/DataStore > ________________________________ > > Treffen Sie AEB vom 19.-21. Februar 2013 auf der LogiMAT in Stuttgart. Halle 5, Stand 261. > Vereinbaren Sie jetzt einen Termin und Sie erhalten eine Eintrittskarte. > www.aeb.de/logimat >