Return-Path: Delivered-To: apmail-jackrabbit-users-archive@locus.apache.org Received: (qmail 8126 invoked from network); 15 Dec 2006 22:25:47 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 15 Dec 2006 22:25:47 -0000 Received: (qmail 59144 invoked by uid 500); 15 Dec 2006 22:25:52 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 59128 invoked by uid 500); 15 Dec 2006 22:25:52 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 59118 invoked by uid 99); 15 Dec 2006 22:25:52 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 15 Dec 2006 14:25:52 -0800 X-ASF-Spam-Status: No, hits=3.5 required=10.0 tests=HTML_10_20,HTML_MESSAGE,NO_REAL_NAME X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: local policy) Received: from [66.219.42.197] (HELO mail.21technologies.com) (66.219.42.197) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 15 Dec 2006 14:25:42 -0800 To: users@jackrabbit.apache.org Subject: Storing data in a non-public schema and a postgres persistance manager MIME-Version: 1.0 X-Mailer: Lotus Notes Release 7.0.1 January 17, 2006 Message-ID: From: jdente@21technologies.com Date: Fri, 15 Dec 2006 16:10:57 -0600 X-MIMETrack: Serialize by Router on Austin/21ct(Release 7.0.1|January 17, 2006) at 12/15/2006 04:11:07 PM, Serialize complete at 12/15/2006 04:11:07 PM Content-Type: multipart/alternative; boundary="=_alternative 007B2BBE86257245_=" X-Virus-Checked: Checked by ClamAV on apache.org --=_alternative 007B2BBE86257245_= Content-Type: text/plain; charset="US-ASCII" Hi, I have two questions: First, is there a way to configure Jackrabbit to store data someplace other than the default public schema of a database? The "schema" argument in the repository configuration file refers to the type of schema. The only way I can control where data goes in the database is by changing the schema object prefix. It would be nice if I could setup different schemas within the same database for different repositories that I have setup for testing. Second, I've continued playing around with Jackrabbit performance under PostgreSQL. Because Jackrabbit uses the bytea data type to store blobs in the database, it performs poorly creating a giant memory footprint that is dependent on the size of the data that is being put into the database. This memory footprint requires a JVM heap size of anywhere from 4 times to 8 times the size of the data being loaded or stored. There are several articles that can be found about postgres' problems with blobs and the memroy consumption that is a result of using the bytea data type. It appears that these memory problems can be avoided if I use the LargeObject API instead of the bytea data type to represent my blobs (see http://jdbc.postgresql.org/documentation/82/binary-data.html), although I have not tried it yet. I am considering building a PostgresPersistenceManager by extending SimpleDBPersistenceManager that will use LargeObjects instead of bytea. Has anybody tried using LargeObjects with Jackrabbit instead of bytea? Is there a reason this approach won't work? Why did Jackrabbit use bytea to begin with? Thanks for your help, Joe. --=_alternative 007B2BBE86257245_=--