db-derby-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sedillo, Derek \(Mission Systems\)" <Derek.Sedi...@ngc.com>
Subject RE: Large multi-record insert performance
Date Fri, 16 Mar 2007 15:58:34 GMT

The goal is to find the most efficient/optimized way to insert large
amounts of data into Derby.  For example in working as an Oracle DBA I
have discovered that I can bulk load data from ProC using an array of C
Structures in one insert statement like this:

VALUES (:tmp_ssj_data);  // Where tmp_ssj_data is an array (100s or
1000s) of structured records 

This approach greatly enhances performance for large data inserts which
we perform regularly. My original question is how can I do something
similar with Derby.

While I realize there are 'basic' constructs for performing a task, they
are not 'normally' optimal.  For example performing 1000 separate
transactions is less efficient than one.  See Daniel's numbers below.

Along these lines I did some testing for our own application.

Cached PreparedStatement but one transaction per insert:
100 tags added in 399ms
100 tags removed in 160ms
1000 tags added in 1163ms
1000 tags removed in 873ms
10000 tags added in 6094ms
10000 tags removed in 6840ms
100000 tags added in 58563ms
100000 tags removed in 67342ms

All in one transaction using executeUpdate():
100 tags added in 274ms
100 tags removed in 70ms
1000 tags added in 299ms
1000 tags removed in 250ms
10000 tags added in 1605ms
10000 tags removed in 1500ms
100000 tags added in 14441ms
100000 tags removed in 19721ms

All in one transaction using addBatch()/executeBatch():
100 tags added in 290ms
100 tags removed in 76ms
1000 tags added in 316ms
1000 tags removed in 258ms
10000 tags added in 1621ms
10000 tags removed in 1927ms
100000 tags added in 14971ms
100000 tags removed in 19320ms 

- Derek

-----Original Message-----
From: Dyre.Tjeldvoll@Sun.COM [mailto:Dyre.Tjeldvoll@Sun.COM] 
Sent: Friday, March 16, 2007 9:00 AM
To: Derby Discussion
Subject: Re: Large multi-record insert performance

I haven't followed this thread closely, but is there a reason why you
cannot do:

PreparedStatement ps =
conn.prepareStatement("insert into T values (?, ?, ?)");

for (int i = 0; i < 1000; ++i) {
    ps.setX(1, <whatever>);
    ps.setX(2, <whatever>); 
    ps.setX(3, <whatever>);



View raw message