directory-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alex Karasulu <aok...@bellsouth.net>
Subject Re: [ApacheDS] [Performance] Using indices to boost search performance
Date Sun, 03 Sep 2006 22:50:17 GMT
Ajay Upadhyaya wrote:
> Could you share the steps needed to do the indexing, or point to any 
> existing doc... I'm relatively new to ADS,

Sure no problem.  I wrote up a little documentation here on adding 
indices.  I just added this page just now very rapidly so it might be 
sparse.

http://docs.safehaus.org/display/APACHEDS/Performance+Tuning

Alex

> Thanks,
> Ajay
> 
> On 9/1/06, *Alex Karasulu * <aok123@bellsouth.net 
> <mailto:aok123@bellsouth.net>> wrote:
> 
>     Hi all,
> 
>     I've been testing the search performance boost gained from indexing
>     attributes before starting development on an optimization to improve
>     index performance and in memory size.  I thought I'd share these
>     dramatic results of my pre-optimization tests with you since they
>     clearly show the benefits of indices.
> 
>     Before I do this let me list the characteristics of the hardware used
>     and my configuration settings:
> 
> 
>     -------------
>     Machine Setup
>     -------------
> 
>     CPU: Dual Athlon MP 1900
>     OS: Linux 2.6.15
>     Mem: 2GB 266Mhz
> 
> 
>     --------------
>     ApacheDS Setup
>     --------------
> 
>     ApacheDS: Stock RC4 (to be released pre-image) w/ modifications
>        - Using 1024MB Memory
>        - Indexed st and initials
> 
> 
>     ----------
>     Data Setup
>     ----------
> 
>     Wrote a simple tool to generate random values for descent sized entries.
>       The data sort of looks like this for a user entry:
> 
>             dn: uid=user.1,ou=users,dc=example,dc=com
>             uid: user.1
>             initials: yq
>             description: cFocJATNuhlXisDCqGtY
>             pager: FYyimqyZRW
>             cn: HSGMzajYKmicUTe
>             postalcode: WiXXA
>             st: xy
>             street: kpCCqmrsCzkpdtHXWMfY
>             l: KqmAXFYTrI
>             objectclass: person
>             objectclass: organizationalPerson
>             objectclass: inetOrgPerson
>             sn: nuymgOwpm
>             homephone: PERamkCtsv
>             mobile: vkIviOGNTC
>             telephonenumber: 7248889026
>             mail: pYvEoOjSnEymcWD
>             givenname: IVHJZB
>             postaladdress: crObexKoUTIFdzNHcZMr
>             employeenumber: 1
>             userpassword:: cGFzc3dvcmQ=
> 
>     I started loading a partition up with these entries 100,000 of them at a
>     time then performing the following searches for all entries with
>     initials aa:
> 
>     (1) index on initials but no cached entries
>     (2) index on initials with cached entries
>     (3) no index without cached entries
> 
>     Here are the results at the various capacities:
> 
>     ---------------
>     100,000 Entries
>     ---------------
> 
>          [cached] [indexed] [time (seconds)]
> 
>     (1)    no       yes         3.30
>     (2)    yes      yes        0.72
>     (3)    no       no         30.63
> 
>     search results = 153 entries
> 
> 
>     ---------------
>     200,000 Entries
>     ---------------
> 
>          [cached] [indexed] [time (seconds)]
> 
>     (1)    no       yes        6.04
>     (2)    yes      yes        1.44
>     (3)    no       no         82
> 
>     search results = 302 entries
> 
> 
>     ---------------
>     300,000 Entries
>     ---------------
> 
>          [cached] [indexed] [time (seconds)]
> 
>     (1)    no       yes        7.54
>     (2)    yes      yes        1.95
>     (3)    no       no         146
> 
>     search results = 451 entries
> 
> 
>     ---------------
>     400,000 Entries
>     ---------------
> 
>          [cached] [indexed] [time (seconds)]
> 
>     (1)    no       yes        9.24
>     (2)    yes      yes        3.80
>     (3)    no       no         196
> 
>     search results = 586 entries
> 
> 
>     ---------------
>     500,000 Entries
>     ---------------
> 
>          [cached] [indexed] [time (seconds)]
> 
>     (1)    no       yes        11.96
>     (2)    yes      yes        3.21
>     (3)    no       no         224
> 
>     search results = 748 entries
> 
> 
>     Alex
> 
> 
> 


Mime
View raw message