directory-dev mailing list archives

From "Emmanuel Lecharny (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DIRSERVER-1965) An Index should speed up searches starting with '*'
Date Thu, 17 Jul 2014 14:13:04 GMT

    [ https://issues.apache.org/jira/browse/DIRSERVER-1965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064947#comment-14064947 ]

Emmanuel Lecharny commented on DIRSERVER-1965:
----------------------------------------------

Some other LDAP servers index triplets (trigrams). For instance, a sentence like
"Hello World!" will be indexed using the following triplets:
'hel', 'ell', 'llo', 'lo ', 'o w'...

For an N-character sentence, this creates N-2 entries in the index, which is very expensive.
OTOH, it supports every kind of substring search.
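
To make the triplet idea concrete, here is a minimal sketch in Python. It is not how ApacheDS or any particular LDAP server implements it; the names `trigrams` and `TrigramIndex` are made up for illustration, and values are simply lowercased rather than normalized by a matching rule.

```python
from collections import defaultdict


def trigrams(value):
    """Return the overlapping 3-character windows of the lowercased value."""
    text = value.lower()
    return [text[i:i + 3] for i in range(len(text) - 2)]


class TrigramIndex:
    """Hypothetical in-memory triplet index: trigram -> ids of entries containing it."""

    def __init__(self):
        self._postings = defaultdict(set)
        self._values = {}

    def add(self, entry_id, value):
        self._values[entry_id] = value
        for gram in trigrams(value):
            self._postings[gram].add(entry_id)

    def search(self, needle):
        """Return the sorted ids of entries whose value contains needle."""
        needle = needle.lower()
        grams = trigrams(needle)
        if not grams:
            # Fewer than 3 characters: the index cannot help, so scan everything.
            candidates = set(self._values)
        else:
            # Only entries containing every trigram of the needle can match.
            candidates = set.intersection(
                *(self._postings.get(gram, set()) for gram in grams)
            )
        # Verify each candidate, since the trigrams may occur non-contiguously.
        return sorted(i for i in candidates if needle in self._values[i].lower())
```

For "Hello World!" (12 characters), `trigrams` yields 10 entries, which is where the index cost comes from. A filter such as `*678` or `*456*` becomes a lookup of the needle's trigrams followed by a check of the few remaining candidates.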

We might want to evaluate the pros and cons of such an approach.

> An Index should speed up searches starting with '*'
> ---------------------------------------------------
>
>                 Key: DIRSERVER-1965
>                 URL: https://issues.apache.org/jira/browse/DIRSERVER-1965
>             Project: Directory ApacheDS
>          Issue Type: Improvement
>          Components: ldap
>    Affects Versions: 2.0.0-M16
>         Environment: NA
>            Reporter: Ernst Bech
>            Assignee: Kiran Ayyagari
>            Priority: Blocker
>
> As of now an index will speed up searches for an exact match string and a substring ending with a "*".
> It does NOT currently speed up searches starting with a "*", which is what we need to be implemented.
> Example: in our unified messaging application - when we receive an incoming call - we have to find a user whose telephone number ends with the digits signaled to us by the telephone network.
> Let's say a user has a telephone number +49(777)12345678. The telephone network only signals its extension 678. We thus search for *678 to get a list of possible users and by applying some extra magic we are able to pick the right one.
> The problem is: we need to do this fast, because taking the call depends on it.
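
For the specific `*678` case described in the issue, one commonly used alternative to triplets is an index over the reversed values, so that a search for a trailing substring becomes a prefix range scan. The sketch below only illustrates that idea and is an assumption, not something proposed in this thread or present in ApacheDS; `SuffixIndex` and `ends_with` are hypothetical names, and phone numbers are compared as raw strings.

```python
import bisect


class SuffixIndex:
    """Hypothetical index: sorted (reversed value, entry id) pairs."""

    def __init__(self):
        self._keys = []

    def add(self, entry_id, value):
        # Store the value reversed so that a suffix becomes a prefix.
        bisect.insort(self._keys, (value[::-1], entry_id))

    def ends_with(self, suffix):
        """Return the sorted ids of entries whose value ends with suffix."""
        prefix = suffix[::-1]
        # The 1-tuple sorts before every pair whose key starts with prefix.
        start = bisect.bisect_left(self._keys, (prefix,))
        result = []
        for key, entry_id in self._keys[start:]:
            if not key.startswith(prefix):
                break  # Keys are sorted, so no later key can match.
            result.append(entry_id)
        return sorted(result)
```

This costs one extra index entry per value instead of N-2, but it only helps filters of the form `*abc`, not `*abc*`.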



--
This message was sent by Atlassian JIRA
(v6.2#6252)
