incubator-jena-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andy Seaborne (JIRA)" <>
Subject [jira] [Commented] (JENA-44) Support external sorting of bindings in ARQ
Date Mon, 16 May 2011 20:16:47 GMT


Andy Seaborne commented on JENA-44:

a few questions:

1 - what can be shared between JENA-45 (spill to disk update)?  Code for to/from disk? 

JENA-45 uses SSE for tuples parsing (but what about bnodes?_
JENA-44 has BindingSerializer.

Is there anything in common?

What about the deferred buffering?

2 - Control of resources (part 1)

JENA-44 reads a symbol ; JENA-45 has a ThresholdPolicyCount.  Do we need a combination?

3 - Generally, RAM is a system-wide resource used for caching in TDB/32. Do we need to have
something adaptive?

> Support external sorting of bindings in ARQ
> -------------------------------------------
>                 Key: JENA-44
>                 URL:
>             Project: Jena
>          Issue Type: New Feature
>          Components: ARQ
>            Reporter: Sam Tunnicliffe
>            Assignee: Paolo Castagna
>            Priority: Minor
>         Attachments: JENA-44-0.patch, JENA-44_ARQ_r8531.patch, JENA-44_ARQ_r8724.patch
> In QueryIterSort, the sorting of the contents of an Iterator<Binding> is done in
memory, using Arrays.sort. This can be problematic where the set to be sorted is large. A
possible solution could be to use an external, disk-backed algorithm. A hybrid approach may
be better, whereby we attempt the in-memory sort, but when the number of bindings encountered
goes over a certain number, resort to the disk-backed variant.

This message is automatically generated by JIRA.
For more information on JIRA, see:

View raw message