ignite-issues mailing list archives

From "Ewan (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (IGNITE-11928) keep data of same primary index on same node
Date Sun, 16 Jun 2019 02:55:00 GMT

     [ https://issues.apache.org/jira/browse/IGNITE-11928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ewan updated IGNITE-11928:
    Issue Type: Improvement  (was: Wish)

> keep data of same primary index on same node
> --------------------------------------------
>                 Key: IGNITE-11928
>                 URL: https://issues.apache.org/jira/browse/IGNITE-11928
>             Project: Ignite
>          Issue Type: Improvement
>          Components: data structures, persistence
>    Affects Versions: None
>            Reporter: Ewan
>            Priority: Minor
>             Fix For: None
> I searched extensively and found very few documents that explain how Ignite indexes data and
how it uses indices. What I hope Ignite can offer is something like the partition key in Cassandra,
which the database engine uses to determine which node in the cluster holds the row(s)
of a table. Because the partition key determines the node where all rows with the same partition
key are stored, it dramatically reduces the interaction between Ignite nodes for a query,
especially when the query is confined to the data of a single partition key.
> For example, take a table of company profits by day, with three columns:
> day: primary key / partition key
> company id: secondary key / column key
> profit: value of the row
> Query:
> select * from table where day = '2019-01-01';
> In Cassandra, the query touches only one node to fetch all the data. The partition
key comes with a prerequisite: the users or programmers must ensure that all the data with the
same partition key is small enough to be stored on one node of the cluster.
> But in Ignite, the same query may touch the whole cluster, since each node in
the cluster stores only a portion of the data with that primary key. This prevents Ignite's performance
from scaling linearly as more nodes are added to the cluster. I personally think
linear performance scaling is one of the key features a distributed database should have.
> I would like to recommend that Ignite add an option/setting that lets users decide whether
all rows with the same primary key are stored on the same node. Thanks.
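
The difference between the two placement strategies described in the issue can be illustrated with a small model. This is only a sketch, not Ignite's or Cassandra's actual partitioning code: the cluster size, the CRC32-based hash, and the names `node_for` and `nodes_touched` are assumptions made for illustration. It places the rows of the example table either by the `day` column alone (partition-key style) or by the full `(day, company id)` key, then counts how many nodes a query for a single day would have to contact.

```python
import zlib

NODES = 4  # hypothetical cluster size, chosen only for this illustration


def node_for(key_parts, nodes=NODES):
    """Map a tuple of key columns to a node index using a stable hash."""
    raw = "|".join(str(part) for part in key_parts).encode("utf-8")
    return zlib.crc32(raw) % nodes


def nodes_touched(rows, day, by_partition_key):
    """Return the set of nodes that hold rows for `day` under the chosen placement."""
    touched = set()
    for row_day, company_id, _profit in rows:
        if row_day != day:
            continue
        # Partition-key placement hashes only the day; the alternative
        # hashes the whole (day, company id) key.
        key = (row_day,) if by_partition_key else (row_day, company_id)
        touched.add(node_for(key))
    return touched


# Made-up sample data: 50 companies on each of two days.
rows = [("2019-01-01", cid, 100.0 + cid) for cid in range(50)]
rows += [("2019-01-02", cid, 90.0 + cid) for cid in range(50)]

colocated = nodes_touched(rows, "2019-01-01", by_partition_key=True)
scattered = nodes_touched(rows, "2019-01-01", by_partition_key=False)

print("nodes touched when placed by day:", len(colocated))
print("nodes touched when placed by (day, company id):", len(scattered))
```

In this model, placement by `day` always resolves the query to a single node, while placement by the full key spreads the rows for one day over several nodes, which is the behaviour the issue asks to make configurable.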

This message was sent by Atlassian JIRA
