cassandra-commits mailing list archives

From "Geoffrey Yu (JIRA)" <>
Subject [jira] [Updated] (CASSANDRA-12367) Add an API to request the size of a CQL partition
Date Thu, 25 Aug 2016 20:52:20 GMT


Geoffrey Yu updated CASSANDRA-12367:
    Attachment: 12367-trunk-v2.txt

I've attached another patch that adds a new CQL statement, as described in the ticket,
for some early feedback on the approach. It's implemented as a new statement because the
semantics of what it is meant to do did not fit well with the existing {{SELECT}} statement.

cqlsh> SELECT SIZE FROM demo.test WHERE type = 'person';

 endpoint  | size (bytes)
-----------+--------------
           |          338
           |          338

(2 rows)

The statement needs to be restricted to a single partition, and returns one row per replica
queried, which depends on the consistency level (here it was {{ALL}} on a keyspace with RF=2,
hence the two rows).

Could we use {{SSTableReader.getScanner(Range<Token> range, ...)}} instead of scanning all
the partitions in the sstable? We would need to create the range so that it includes the
requested token, but I think it should save us some time by seeking directly to the correct position.
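
As a rough illustration of why a ranged lookup could beat a full scan, here is a minimal standalone sketch. It is not Cassandra code: a {{TreeMap}} keyed by token stands in for an sstable's sorted partitions, and the class and method names ({{SeekVsScanSketch}}, {{sizeByFullScan}}, {{sizeByRange}}) are hypothetical. It assumes the range is left-exclusive and right-inclusive.

```java
import java.util.Map;
import java.util.NavigableMap;
import java.util.TreeMap;

// Illustrative model only: a TreeMap keyed by token stands in for an sstable's
// sorted partitions. None of these names are Cassandra APIs.
final class SeekVsScanSketch {

    /** Walks partitions in token order until one matches; returns -1 if the token is absent. */
    static long sizeByFullScan(NavigableMap<Long, Long> sizeByToken, long token) {
        for (Map.Entry<Long, Long> entry : sizeByToken.entrySet()) {
            if (entry.getKey() == token) {
                return entry.getValue();
            }
        }
        return -1L;
    }

    /** Seeks straight to the range (leftExclusive, rightInclusive]; returns -1 if it is empty. */
    static long sizeByRange(NavigableMap<Long, Long> sizeByToken,
                            long leftExclusive, long rightInclusive) {
        Map.Entry<Long, Long> first =
            sizeByToken.subMap(leftExclusive, false, rightInclusive, true).firstEntry();
        return first == null ? -1L : first.getValue();
    }

    public static void main(String[] args) {
        NavigableMap<Long, Long> sizeByToken = new TreeMap<>();
        sizeByToken.put(-5L, 100L);
        sizeByToken.put(42L, 338L);
        sizeByToken.put(99L, 7L);

        // Both lookups find the same partition; only the amount of work differs.
        System.out.println(sizeByFullScan(sizeByToken, 42L));
        System.out.println(sizeByRange(sizeByToken, 41L, 42L));
    }
}
```

The full scan touches every entry ahead of the match, while the ranged lookup relies on the sorted structure to jump to the first candidate. Whether a real scanner sees a comparable saving would depend on how the sstable index is laid out.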

Using {{SSTableReader.getScanner(Range<Token> range, ...)}} makes sense. Is there a
recommended approach for creating a small {{Range<Token>}} that will wrap the requested
token? For a {{LongToken}} it seems straightforward to just decrease the token's value slightly
to create a range, but I'm not quite sure what a reasonable approach might look like for all
the different types of tokens.
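
For the long-valued case only, the idea of decreasing the token slightly could be sketched as below. This is a standalone model, not the Cassandra {{Range}} or {{LongToken}} classes: {{TokenRangeSketch}}, {{LongRange}} and {{wrap}} are hypothetical names, and the sketch assumes a left-exclusive, right-inclusive range. It does not answer the open question about other token types.

```java
// Illustrative model only; these classes are hypothetical stand-ins,
// not Cassandra's Range<Token> or LongToken.
final class TokenRangeSketch {

    /** A range over long-valued tokens, assumed to be (leftExclusive, rightInclusive]. */
    static final class LongRange {
        final long leftExclusive;
        final long rightInclusive;

        LongRange(long leftExclusive, long rightInclusive) {
            this.leftExclusive = leftExclusive;
            this.rightInclusive = rightInclusive;
        }

        boolean contains(long token) {
            return token > leftExclusive && token <= rightInclusive;
        }
    }

    /** Builds the smallest range that contains exactly the given token: (token - 1, token]. */
    static LongRange wrap(long token) {
        if (token == Long.MIN_VALUE) {
            // token - 1 would overflow; this sketch does not model ring wraparound.
            throw new IllegalArgumentException("no predecessor for the minimum long value");
        }
        return new LongRange(token - 1, token);
    }

    public static void main(String[] args) {
        LongRange range = wrap(42L);
        System.out.println(range.contains(42L)); // the requested token is inside
        System.out.println(range.contains(41L)); // the left bound is excluded
    }
}
```

The predecessor trick works here because long values are discrete. A token type without an obvious "previous value" would need a different way to build the range.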

> Add an API to request the size of a CQL partition
> -------------------------------------------------
>                 Key: CASSANDRA-12367
>                 URL:
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Geoffrey Yu
>            Assignee: Geoffrey Yu
>            Priority: Minor
>             Fix For: 3.x
>         Attachments: 12367-trunk-v2.txt, 12367-trunk.txt
> It would be useful to have an API that we could use to get the total serialized size
> of a CQL partition, scoped by keyspace and table, on disk.

This message was sent by Atlassian JIRA