cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jonathan Ellis (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-1966) Option to control how many items are read on cache load
Date Tue, 26 Jul 2011 19:01:09 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-1966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13071284#comment-13071284
] 

Jonathan Ellis commented on CASSANDRA-1966:
-------------------------------------------

Hmm, this breaks SerializationsTest:

{noformat}
    [junit] org.apache.avro.AvroTypeException: Non-numeric default value for int: null
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.encode(ResolvingGrammarGenerator.java:364)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.encode(ResolvingGrammarGenerator.java:341)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.getBinary(ResolvingGrammarGenerator.java:285)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.resolveRecords(ResolvingGrammarGenerator.java:264)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.generate(ResolvingGrammarGenerator.java:118)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.resolveRecords(ResolvingGrammarGenerator.java:254)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.generate(ResolvingGrammarGenerator.java:118)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.generate(ResolvingGrammarGenerator.java:151)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.resolveUnion(ResolvingGrammarGenerator.java:186)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.generate(ResolvingGrammarGenerator.java:120)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.resolveRecords(ResolvingGrammarGenerator.java:254)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.generate(ResolvingGrammarGenerator.java:118)
    [junit] 	at org.apache.avro.io.parsing.ResolvingGrammarGenerator.generate(ResolvingGrammarGenerator.java:50)
    [junit] 	at org.apache.avro.io.ResolvingDecoder.resolve(ResolvingDecoder.java:76)
    [junit] 	at org.apache.avro.io.ResolvingDecoder.<init>(ResolvingDecoder.java:46)
    [junit] 	at org.apache.avro.generic.GenericDatumReader.getResolver(GenericDatumReader.java:93)
    [junit] 	at org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:103)
    [junit] 	at org.apache.cassandra.io.SerDeUtils.deserializeWithSchema(SerDeUtils.java:89)
    [junit] 	at org.apache.cassandra.db.migration.SerializationsTest.testRead(SerializationsTest.{noformat}

Can you fix?

> Option to control how many items are read on cache load
> -------------------------------------------------------
>
>                 Key: CASSANDRA-1966
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1966
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Chris Burroughs
>            Assignee: Chris Burroughs
>            Priority: Minor
>             Fix For: 1.0
>
>         Attachments: 1966-v1.txt, 1966-v2.txt
>
>
> CASSANDRA-1417 added an option to save the key and/or row cache keys which is cool. 
However, for a row large cache it can take a long time to read all of the rows.  For example
I have a 400,000 item row cache, and loading that on restart takes a little under an hour.
> In addition to configuring the size of the row cache, and how often it should be saved
to disk, I propose an option to control how many items are loaded on startup (or alternately
only saving n items out of the full row cache to begin with).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message