Return-Path: X-Original-To: apmail-cassandra-commits-archive@www.apache.org Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 89E637ED4 for ; Wed, 7 Dec 2011 23:55:02 +0000 (UTC) Received: (qmail 18576 invoked by uid 500); 7 Dec 2011 23:55:02 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 18548 invoked by uid 500); 7 Dec 2011 23:55:02 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 18540 invoked by uid 99); 7 Dec 2011 23:55:02 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Dec 2011 23:55:02 +0000 X-ASF-Spam-Status: No, hits=-2001.2 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Dec 2011 23:55:01 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 2BB7F106596 for ; Wed, 7 Dec 2011 23:54:41 +0000 (UTC) Date: Wed, 7 Dec 2011 23:54:41 +0000 (UTC) From: "Jonathan Ellis (Updated) (JIRA)" To: commits@cassandra.apache.org Message-ID: <1335706324.51534.1323302081180.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1521278267.948.1297997655371.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (CASSANDRA-2189) json2sstable fails due to OutOfMemory MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-2189: -------------------------------------- Attachment: 2189-2.txt Patch attached to move configuration to the factory. > json2sstable fails due to OutOfMemory > ------------------------------------- > > Key: CASSANDRA-2189 > URL: https://issues.apache.org/jira/browse/CASSANDRA-2189 > Project: Cassandra > Issue Type: Bug > Components: Tools > Environment: linux > Reporter: Shotaro Kamio > Assignee: Jonathan Ellis > Priority: Minor > Fix For: 0.8.9 > > Attachments: 2189-2.txt, 2189.txt > > Original Estimate: 1h > Remaining Estimate: 1h > > I have a json file created with sstable2json for a column family of super column type. Its size is about 1.9GB. (It's a dump of all keys because I cannot find out how to specify keys to dump in sstable2json.) > When I tried to create sstable from the json file, it failed with OutOfMemoryError as follows. > WARN 00:31:58,595 Schema definitions were defined both locally and in cassandra.yaml. Definitions in cassandra.yaml were ignored. > Exception in thread "main" java.lang.OutOfMemoryError: PermGen space > at java.lang.String.intern(Native Method) > at org.codehaus.jackson.util.InternCache.intern(InternCache.java:40) > at org.codehaus.jackson.sym.BytesToNameCanonicalizer.addName(BytesToNameCanonicalizer.java:471) > at org.codehaus.jackson.impl.Utf8StreamParser.addName(Utf8StreamParser.java:893) > at org.codehaus.jackson.impl.Utf8StreamParser.findName(Utf8StreamParser.java:773) > at org.codehaus.jackson.impl.Utf8StreamParser.parseLongFieldName(Utf8StreamParser.java:379) > at org.codehaus.jackson.impl.Utf8StreamParser.parseMediumFieldName(Utf8StreamParser.java:347) > at org.codehaus.jackson.impl.Utf8StreamParser._parseFieldName(Utf8StreamParser.java:304) > at org.codehaus.jackson.impl.Utf8StreamParser.nextToken(Utf8StreamParser.java:140) > at org.codehaus.jackson.map.deser.UntypedObjectDeserializer.mapObject(UntypedObjectDeserializer.java:93) > at org.codehaus.jackson.map.deser.UntypedObjectDeserializer.deserialize(UntypedObjectDeserializer.java:65) > at org.codehaus.jackson.map.deser.MapDeserializer._readAndBind(MapDeserializer.java:197) > at org.codehaus.jackson.map.deser.MapDeserializer.deserialize(MapDeserializer.java:145) > at org.codehaus.jackson.map.deser.MapDeserializer.deserialize(MapDeserializer.java:23) > at org.codehaus.jackson.map.ObjectMapper._readValue(ObjectMapper.java:1261) > at org.codehaus.jackson.map.ObjectMapper.readValue(ObjectMapper.java:517) > at org.codehaus.jackson.JsonParser.readValueAs(JsonParser.java:897) > at org.apache.cassandra.tools.SSTableImport.importUnsorted(SSTableImport.java:208) > at org.apache.cassandra.tools.SSTableImport.importJson(SSTableImport.java:197) > at org.apache.cassandra.tools.SSTableImport.main(SSTableImport.java:421) > So, what I had to is that split the json file with "split" command and modify them to be correct json file. Create sstable for each small json files. > Could you change json2sstable to avoid OutOfMemory? -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira