From: Alex Burkoff
To: "user@cassandra.apache.org"
Date: Wed, 8 Sep 2010 15:41:57 -0700
Subject: RE: ColumnFamilyOutputFormat and mapreduce.output.columnfamilyoutputformat.batch.threshold 0.70beta1

With the trunk version and the given patches, I am now getting the following exception:

10/09/08 22:39:14 WARN mapred.LocalJobRunner: job_local_0001
java.lang.ClassCastException: [B cannot be cast to java.nio.ByteBuffer
        at org.apache.cassandra.hadoop.ColumnFamilyRecordWriter.write(ColumnFamilyRecordWriter.java:68)
        at org.apache.hadoop.mapred.ReduceTask$NewTrackingRecordWriter.write(ReduceTask.java:508)
        at org.apache.hadoop.mapreduce.TaskInputOutputContext.write(TaskInputOutputContext.java:80)
        at cassandratest.Main$TReducer.reduce(Main.java:132)
        at cassandratest.Main$TReducer.reduce(Main.java:113)
        at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:176)
        at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:566)
        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:408)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
10/09/08 22:39:14 INFO mapred.JobClient: map 100% reduce 0%

Alex.
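For anyone reading the trace: "[B" is the JVM's name for byte[], so the cast inside ColumnFamilyRecordWriter.write fails because a byte[] arrives where a java.nio.ByteBuffer is expected. The sketch below only illustrates that mismatch and one possible workaround (wrapping the bytes with ByteBuffer.wrap). The class ByteBufferKeyDemo and its methods writeKey and toKey are hypothetical stand-ins, not the Cassandra or Hadoop API, and it is an assumption that the reducer in Main.java emits a byte[] key.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

public class ByteBufferKeyDemo {

    // Hypothetical stand-in for a writer that casts its key to ByteBuffer,
    // as the stack trace suggests. This is NOT the real ColumnFamilyRecordWriter.
    public static int writeKey(Object key) {
        ByteBuffer buffer = (ByteBuffer) key; // throws ClassCastException for a byte[]
        return buffer.remaining();            // number of readable bytes in the key
    }

    // Hypothetical helper: build a ByteBuffer key from a row key string.
    public static ByteBuffer toKey(String rowKey) {
        return ByteBuffer.wrap(rowKey.getBytes(StandardCharsets.UTF_8));
    }

    public static void main(String[] args) {
        byte[] raw = "row1".getBytes(StandardCharsets.UTF_8);

        // Passing the raw byte[] reproduces the kind of failure seen in the trace.
        try {
            writeKey(raw);
        } catch (ClassCastException e) {
            System.out.println("byte[] key rejected: " + e.getMessage());
        }

        // Wrapping the same bytes in a ByteBuffer satisfies the cast.
        System.out.println("ByteBuffer key accepted, bytes: " + writeKey(ByteBuffer.wrap(raw)));
    }
}
```

If the assumption holds, the equivalent change in a job would be to emit ByteBuffer.wrap(keyBytes) from the reducer instead of the raw array, and to declare the output key type accordingly.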
________________________________________
From: Jonathan Ellis [jbellis@gmail.com]
Sent: Wednesday, September 08, 2010 2:26 PM
To: user@cassandra.apache.org
Subject: Re: ColumnFamilyOutputFormat and mapreduce.output.columnfamilyoutputformat.batch.threshold 0.70beta1

Try the patches on https://issues.apache.org/jira/browse/CASSANDRA-1434
(or wait until they're committed to trunk, then try a nightly build)

On Wed, Sep 8, 2010 at 4:18 PM, Alex Burkoff wrote:
> Guys,
>
> I was testing ColumnFamilyOutputFormat and found that only columns from the last Reduce
> invocation get stored when mapreduce.output.columnfamilyoutputformat.batch.threshold has
> the default value. Setting it to 1 changes the behavior, and all data is stored then. Is it the
> intended behavior, or am I missing something?
>
> Best regards,
>
> Alex Burkoff
>

--
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com