Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1B28A9773 for ; Tue, 27 Sep 2011 14:34:08 +0000 (UTC) Received: (qmail 43957 invoked by uid 500); 27 Sep 2011 14:34:05 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 43932 invoked by uid 500); 27 Sep 2011 14:34:05 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 43924 invoked by uid 99); 27 Sep 2011 14:34:05 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 27 Sep 2011 14:34:05 +0000 X-ASF-Spam-Status: No, hits=0.7 required=5.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [64.6.108.239] (HELO ponto.amerinoc.com) (64.6.108.239) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 27 Sep 2011 14:33:58 +0000 Received: from fbsd8.localdomain (205.83.broadband7.iol.cz [88.102.83.205]) (authenticated bits=128) by ponto.amerinoc.com (8.14.5/8.14.5) with ESMTP id p8REXUjj059968 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Tue, 27 Sep 2011 16:33:34 +0200 (CEST) (envelope-from hsn@sendmail.cz) Received: from [127.0.0.1] ([10.0.0.1]) by fbsd8.localdomain (8.14.4/8.14.4) with ESMTP id p8REXNjj005813 for ; Tue, 27 Sep 2011 16:33:24 +0200 (CEST) (envelope-from hsn@sendmail.cz) Message-ID: <4E81DEAC.9010103@sendmail.cz> Date: Tue, 27 Sep 2011 16:33:16 +0200 From: Radim Kolar User-Agent: Mozilla/5.0 (Windows NT 5.1; rv:6.0.2) Gecko/20110902 Thunderbird/6.0.2 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: Re: Tool for SQL -> Cassandra data movement References: <4E7AFA5B.4080507@sendmail.cz> <4E7B75E7.6050901@sendmail.cz> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Antivirus: avast! (VPS 110927-0, 27.09.2011), Outbound message X-Antivirus-Status: Clean X-Virus-Checked: Checked by ClamAV on apache.org > I have cleaned up my code that imports CSV into Cassandra and I have put it open on https://github.com/nehalmehta/CSV2Cassandra. Have a look if it is useful to you. Hello, I will remake this tool into something which is like Oracle SQL*Loader. Basically, you will pass controlfile as command line argument. I need conversion from DATE to milliseconds based date, header less CSV and better CSV escaping. example of control file options (rows=1000) LOAD DATA INFILE 'c:\tmp\searches.csv' BADFILE 'c:\tmp\searches.bad' REPLACE INTO TABLE SEARCHES2 FIELDS TERMINATED BY "," OPTIONALLY ENCLOSED BY '"' ( query, day date 'YYYY-MM-DD', results, ip ) or maybe i will start project from 0