hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alex Kamil <alex.ka...@gmail.com>
Subject Re: ETL using Hadoop
Date Thu, 09 Oct 2014 13:50:46 GMT
the fastest way to do ETL on Hadoop is via Hbase+Phoenix JDBC driver
<http://phoenix.apache.org/>,
as for ODBC mapping you could use Thrift  or one of the ODBC-JDBC bridges
<http://stackoverflow.com/questions/5352956/odbc-jdbc-bridge-that-maps-its-own-calls-to-jdbc-driver>

On Thu, Oct 9, 2014 at 8:16 AM, Andrew Machtolff <amachtolff@askcts.com>
wrote:

>  The closest thing I can think of to a .NET API would be to set up Hive
> external tables, and use a vendor’s (Cloudera, et al.) ODBC driver. You
> could connect from your .NET app using ODBC to the Hive tables, and
> SELECT/INSERT to read/write. If you’re desperate. J
>
>
>
> As far as ETL, I’d recommend you give SyncSort DMX-h a try. It’s a great
> little ETL tool that can translate its ETL tasks to MapReduce jobs. I’ve
> been using it for almost a year now, and it’s fantastic. Blazing fast, and
> with a trial download.
>
> (Disclaimer: I’m not affiliated with SyncSort, other than being a happy
> customer)
>
>
>
> Andrew
>
>
>
> *________________________________*
>
> Andrew Machtolff / Senior Consultant
>
> 205.259.2558 o
> 205.447.0956 c
>
> 205.259.2301 f
>
> [image: http://images.askcts.com/images/cts_logo_email.png]
> <http://www.askcts.com/>
> www.askcts.com
> amachtolff@askcts.com
> [image: http://images.askcts.com/images/cts_logo_linkedin.png]
> <http://www.linkedin.com/company/cts-inc>[image:
> http://images.askcts.com/images/cts_logo_twitter.png]
> <https://twitter.com/askCTS>[image:
> http://images.askcts.com/images/cts_logo_facebook.png]
> <https://www.facebook.com/askCTS>
>
>
>
> *From:* Azuryy Yu [mailto:azuryyyu@gmail.com]
> *Sent:* Wednesday, October 08, 2014 1:41 AM
> *To:* user@hadoop.apache.org
> *Subject:* Re: ETL using Hadoop
>
>
>
> Hi Moin,
>
> Yes, you can replace your ETL using hadoop. but it would be a big change.
>  such as data collection, pre-process, ETL tasks rewrite etc.
>
>
>
> I don't think there is .NET API in Hadoop.
>
>
>
> On Wed, Oct 8, 2014 at 2:31 PM, Dattatrya Moin <dattatryamoin@gmail.com>
> wrote:
>
>
>
> Hi ,
>
>
>
> We have our own ETL , but we are planning to use Hadoop for data
> processing as it gives better scalability and performance. As i am new to
> hadoop kindly guide to start with Hadoop. Can we replace ETL using Hadoop .
> And we have any API to connect to Hadoop using .NET.
>
>
>
>
>
> Thanks,
>
> Dattatrya Moin
>
>
>
>
>

Mime
View raw message