Home Services Partners Company
Universal Cloudera ODBC connector for the Hadoop Big Data Ecosystem

Universal Cloudera ODBC connector for the Hadoop Big Data Ecosystem

April 09, 2013 0 Comments

Cloudera shops are really excited about the DataDirect Cloudera ODBC Hive driver to connect their enterprise.  Our connector represents the democratization of big data since it works with all ODBC compliant applications across all business platforms, and data is accessible to everyone that knows SQL, well beyond data scientists and programmers writing Java, Pig, or R.

So you have a successful big data implementation?

This is assumed since you're looking for enterprise ODBC connectivity which is a key indicator that your big data initiative is proving business value across the organization.  Demand for ODBC connectivity is cloudera_logoreally picking up, and I am getting several questions on support for multiple, concurrent connections and authentication with hive2 for which we introduced feature support in our 7.1 SP1 driver.  These shops are running CDH 4.1 to store mostly large scale transactional data.  It's exciting to see sponsorship at the C-level since these organizations understand the competitive advantage gained by having more departments derive business value from their big data.

What is a Universal Cloudera ODBC driver?

It's true there are open source ODBC drivers available.  However, the list of available connectors to download for Oracle, Teradata, Microstrategy, Netezza, Qlikview and Tableau are a mix between limited ODBC compliance and sqoop based connectors supporting different levels of Hive Server.

Here are the reasons YOU have shared with me (paraphrased in quotes) for choosing DataDirect for Cloudera ODBC connectivity:

  • "It just works". This is thanks to new ANSI SQL support including BETWEEN clause, Quoted column aliases, and support for all of HiveQL syntax.
  • "We need a single ODBC connector for all of our business systems".  A fully complaint ODBC driver enables connectivity from thousands of applications including: Teradata Parallel Transporter (TPT), SSIS, IBM DataStage, Ab Initio, Informatica PowerCenter, SAP Data Services, Business Objects, OBIEE, Cognos, SAS, SPSS, Unica, Linked Server, Oracle Database Gateway, and more.
  • "Need support for concurrent connections and authentication introduced in hive2." 
  • "We need 64-bit ODBC drivers for AIX".  Platform coverage is available across 32-bit and 64-bit Windows, Linux, AIX, Solaris, and HP-UX.
  • "As a BI developer, I just want my sysadmin to tell me what port number to connect to and let the DataDirect driver take care of the latest SQL technologies, such as Impala, coming out of the Hadoop ecosystem".

What are some DataDirect projects for Cloudera ODBC connectivity?

I am seeing hive2 enable enterprise application adoption for data warehousing, federation and visualization.  I am actively working on multiple POCs across the following use cases:

  • Load Hadoop data into SAP BW using SAP Data Services 4.0 via ODBC.
  • Support lookups from Oracle against historical transactional data in Hadoop using the Oracle Database Gateway for ODBC.  It is no longer necessary to schedule on demand load jobs to physically move the data.
  • Visualize raw data generated by point of sales (POS) systems using Tibco Spotfire.

My prediction is that we will see an increasing number of these projects with the release of hiveserver2 and Impala.

Support latest SQL technologies.

DataDirect Universal Cloudera ODBC driver will support the latest SQL technologies in Hadoop ecosystem.

Get started today

Sumit Sakar

Sumit Sarkar

Sumit Sarkar is a Chief Data Evangelist at Progress, with over 10 years experience working in the data connectivity field. The world's leading consultant on open data standards connectivity with cloud data, Sumit's interests include performance tuning of the data access layer for which he has developed a patent pending technology for its analysis; business intelligence and data warehousing for SaaS platforms; and data connectivity for aPaaS environments, with a focus on standards such as ODBC, JDBC, ADO.NET and ODATA. He is an IBM Certified Consultant for IBM Cognos Business Intelligence and TDWI member. He has presented sessions on data connectivity at various conferences including Dreamforce, Oracle OpenWorld, Strata Hadoop, MongoDB World and SAP Analytics and Business Objects Conference, among many others. 

Read next Using IBM DB2 JDBC Driver to Integrate DB2 with Amazon S3
Comments are disabled in preview mode.