Services
90% Faster Hive Queries with New Progress Hive ODBC Driver

90% Faster Hive Queries with New Progress Hive ODBC Driver

April 05, 2017 0 Comments
Apache Hive ODBC Driver

Are you looking for the best performance for your Hive ODBC queries? Our new release offers fast ODBC connectivity for big data analytics.

Advanced analytics, along with data scientists and analysts, are driving the need for speed around Hadoop Hive for both batch and interactive workloads. And we heard loud and clear from 1,200+ survey respondents that Hive remains the most popular big data interface to access data from our annual survey (2017 results to be published soon—stay tuned).

That’s why we’re pleased to announce critical performance enhancements in the new version of our Apache Hive ODBC driver. Our Hive connectors provide the fast, superior connectivity needed for data management applications and BI and analytics tools such as Power BI or Qlik.

Our ODBC connector for Hadoop Hive supports all Hive distributions out-of-the-box and includes the Progress Security Vulnerability Response Policy, as well as “Day One” support. You get full support for any new version of Hive from day one for the business platform of your choice, across AIX, Linux, Solaris*, HPUX* and Windows (*supported with 7.1).

In addition to SQL access, you can leverage our hybrid technology to produce a standard REST API (OData) to operationalize Hadoop and instantly connect popular OData consumers such as Salesforce, Oracle Service Cloud and Tableau.

Customer Use Cases

Native Hive ODBC drivers are provided with each Hadoop distribution to support basic tasks on a workstation, such as pulling data into Microsoft Excel from that specific Hadoop Hive distribution and version. In contrast, Progress DataDirect drivers are engineered for applications, and include enterprise support and features across all versions and distributions of Hadoop Hive.

MicroStrategy distributes DataDirect Hive ODBC drivers with its analytics platform for faster access via SQL to Hadoop data. The company’s customers need to conduct analysis and perform highly complex queries against Hive. MicroStrategy helps them get to the data and insights faster.

IBM Campaign recommends DataDirect Hive ODBC drivers to query customer data for targeted marketing campaigns. Rather than certify multiple versions within each of the seven distributions of Hadoop Hive, IBM can certify one driver to support all its customers’ big data platforms.

Key Takeaways for the Hive ODBC Driver

The new Apache Hadoop Hive ODBC driver offers major performance enhancements for querying and processing truly high volume, complex data sets:

  • Up to 90% performance gain over our last release for fetch performance and pre-fetch optimizations for large result sets
  • New metadata access methods enable you to optimize for performance, information detail or a balance of both
  • Multi-row insert capability improves batch import times
  • Pre-configured TDC files improves compatibility with Tableau and enable you to fine-tune ODBC connectivity to improve performance of complex SQL statements

Technical Specs

32- and 64-bit drivers are available for all supported databases and platforms unless otherwise noted.

ODBC Version Support

Compatible with ODBC 3.8 applications

Protocol Support

HiveServer2

Hive Version Support

Supports Apache Hive version 1.0 and higher against the following distributions:

  • Amazon Elastic MapReduce (Amazon EMR), version 4.0 and higher
  • Apache Hadoop Hive
  • Cloudera's Distribution including Apache Hadoop (CDH), version CDH5.4 and higher
  • Hortonworks Distribution for Apache Hadoop, version 2.3 and higher
  • IBM BigInsights, version 4.0 and higher
  • MapR Distribution for Apache Hadoop, version 5.0 and higher
  • Pivotal HD Enterprise (PHD), version 3.0 and higher

Operating System Support

AIX (32- and 64-bit)

  • AIX, version 5.3 and higher

Linux X86 (32- and 64-bit for AMD and INTEL Processors)

  • CentOS Linux x86, version 4.0 and higher
  • Debian Linux x86, version 7.0 and higher
  • Oracle Linux x86, version 4.0 and higher
  • Red Hat Enterprise Linux x86, version 4.0 and higher
  • SUSE Linux Enterprise Server Linux x86, version 10 and higher
  • Ubuntu Linux x86, version 14.04 and higher

Windows (32- and 64-BIT)

  • Windows (x86), version 7 and higher
  • Windows Server (x86), version 2003 and higher
  • Windows Vista (x86)
  • Windows XP Professional (x86), SP2 or higher

Driver/Client Software Requirements

No Requirements

Get Started

Start using high performance ODBC connectivity to Apache Hadoop Hive today. Try it now for free. 

TRY IT NOW

Sumit Sakar

Sumit Sarkar

Sumit Sarkar is a Chief Data Evangelist at Progress, with over 10 years experience working in the data connectivity field. The world's leading consultant on open data standards connectivity with cloud data, Sumit's interests include performance tuning of the data access layer for which he has developed a patent pending technology for its analysis; business intelligence and data warehousing for SaaS platforms; and data connectivity for aPaaS environments, with a focus on standards such as ODBC, JDBC, ADO.NET and ODATA. He is an IBM Certified Consultant for IBM Cognos Business Intelligence and TDWI member. He has presented sessions on data connectivity at various conferences including Dreamforce, Oracle OpenWorld, Strata Hadoop, MongoDB World and SAP Analytics and Business Objects Conference, among many others. 

Read next 2017 Data Connectivity Trends in SaaS, Relational & Big Data
Comments
Comments are disabled in preview mode.