90% Faster Hive Queries with New Progress Hive ODBC Driver

90% Faster Hive Queries with New Progress Hive ODBC Driver

Posted on April 05, 2017 0 Comments
Apache Hive ODBC Driver

Are you looking for the best performance for your Hive ODBC queries? Our new release offers fast ODBC connectivity for big data analytics.

Advanced analytics, along with data scientists and analysts, are driving the need for speed around Hadoop Hive for both batch and interactive workloads. And we heard loud and clear from 1,200+ survey respondents that Hive remains the most popular big data interface to access data from our annual survey (2017 results to be published soon—stay tuned).

That’s why we’re pleased to announce critical performance enhancements in the new version of our Apache Hive ODBC driver. Our Hive connectors provide the fast, superior connectivity needed for data management applications and BI and analytics tools such as Power BI or Qlik.

Our ODBC connector for Hadoop Hive supports all Hive distributions out-of-the-box and includes the Progress Security Vulnerability Response Policy, as well as “Day One” support. You get full support for any new version of Hive from day one for the business platform of your choice, across AIX, Linux, Solaris*, HPUX* and Windows (*supported with 7.1).

In addition to SQL access, you can leverage our hybrid technology to produce a standard REST API (OData) to operationalize Hadoop and instantly connect popular OData consumers such as Salesforce, Oracle Service Cloud and Tableau.

Customer Use Cases

Native Hive ODBC drivers are provided with each Hadoop distribution to support basic tasks on a workstation, such as pulling data into Microsoft Excel from that specific Hadoop Hive distribution and version. In contrast, Progress DataDirect drivers are engineered for applications, and include enterprise support and features across all versions and distributions of Hadoop Hive.

MicroStrategy distributes DataDirect Hive ODBC drivers with its analytics platform for faster access via SQL to Hadoop data. The company’s customers need to conduct analysis and perform highly complex queries against Hive. MicroStrategy helps them get to the data and insights faster.

IBM Campaign recommends DataDirect Hive ODBC drivers to query customer data for targeted marketing campaigns. Rather than certify multiple versions within each of the seven distributions of Hadoop Hive, IBM can certify one driver to support all its customers’ big data platforms.

Key Takeaways for the Hive ODBC Driver

The new Apache Hadoop Hive ODBC driver offers major performance enhancements for querying and processing truly high volume, complex data sets:

  • Up to 90% performance gain over our last release for fetch performance and pre-fetch optimizations for large result sets
  • New metadata access methods enable you to optimize for performance, information detail or a balance of both
  • Multi-row insert capability improves batch import times
  • Pre-configured TDC files improves compatibility with Tableau and enable you to fine-tune ODBC connectivity to improve performance of complex SQL statements

Technical Specs

32- and 64-bit drivers are available for all supported databases and platforms unless otherwise noted.

ODBC Version Support

Compatible with ODBC 3.8 applications

Protocol Support

HiveServer2

Hive Version Support

Supports Apache Hive version 1.0 and higher against the following distributions:

  • Amazon Elastic MapReduce (Amazon EMR), version 4.0 and higher
  • Apache Hadoop Hive
  • Cloudera's Distribution including Apache Hadoop (CDH), version CDH5.4 and higher
  • Hortonworks Distribution for Apache Hadoop, version 2.3 and higher
  • IBM BigInsights, version 4.0 and higher
  • MapR Distribution for Apache Hadoop, version 5.0 and higher
  • Pivotal HD Enterprise (PHD), version 3.0 and higher

Operating System Support

AIX (32- and 64-bit)

  • AIX, version 5.3 and higher

Linux X86 (32- and 64-bit for AMD and INTEL Processors)

  • CentOS Linux x86, version 4.0 and higher
  • Debian Linux x86, version 7.0 and higher
  • Oracle Linux x86, version 4.0 and higher
  • Red Hat Enterprise Linux x86, version 4.0 and higher
  • SUSE Linux Enterprise Server Linux x86, version 10 and higher
  • Ubuntu Linux x86, version 14.04 and higher

Windows (32- and 64-BIT)

  • Windows (x86), version 7 and higher
  • Windows Server (x86), version 2003 and higher
  • Windows Vista (x86)
  • Windows XP Professional (x86), SP2 or higher

Driver/Client Software Requirements

No Requirements

Get Started

Start using high performance ODBC connectivity to Apache Hadoop Hive today. Try it now for free. 

TRY IT NOW

Sumit Sakar

Sumit Sarkar

Technology researcher, thought leader and speaker working to enable enterprises to rapidly adopt new technologies that are adaptive, connected and cognitive. Sumit has been working in the data access infrastructure field for over 10 years servicing web/mobile developers, data engineers and data scientists. His primary areas of focus include cross platform app development, serverless architectures, and hybrid enterprise data management that supports open standards such as ODBC, JDBC, ADO.NET, GraphQL, OData/REST. He has presented dozens of technology sessions at conferences such as Dreamforce, Oracle OpenWorld, Strata Hadoop World, API World, Microstrategy World, MongoDB World, etc.

Comments

Comments are disabled in preview mode.
Topics

Sitefinity Training and Certification Now Available.

Let our experts teach you how to use Sitefinity's best-in-class features to deliver compelling digital experiences.

Learn More
Latest Stories
in Your Inbox

Subscribe to get all the news, info and tutorials you need to build better business apps and sites

Loading animation