Data & AI

Are You Ready to Go Fishing In a Data Lake

by Mike Johnson Posted on October 22, 2015

Data lakes take advantage of data storage techniques for massively scalable, low-cost storage of data files in any format.

Harnessing the Tidal Wave of Data

In our last blog, we talked about our data-dependent universe and how companies and people that make use of all this data do well. Look at Amazon. Amazon.com, a huge online retailer, started out as a bookstore. We buy everything from Amazon.com, right? Airbnb took in over 10 million bookings for lodging last year, and they don’t have a single storefront. There's also Hulu, Apple, and more. But the businesses that don’t embrace these new technologies and new customer engagement models, they bite the dust. Borders, Blockbuster—a few years ago, there was one in every mall, now they’re gone. Kodak, an American giant, went bankrupt.

Data Flows and Data Streams = Data Lakes

Back to tech-savvy companies. What many of them are starting to do is create data lakes. Basically, with a data lake, you have all these streams of information flowing in so the data is stored in one location. However, if you look a little more closely, below the surface, you will see that these streams are coming in from many different repositories.

Data Scientists as Fishermen

You might have a Mongo data stream or a SQL Server data stream, and Hadoop is part of this, too. On one side you have all these data scientists. They’re your fishermen, the guys analyzing the bits and pieces, trying to find the next bit of information to pull out of the lake as they look for relevant facts that’ll help you run your business. You have data generalists and programmers who tap into the streams for real-time analytics, or write your everyday code or use your BI tools to pull information that supports decision-making. Then on the other side, specific data is pulled and treated before going into your data warehouse—the place where you keep all of your clean stuff that you don’t want to mix with anything else. This is one of the big data processes we see companies pursuing in order to pull and work with the most valuable data for business.

Blog 5 Data Lakes Image

Fastest data access in the industry

The drivers my team and I build here at Progress® DataDirect® are designed to be the fastest in the industry, and we stand behind that claim with award-winning technical support. When you find performance degradation issues in our software, we treat them as defects because we understand how important speed is to your business.

See for yourself

You can pick up a free trial today and try it out for yourself, or watch a replay of our webinar, “Industry Insight: Optimizing Your Data for Better Performance.” Don’t forget to check back for the next installment of this series!

Author’s note:

My good friend and colleague Jesse Davis has moved on to an exciting opportunity outside of Progress DataDirect so I’ll be finishing up this series. I’ve been a part of the DataDirect organization for over 22 years supporting and developing our industry leading products as well as leading the teams that build them. Stay tuned for more from me in this series!

Mike Johnson

Mike is a proven leader with over 20 years of experience in developing commercial software for the industry leader in standards-based data access software. He has extensive experience in all aspects of commercial software development including requirements analysis, developing functional requirements, developing and mentoring individuals, staffing, budgeting, product development, quality assurance, training and customer communication. Mike has progressed in his career in large part from his strong work ethic and a “do whatever it takes” attitude.

Related Tags

analytics big data data lake integration

Boost Your Post M&A Success: Embrace Integration

The period after landing a deal is an important time to build connections, establish trust and implement an integration plan.

Company and Community

Karen Williams March 24, 2023

Delivering Relevant Notifications When Monitoring Complex Systems and Applications

Corticon.js helps deliver relevant notifications in complex systems and applications monitoring.

Digital Experience

Thierry Ciot January 12, 2023

Progress to Acquire NoSQL Database Pioneer, MarkLogic

Progress strengthens our already strong position in application platforms and data connectivity by adding a NoSQL database and semantic metadata to our portfolio.

Company and Community DataDirect MarkLogic Mergers and Acquisitions Progress in the News

Yogesh Gupta January 03, 2023

Are You Ready to Go Fishing In a Data Lake

Harnessing the Tidal Wave of Data

Data Flows and Data Streams = Data Lakes

Data Scientists as Fishermen

See for yourself

Author’s note:

Mike Johnson

Related Tags:

Related Products:

DataDirect

Related Tags

Related Articles

Are You Ready to Go Fishing In a Data Lake

Harnessing the Tidal ​Wave of Data

Data ​Flows and ​Data ​Streams = ​Data Lakes

Data Scientists as Fishermen

See for yourself

Author’s note:

Mike Johnson

Related Tags:

Related Products:

DataDirect

Related Tags

Related Articles

Latest Stories in Your Inbox

Harnessing the Tidal Wave of Data

Data Flows and Data Streams = Data Lakes