Blog

How to Structure the Data Lake

A data lake is a concept, a framework, and a set of guidelines for where to place data (Microsoft named its product Azure Data Lake, but the concept is broader). From a technology point of view, it suggests storing all data in object or hierarchical storage. This is the concept of data locality: data …

5 Things to Get the Best Out of Azure Databricks

Most of the customers I talk to are asking, directly or indirectly, to scale their workloads and use Databricks. It has become the new normal for data processing in the cloud. If you are using or plan to use Azure Databricks, this post will point you to some interesting things to investigate as you start. …

Analytics Maturity (Part 1) – Introducing the Chasm

“The world’s most valuable resource is no longer oil, but data.” — The Economist

Organizations are challenged daily to come up with innovative business models, use data at the core of the business, and make more data-driven decisions.

I’ve seen a few analytics maturity …

Introduction to Distributed Computing

Distributed computing spreads the compute load across multiple nodes (computers) connected via a network. The networked machines share a common goal and divide the work among themselves, pooling their resources to achieve it.

Early …