Streaming
Processing Real-Time …
The first part of this post can be found at Processing Real-Time Streams in Databricks – Part 1.
This is the continuation of Part 1 — we won’t repeat the architecture and setup. In this section we go into Azure Databricks and create the cluster and notebook to ingest data in real-time, …
Processing Real-Time …
Databricks is becoming the new normal in data processing technologies in cloud, both Azure and AWS. This is a step-by-step guide to get started on real-time (streaming) analytics using Spark Streaming on Databricks.
Architecture
The demo was built to show the speed layer (hot path) of a typical …
Kappa Architecture – …
Kappa architecture was proposed by Jay Kreps (co-creator of Apache Kafka) as a simplification of the Lambda architecture. The core idea: remove the batch layer entirely and treat everything as a stream.

The Core Concept
In Lambda architecture, you maintain two separate processing paths — batch and …
Introduction to Lambda …
Lambda architecture is a data processing architecture designed to handle massive quantities of data by taking advantage of both batch and stream processing methods. The architecture was introduced by Nathan Marz and is based on three layers: the Batch Layer, the Speed Layer, and the Serving Layer. …




