💊 DATA Pill #137 - Your Top Picks of 2024!
Hi,
Welcome to DATA Pill #137! This edition is all about your favorites—the most-read articles, tutorials, and insights of 2024. From self-service data mesh to real-time analytics and AI breakthroughs, these are the resources you loved the most.
Let’s dive in!
ARTICLES
How Pfizer Achieved Self-Service Data Mesh with Snowflake and Azure | 20 min | Data Engineering | Samia Rahman, Marty Hall, Gary Kretzschmar, Christopher Witcher, Jennifer Yoakum, Matthew Massey | Snowflake Blog
This article delves into the strategic deployment of data mesh on platforms like Snowflake and Azure, offering insights from those who have successfully navigated the journey. Through the lens of Pfizer's Data Strategy, Science, and Solutions team, let's explore the pivotal shifts necessary to achieve a robust self-service data ecosystem, illustrating the challenges and triumphs along the way.
What Is a Streaming Database? | 6 min | Real-time analytics | RisingWave Labs | Towards Dev
Streaming databases are designed to process and store large volumes of real-time data, enabling immediate analysis and insights. Unlike traditional batch-processing databases, they handle continuous data flow and are ideal for time-sensitive applications like fraud detection and IoT. These databases support real-time analytics by combining immediate data processing with persistent storage.
In MORE LINKS you will read:
Recommended by LinkedIn
TUTORIALS
Real-time Analytics: architecture, technologies and example implementation in e-commerce | 6 min | Real-time analytics | Piotr Pękala | GetInData | Part of Xebia Blog
This blog delves into how real-time analytics can transform data collection, transformation, and analysis to provide immediate insights and actionable information, focusing on e-commerce implementation.
PODCAST
No Priors Ep. 80 | With Andrej Karpathy from OpenAI and Tesla | 44 min | Gen AI | Andrej Karpathy, Sarah Guo, Elad Gil | No Priors Podcast
Andrej Karpathy, former Tesla Autopilot leader and OpenAI founding member, joins to discuss self-driving cars, Tesla's Optimus robot, and AI's future. He also shares insights on AI education and his new mission, Eureka Labs.
DATA TUBE
Realtime Streaming with Data Lakehouse - End to End Data Engineering Project | 1h | Streaming | CodeWithYu
How to design, implement and maintain secure, scalable and cost effective lakehouse architectures leveraging Apache Spark, Apache Kafka, Apache Flink, Delta Lake, AWS, and open-source tools.
____________________
Have any interesting content to share in the DATA Pill newsletter?
➡ Join us on GitHub
➡ Dig previous editions of DataPill
Adam from the GetInData | Part of Xebia
Director of Developer Community at RisingWave Labs
4dGlad to see What Is a Streaming Database among your top picks! 🤩 Happy New Year and keep up with great work Adam Kawa!