APACHE ICEBERG — DEEP DIVE I: ARCHITECTURE

Douglas Saldanha de Souza

Engenharia de Dados | PySpark | Python | SQL | Databricks | Spark| AWS | Analytics Engineer

Published Dec 26, 2024

+ Follow

Hello everyone!

We're back with another edition of Dataletter, the second to last of the year!

Over the last few weeks I've been learning about the new table formats, right after Parquet.

There's an article here.

After the in-depth study, I moved on to Apache Iceberg, which is an open source format for Delta Lake.

Having been motivated by its adoption by Amazon, I did an in-depth study of its layered architecture, how it is divided up and how it works.

So, if you want to learn more, check out the full article on the blog.

LINK TO ARTICLE HERE

Dataletter: Data Engineer News

324 followers

+ Subscribe

To view or add a comment, sign in

APACHE ICEBERG — DEEP DIVE I: ARCHITECTURE

Douglas Saldanha de Souza

Engenharia de Dados | PySpark | Python | SQL | Databricks | Spark| AWS | Analytics Engineer

Dataletter: Data Engineer News

324 followers

More articles by Douglas Saldanha de Souza

Insights from the community

Others also viewed

July 2022

eCHO News 34

10-January-2022 Streaming

How to Optimize SpringBoot Application and improve its efficiency

Ensuring Consistency in Distributed Systems: The Role of Consensus Algorithms ?

In Person at All Things Open (Raleigh USA 2023)

Iceberg - The Cloudera Way

start-notebook-in-specific-cluster

KubeCon NA 2022 - DOK Day

Deploy HA Kubernetes cluster using Kubesphere on Almalinux9

Explore topics

Dataletter: Data Engineer News

324 followers

More articles by Douglas Saldanha de Souza

APACHE ICEBERG — DEEP DIVE II: HOW READ & WRITE OPERATIONS WORKS

Parquet - Internals: Um Estudo Detalhado.

Data Lake com Hadoop: Final

Como criar um Delta Lake com Hadoop I

Criando Cluster Spark com Docker

SQL: Manipulação de Dados

Fontes de Dados: Spark & Databricks

Ambiente de Homologação com DBT & SQL Server.

CDO: Começo da Trajetória

DBT: Crie Modelos & Fontes de Dados.

Insights from the community

Others also viewed

July 2022

eCHO News 34

10-January-2022 Streaming

How to Optimize SpringBoot Application and improve its efficiency

Ensuring Consistency in Distributed Systems: The Role of Consensus Algorithms ?

In Person at All Things Open (Raleigh USA 2023)

Iceberg - The Cloudera Way

start-notebook-in-specific-cluster

KubeCon NA 2022 - DOK Day

Deploy HA Kubernetes cluster using Kubesphere on Almalinux9

Explore topics