Skip to content

data-engineering-helpers/data-lakehouse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 

Repository files navigation

Data-lakes, data warehouses and data lake-houses

Table of Content (ToC)

Created by gh-md-toc

Overview

This project intends to collect, analyze and synthetize referential material about data-lakes, data warehouses and data lakehouses.

Even though the members of the GitHub organization may be employed by some companies, they speak on their personal behalf and do not represent these companies.

Other repositories of Data Engineering helpers

References

Awesome lakehouse guide

DataBricks blog - What is a data lakehouse

Snowflake guides - What is a data lakehouse

Google Cloud - What is a data lakehouse

What is Apache XTable

Articles

Apache Arraw ecosystem

ACID Transactions in an Open Data Lakehouse

Why are companies building a lakehouse

"Why are companies building a Lakehouse"? This is how I responded... The why is simple:

  1. Reduce costs
  2. Eliminate lock-in
  3. Be more agile and flexible

From Lakehouse architecture to data mesh

Open Table Formats and the Open Data Lakehouse

Understanding Parquet, Iceberg and data lake-houses at broad

The Data Lakehouse: Data Warehousing and More

Understanding Big Data File Formats

Frameworks

PostgreSQL extensions

pg_lake

pg_incremental

About

Knowledge sharing - Material about data-lakes, data warehouses and data lake-houses

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors