Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Data Lake / Lakehouse Guide: Data Lake Table Formats (Delta Lake, Iceberg, Hudi) (airbyte.com)
3 points by sspaeti on Aug 25, 2022 | hide | past | favorite | 1 comment


I created a small guide with the most important topics on a Data Lake and Lakehouse containing the following chapters.

- What is a Data Lake & Why do you need one? - Differences between Lakehouse & Warehouse - Components of Data Lake - Storage Layer (AWS S3, Azure Blob Storage, Google Cloud Storage) - File Format (Apache Parquet, Avro, ORC) - Table Format (Delta Lake, Apache Hudi, and Iceberg) - Trends in the Market - How to turn it into a Lakehouse

I hope that's interesting to one or the other. Curious to hear your thoughts and opinions. What's your Data Lake Table Format of choice, and why?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: