Skip to content

datadriven-io/awesome-data-engineering-interview

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Awesome Data Engineering Interview Awesome

Curated resources for data engineering interview prep. Books, blogs, lessons, problem banks, courses, and tools.

License PRs welcome Sandbox


Every entry on this list is hand picked. No filler. If a resource is not the best in its category, it is not here. PRs welcome to add anything stronger.

Contents

Books

Question banks

Lessons and tutorials

System design

Cheatsheets

Company guides

Company Guide Distinctive
Netflix companies/netflix/interview Streaming and OLAP at scale
Uber companies/uber/interview Real time, geo partitioning
Amazon companies/amazon/interview Leadership principles, bar raiser
Google companies/google/interview BigQuery patterns, algorithmic depth
Meta companies/meta/interview Presto, product sense plus DE

Full company index: datadriven.io/companies.

Behavioral

Blogs and newsletters

Tools to know

Category Tool Why it shows up in interviews
Orchestration Airflow The default expectation
Orchestration Dagster, Prefect Modern alternatives, common in tradeoff questions
Transformation dbt Standard for warehouse modeling
Streaming Kafka Standard for event ingestion
Streaming Flink, Spark Structured Streaming Common stream processors
Warehouse Snowflake, BigQuery, Redshift Pick what your target uses
Lakehouse Databricks, Iceberg, Delta, Hudi Frequent in modern stack tradeoff questions
Format Parquet, ORC, Avro Know the tradeoffs
Catalog Unity Catalog, Glue, Polaris Increasingly asked

Longer tooling map: datadriven.io/data-engineering-tools.

Roadmaps and study plans

Communities

Companion repos

Contributing

Open a PR following the awesome list manifesto.

Rules:

  1. One line note per entry, no marketing copy.
  2. Free resources preferred. Paid only if best in category.
  3. No affiliate links.
  4. No dead links.

Run awesome-lint before opening a PR.

License

CC0 1.0. Public domain.

About

A curated awesome list of the best free resources for data engineering interview prep in 2026. SQL, Python, schema design, pipeline architecture, system design, books, blogs, and company guides.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors