Resources
Blog Posts and Journals
- The Modern Data Stack Repository
- Medium Blog Posts - Data Engineering
- Start Data Engineering Blog Posts
- High Scalability
- The GitHub Blog
- Engineering at Quora
- Yelp Engineering Blog
- Twitter Engineering
- Facebook Engineering
- Yammer Engineering
- Etsy Code as Craft
- Foursquare Engineering Blog
- Airbnb Engineering
- WebEngage Engineering Blog
- LinkedIn Engineering
- The Netflix Tech Blog
- BankSimple Simple Blog
- Square The Corner
- SoundCloud Backstage Blog
- Flickr Code
- Instagram Engineering
- Dropbox Tech Blog
- Cloudera Developer Blog
- Bandcamp Tech
- Oyster Tech Blog
- THE REDDIT BLOG
- Groupon Engineering Blog
- Songkick Technology Blog
- Google AI Blog
- Google Developers Blog
- Pinterest Engineering Blog
- Twilio Engineering Blog
- Bitly Engineering Blog
- Uber Engineering Blog
- Godaddy Engineering
- Splunk Blog
- Coursera Engineering Blog
- PayPal Engineering Blog
- Nextdoor Engineering Blog
- Booking.com Development Blog
- Microsoft Engineering Blog
- Scalyr Engineering Blog
- Myntra Engineering Blog
- Fastly Blog
- AWS Architecture Blog
- Lyft Engineering Blog
- Wish Engineering
- Doordash Engineering
- SnowFlake Blog
- Palantir Blog
- Awesome Data Engineering
Data Engineering
- 97 Things Every Data Engineer Should Know
- Data Engineering with AWS [code]
- Data Engineering with Google Cloud Platform [code]
- Scalable Data Streaming with Amazon Kinesis [code]
- Fundamentals of Data Engineering
- Designing Data-Intensive Applications [code]
- Data Engineering with Python [code]
- Simplifying Data Engineering and Analytics with Delta [code]
- Azure Data Engineering Cookbook [code]
- Data Engineering with Apache Spark, Delta Lake, and Lakehouse [code]
- Data Pipelines Pocket Reference [code]
- Serverless Analytics with Amazon Athena [code]
- Modern Data Engineering with Apache Spark: A Hands-On Guide for Building Mission-Critical Streaming Applications [code]
- Apache Spark 3 for Data Engineering and Analytics with Python [code]
- Data Pipelines with Apache Airflow
Spark
- Mastering Big Data Analytics with PySpark [code]
- PySpark Cookbook [code]
- Learning Spark, 2nd Edition [code] [Alternative]
- Spark: The Definitive Guide [code]
- Spark Programming in Python for Beginners with Apache Spark 3 [code]
- Beginning Apache Spark 3: With DataFrame, Spark SQL, Structured Streaming, and Spark Machine Learning Library
- Data Algorithms with Spark [code]
- Scaling Machine Learning with Spark
- Modern Data Engineering with Apache Spark: A Hands-On Guide for Building Mission-Critical Streaming Applications
- Apache Spark 3 Advance Skills for Cracking Job Interviews
- Apache Spark 3 for Data Engineering and Analytics with Python [code]
- Advanced Analytics with PySpark [code]
- Spark in Action, Second Edition [code]
- High Performance Spark [code]
- Real-Time Stream Processing Using Apache Spark 3 for Python Developers [code]