Over the years, I’ve read hundreds of industry white papers and articles. Initially, it started with the curiosity to learn about how different systems are built, and then it became about refining my philosophy on what contributes to well-designed systems. While I found it a good use of my time, I was not blind to the fact that memory is not permanent and it slowly loses resolution. I use these summaries to quickly refresh my memory, whenever I am working on new problems.

White Paper ranking

Last Updated: Feb 27, 2024, Live Document

  1. ✅ On designing and deploying internet-scale services [ MS, 2008]
  2. ✅ Scaling Memcache at Facebook [Meta, 2013]
  3. ✅ Amazon Dynamo and DynamoDB → (Amazon 2007, 2022)
  4. ✅ Achieving Rapid Response Times in Large Online Services [ Jeff Dean [2012]]
  5. ✅Spanner: Google’s Globally-Distributed Database [2012]
  6. ✅ FoundationDB: A Distributed Unbundled Transactional Key [Apple, Snowflake, 2021]
  7. Zanzibar: Google Consistent, Global Authorization System [Google, 2019]
  8. ✅The Anatomy of a large-scale hyper-textual Web Search engine [Google,98]
  9. ✅ Paxos Made Simple
  10. ✅ Raft: In search of an understandable consensus Algorithm
  11. ✅ Napa: Powering Scalable Data Warehousing with Robust Query Performance at Google. [Google, 2021]
  12. ✅ Pinot: Realtime OLAP for 530 Million Users
  13. ✅ Procella. Unifying serving and analytical data at Youtube. [Google, 2022]
  14. ✅F1: A Distributed SQL Database That Scales. [Google, 2013]
  15. ✅ Earlybird: Real-Time Search at Twitter [2012]
  16. ✅ Finding a needle in Haystack: Facebook's photo storage [2010]
  17. ✅ The Snowflake Elastic Data Warehouse [ 2016]

Top persistent system rankings

Page collection

✅ On designing and deploying internet-scale services [ MS, 2008]

✅ Scaling Memcache at Facebook [Meta, 2013]

✅ FoundationDB: A Distributed Unbundled Transactional Key [Apple, Snowflake, 2021]

Zanzibar: Google Consistent, Global Authorization System [Google, 2019]

✅The Anatomy of a large-scale hyper-textual Web Search engine [Google,98]

✅ Amazon Dynamo and DynamoDB → (Amazon 2007, 2022)

✅ Paxos Made Simple

✅ Raft: In search of an understandable consensus Algorithm

✅ Napa: Powering Scalable Data Warehousing with Robust Query Performance at Google. [Google, 2021]

✅ Procella. Unifying serving and analytical data at Youtube. [Google, 2022]

✅ Earlybird: Real-Time Search at Twitter [2012]

✅F1: A Distributed SQL Database That Scales. [Google, 2013]

✅ Pinot: Realtime OLAP for 530 Million Users

✅ Achieving Rapid Response Times in Large Online Services [ Jeff Dean [2012]]

✅Spanner: Google’s Globally-Distributed Database [2012]

✅ Finding a needle in Haystack: Facebook's photo storage [2010]

✅ The Snowflake Elastic Data Warehouse [ 2016]