Recommended Resources

The following books provide depth and perspective that complement the whole course, not just individual weeks. None of them is required reading, but each one rewards the investment.

  • UNIX and Linux System Administration Handbook by Evi Nemeth, Garth Snyder, Trent Hein, and Ben Whaley: the definitive reference for Linux and Unix system administration, covering everything from the command line to networking, storage, security, and large-scale operations.
  • Site Reliability Engineering: How Google Runs Production Systems edited by Betsy Beyer, Chris Jones, Jennifer Petoff, and Niall Richard Murphy: Google’s collected practices for building and operating reliable systems at scale, covering automation, monitoring, incident response, and postmortems. Available free at sre.google/sre-book.
  • The Site Reliability Workbook: Practical Ways to Implement SRE edited by Betsy Beyer, Niall Richard Murphy, David K. Rensin, Kent Kawahara, and Stephen Thorne: a companion to the SRE book with practical exercises and case studies for implementing SRE principles in your organization. Available free at sre.google/workbook.
  • The DevOps Handbook by Gene Kim, Patrick Debois, John Willis, and Jez Humble: a practical guide to DevOps transformation, organized around flow, feedback, and continuous learning across development and operations.
  • The Phoenix Project by Gene Kim, Kevin Behr, and George Spafford: a novel that illustrates DevOps principles through a fictional IT crisis. A useful companion to The DevOps Handbook, it makes the cultural and organizational ideas concrete.