Speaker: Cooper Bethea
He / him / his
Formerly Senior Staff Engineer and Technical Lead @Slack, Previously SRE Lead and SRE Workbook Author @Google
Cooper is a software engineer and site reliability expert with 17 years experience working on improving the reliability of large-scale distributed systems. Most recently, as Senior Staff Software Engineer at Slack, Cooper led the Cellular Slack project, a major rearchitecting initiative that significantly enhanced the platform's fault tolerance and disaster recovery capabilities.
Previously at Google, Cooper served as Reliability Lead for the Global Cloud Load Balancer and was the lead author for the “Managing Load” chapter of the SRE Workbook. His career spans roles at Foursquare and Sift, where he held responsibility for the availability of all user-facing infrastructure.
Cooper is passionate about building scalable, resilient systems and sharing knowledge within the tech community. His talks draw from nearly two decades of hands-on experience with some of the industry's most demanding infrastructure environments.
Find Cooper Bethea at:
Session
Slack's Migration to a Cellular Architecture
Cellular service architectures are a conceptually simple way for highly available online services to limit the impact of cascading failures and improve scale-out. So why aren't we all using them? And how do they even work in practice?