Engineering
Site Reliability Engineer
Keeps the platform up at 3am — and prevents the next outage.
SREs are the discipline (originating at Google) that treats reliability as an engineering problem. They build the monitoring, automation, and incident response systems that keep large platforms running. When something breaks at 3am, they're on the bridge. When nothing breaks, they're automating away the next failure.