Site Reliability Engineer

Also known as: SRE, Platform Engineer, DevOps Reliability Engineer, Infrastructure Reliability Engineer

AI Impact Score: 5/100

AI is accelerating incident detection, runbook automation, and anomaly identification, but SREs who understand complex distributed systems remain essential. Those who use AI to speed up incident response and capacity planning will be the most valuable.

Salary Range: $110k – $200k

Growth Outlook: Booming

Total Jobs (US): 180,000

Growth Rate: +22%

Task Breakdown

Tasks at Risk (4)

Writing standard runbook documentation
Manual log analysis for known error patterns
Routine capacity planning calculations
Basic alert threshold configuration

AI-Enhanced Tasks (4)

Anomaly detection using ML-powered observability platforms
Automated incident triage and correlation
AI-assisted postmortem analysis and action item generation
Predictive capacity planning with traffic forecasting models

Human-Safe Tasks (4)

Designing distributed systems architecture for reliability
Novel incident investigation requiring systems thinking
Defining SLOs and error budget policy
Cultural change toward reliability across engineering teams

Current Skills

Linux systems administration
Cloud platforms (AWS, GCP, Azure)
Kubernetes and container orchestration
Observability (Prometheus, Grafana, Datadog)
Incident management and on-call practices

Future-Proof Skills

AI-powered observability and AIOps platforms
SLO-based reliability engineering
Chaos engineering and resilience testing
FinOps and cloud cost optimization
Platform engineering and internal developer platforms