May be filled
Software Engineer - Site Reliability Engineer (SRE)
Pittsburgh, PA
2025-11-22
AI Summary
Powered by ClaudeAs an SRE at Lovelace AI, you will play a critical role in ensuring the availability, scalability, and performance of our cutting-edge AI-powered applications and infrastructure. You will bridge the gap between software development and operations, applying sound engineering principles and automation to maintain and improve our systems.Key Responsibilities:Design, implement, and maintain robust monitoring, alerting, and observability solutions to proactively detect and resolve issues before they…
Job description
As an SRE at Lovelace AI, you will play a critical role in ensuring the availability, scalability, and performance of our cutting-edge AI-powered applications and infrastructure. You will bridge the gap between software development and operations, applying sound engineering principles and automation to maintain and improve our systems.Key Responsibilities:Design, implement, and maintain robust monitoring, alerting, and observability solutions to proactively detect and resolve issues before they impact end-users.Lead troubleshooting efforts for complex production issues, providing detailed root cause analysis (RCA) and implementing preventative measures.Develop and maintain automation scripts, build systems (Bazel) and infrastructure as code (IaC) using tools like Terraform, Ansible, or CloudFormation to eliminate manual tasks and improve system reliability and efficiency.Collaborate closely with software engineering teams to influence the design of new services and applications, ensuring they are scalable, reliable, and resilient from the outset.Participate in on-call rotations to respond to platform eme...
Get a weekly digest of similar roles
Save this search for Software Engineer - Site Reliability Engineer (SRE) in Pittsburgh, PA and get the strongest matches every week.
Privacy-first. Unsubscribe anytime.