a team of Site Reliability Engineers responsible for delivering high availability, security, and performance across multiple..., guiding team execution in incident response, automation, observability, environment management, and operations. The Manager...
's Product Reliability Engineering (PRE) Observability team partners with Product Development as well as Operations... a Staff Site Reliability Engineer (SRE) with comprehensive knowledge and practical experience. This position requires an I6...
). Demonstrated experience in DevOps or Site Reliability Engineering (SRE) practices for high-traffic environments. Preferred... Secret Manager. Build and maintain comprehensive monitoring and observability solutions using Prometheus, Thanos...
(Terraform, Ansible). Experience with CI/CD pipelines (Jenkins, GitLab CI/CD). Ownership approach to engineering and product... development cycles. What you will be doing: Site Reliability Engineer will play a critical role in driving innovation...
, and reliability engineering principles (SLOs, error budgets, incident response). Understanding of database technologies including SQL... countries around the world. Job Description The Staff Data Reliability Engineer is responsible for designing, building...
or equivalent experience in the related field. 3+ years managing Engineering teams. 8+ years of experience as a site reliability..., CloudFormation, Terraform, Chef) Knowledge of observability tools such as LogicMonitor, New Relic, Prometheus, and Coralogix...
infrastructure settings Comprehensive knowledge of Site Reliability Engineering (SRE) principles, including SLIs/SLOs, error budgets..., and incident management processes Proficiency in Infrastructure as Code practices (e.g., Terraform, Deployment Manager) and CI/CD...
and maintain automation pipelines for DNS, certificates, and infrastructure provisioning using Terraform, Ansible, Python, and Bash.../certificates. · Hands-on experience with automation tools (Terraform, Ansible, CI/CD, GitOps). · Proficiency in Linux...
Senior Lead SRE Engineer Role Summary We’re looking for a Senior Lead Site Reliability Engineer to own and improve... the reliability, performance, and security of our NGINX-based platforms. This role combines hands-on engineering...