. Mentor less experienced engineers, fostering technical growth and a collaborative culture. Partner with SRE, hardware... systems concepts, and system reliability. Demonstrated experience building resilient and performant software in production...
looking to improve speed, consistency, and reduce manual work. Reliability Engineering: Apply SRE principles to ensure high...
across a global enterprise. It is a platform operations and enablement role -- not DevOps application delivery, production SRE, CI/CD... or microservices engineering, nor full-time software development. WHAT YOU'LL BE DOING: Own platform governance, reliability...
data for trends, improvements, and reliability risks, proposing remediation plans. Proactively engage customers on SLO... for customers. Extensive experience with monitoring tools and platforms Advanced certifications in SRE or related fields. Experience...
, and prioritization Share updates early and often Own the architecture and reliability of our core systems as we scale What Success... downtime during open enrollment Features ship in days, not weeks Key SLAs met (performance, reliability, latency) with no...
services. Experience teaching reliability engineering (e.g. SRE) and/or other scale-oriented cloud systems practices to peers..., accelerated compute infrastructure and codify reliability best-practices in the broader DGX Cloud platform ecosystem...
, visit , or follow us on Twitter @ and . Job Description Join our DevOps/SRE team as a key contributor responsible... control. Reliability & Support Troubleshoot production issues and support on-call rotations. Contribute to incident...
, and ensuring scalability, reliability, and long-term technical excellence. Key Responsibilities: Architecture & Technical... with DevOps/SRE teams on: CI/CD pipeline architecture Observability (metrics, logging, tracing) Auto-scaling, resilience...
, and ensuring scalability, reliability, and long-term technical excellence. Key Responsibilities: Architecture & Technical... with DevOps/SRE teams on: CI/CD pipeline architecture Observability (metrics, logging, tracing) Auto-scaling, resilience...
engineering teams, and ensuring scalability, reliability, and long-term technical excellence. Key Responsibilities... Docker and Kubernetes. Collaborate with DevOps/SRE teams on: CI/CD pipeline architecture Observability (metrics, logging...
on development and focuses on defining architecture, guiding engineering teams, and ensuring scalability, reliability, and long-term... and orchestration strategies using Docker and Kubernetes. Collaborate with DevOps/SRE teams on: CI/CD pipeline architecture...
SRE practices including SLIs, SLOs, error budgets, and reliability-driven engineering decisions. Provide L3/L4 incident...Job Title: Senior Java SRE Experience Required: 15+ years Assignment Duration: 12+ Months Engagement Type: Contract...
Job Title: Senior Java SRE Work Location: McLean, VA (OR) Santa Monica, CA - Onsite Experience Required: 15+ years... available, fault-tolerant systems supporting core banking, payments, and trading platforms. Lead SRE practices including SLIs, SLOs...
GBaMS ReqID:10525316 Role Descriptions: Role Summary We are looking for a Mid-Level Observability Engineer to help.... You will collaborate with application| SRE| and operations teams to ensure systems are observable| supportable| and production-ready. Key...
-on CMTS/Senior Staff Engineer or Solution Architect with deep expertise in database internals, query optimizers, tune.... You will partner with product, SRE, data, and application teams to architect resilient, cost-efficient, and high-performance data...
GBaMS ReqID:10525316 Role Descriptions: Role Summary We are looking for a Mid-Level Observability Engineer to help.... You will collaborate with application| SRE| and operations teams to ensure systems are observable| supportable| and production-ready. Key...
, and maintain high performant, distributed, fault tolerant systems. As a DevOps Engineer you will be filling a mission-critical role... systems by pushing for changes that improve capacity and reliability. Practicing sustainable incident management...
), DevOps (e.g., AWS DevOps Engineer), or related areas (e.g., Google Cloud Professional Data Engineer). Experience in SRE... FinOps Practitioner to join our SRE team. At Domo FinOps is embedded within our SRE practice to drive cost optimization...
Infrastructure (OCI) Gen 2. As an IC3 engineer, you will design, build, and operate provisioning workflows and services that create..., scale, patch, and heal Fusion environments across regions and pillars, with a focus on reliability, security, and cost...
for our guests and partners. We embrace the philosophies of Agile, DevOps, and SRE to accelerate our development process and provide... is collaborative, as we focus on the future instead of the past. As an Engineer, you will work as part of a global team supported...