Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Site Reliability / Observability Engineer, Location: USA

Page: 38

Digital Asset Senior Principal Software Engineer

quality, testing, observability, reliability, and performance. Oversee end-to-end delivery processes, including requirements... and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site...

Company: JPMorgan Chase
Location: New York
Posted Date: 04 Mar 2026

Senior Software Engineer

with observability, diagnostics, and live-site operations for mission-critical services. Experience working in environments with limited..., monitoring, and live-site support. Collaborate with cross-functional partners and partner engineering teams) to translate product...

Company: Microsoft
Location: Redmond, WA
Posted Date: 04 Mar 2026

Product Engineer

High-Impact Backend Role | AI-Native Legal Platform New York / On-Site Full-time | Mid–Senior.... This is a product-first backend role where reliability, safety, and extensibility are mission-critical. What You'll Build Core...

Posted Date: 02 Mar 2026

Senior Platform Engineer – Analytics Platform (Databricks | Palantir)

, applying site reliability engineering principles to drive automation, observability, and resilience across the data platform... Required Qualifications 5+ years in platform engineering, data platform operations, site reliability engineering, DevOps, or related roles...

Company: Conagra Brands
Location: Omaha, NE
Posted Date: 28 Feb 2026
Salary: $81000 - 118000 per year

Lead Product Software Engineer - API Platform SRE Engineer

: Minimum 7 years of software related experience required, with a mixture of Site Reliability, DevOps, or Release Engineering... and observability of systems at scale and detect and alert on trends of information. Define metrics to ensure the high performance...

Company: Wolters Kluwer
Location: Waltham, MA
Posted Date: 28 Feb 2026

Principal Software Engineer (Frontend/Full Stack)

-site health: improve observability, monitoring/alerting, incident response, and reduce time-to-diagnosis through systemic... of improving reliability, performance, and operational excellence through observability and systematic engineering practices....

Company: Microsoft
Location: Redmond, WA
Posted Date: 27 Feb 2026

Senior Software Engineer – AI/ML Infrastructure

. Integrate AI systems with code repositories, CI/CD pipelines, observability tools, and security/compliance frameworks to enhance... reliability and performance. Drive best practices, design reviews, and technical direction, ensuring data governance, security...

Company: Cisco Systems
Location: Milpitas, CA
Posted Date: 26 Feb 2026

Principal Software Engineer, Experimentation Platform - CoreAI

delivery schedules, drive alignment across partner teams, and ensure proper end-to-end testing, live-site coverage, scalability..., production reliability, and security hardening for both protections and detections. Hold accountability as a designated...

Company: Microsoft
Location: Redmond, WA
Posted Date: 26 Feb 2026

Principal/Senior Software Engineer, Experimentation Platform - CoreAI

, reliability, fault tolerance, and cost optimization. Experience using observability tools (logging, metrics, distributed tracing..., security best practices, and deployment infrastructure. Maintain operations of live site services on a rotational on-call basis...

Company: Microsoft
Location: Redmond, WA
Posted Date: 26 Feb 2026

(Senior) Software Engineer, Experimentation Platform - CoreAI

observability tools (logging, metrics, tracing) to diagnose service issues and improve system reliability. Experience.... Build extensible, maintainable services and features with strong diagnosability, reliability, and production-readiness...

Company: Microsoft
Location: Redmond, WA
Posted Date: 26 Feb 2026

Bare Metal Kubernetes Engineer

. Guarantee Reliability and Security: Define and meet rigorous SLIs/SLOs by engineering robust observability stacks (Prometheus... architectures. Observability & Reliability Mindset: Experience building comprehensive monitoring and logging frameworks (ELK...

Company: Pure Storage
Location: Santa Clara, CA
Posted Date: 26 Feb 2026

Senior Software Engineer

. Ensure secure, high-quality product delivery, overseeing system architecture and code quality. Champion Live Site culture..., ensuring reliability and customer delight and mentor engineers, shaping the vision for agentic AI-powered work management. Seek...

Company: Microsoft
Location: Redmond, WA
Posted Date: 21 Feb 2026

Senior Software Engineer

improvements across agentic workflows. Oversee Live Site operations for agentic systems, ensuring reliability, rapid incident... for agent interoperability, real-time processing, and fault tolerance. Drive performance optimization and observability...

Company: Microsoft
Location: Redmond, WA
Posted Date: 20 Feb 2026

Senior DevOps Engineer

advanced deployment and support of enterprise software solutions, digital intelligence (monitoring and observability... their development and deployment processes. Mentor junior engineers on automation, observability, and continuous delivery concepts...

Company: KeyBank
Location: Brooklyn, OH
Posted Date: 20 Feb 2026
Salary: $71000 - 125000 per year

Principal DevOps Engineer- Azure

, and deployment of applications System Reliability and Scalability: Implementing Site Reliability Engineering (SRE) principles... to enhance system reliability, availability, and performance Monitoring and Optimization: Implementing monitoring...

Company: AT&T
Location: Plano, TX
Posted Date: 20 Feb 2026

AI Infrastructure Engineer

, reliability, and scalability of AI platforms. Implement observability for agentic AI systems to ensure reliability, transparency... for its people and its customers. Respect for both work and play, with vehicles that are equally at home at a camp site...

Company: Scout Motors
Location: USA
Posted Date: 19 Feb 2026
Salary: $120000 - 145000 per year

Software Engineer II

performance optimization and observability improvements across agentic workflows. Oversee Live Site operations for agentic systems..., ensuring reliability, rapid incident response, and continuous improvement. Collaborate with partner engineering teams to build...

Company: Microsoft
Location: Redmond, WA
Posted Date: 17 Feb 2026

Staff Data Engineer - AIOps

with infrastructure, platform, security, and product teams to embed AI capabilities into operational systems, observability platforms..., reliability engineering, and automation workflows Conducts architecture and design reviews for AI platforms, data systems, and ML...

Company: American Express
Location: Phoenix, AZ
Posted Date: 14 Feb 2026

Software Engineer Sys 2

GenAI observability pipelines to track trace-level data, prompt inputs and outputs, and model latency. Collaborate closely..., embeddings, reranking) and context engineering focusing on reliability, cost, and latency optimization. Strong agent design...

Company: Lam Research
Location: Fremont, CA
Posted Date: 13 Feb 2026
Salary: $86000 - 183000 per year

AI First Software Engineer

collaborates closely with Product Managers, Data Engineers, Infrastructure, and Site Reliability Engineering (SRE) teams on a daily.... Ownership-driven. You take responsibility for systems end-to-end, including reliability and operational health. Technically...

Company: Openlane
Location: USA
Posted Date: 13 Feb 2026
Salary: $110000 - 130000 per year