Keywords: Senior AI and HPC Observability Engineer, Location: Santa Clara, CA

Senior AI and HPC Observability Engineer

We are looking for a strong AI & HPC Observability Engineer to build and scale next-generation Observability and Telemetry platforms. You will design... designing and scaling observability platforms for AI, GPU, or HPC environments Hands-on expertise with OpenTelemetry...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 03 Mar 2026

Senior Site Reliability Engineer - HPC

) that deep‑dive into real‑world reliability, observability, or large‑scale HPC/SRE problems and their solutions. Maintainer.... We’re looking for a Senior SRE to join our Compute Farm team and help build the next generation of our global services...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 27 Feb 2026

Senior Software Engineer, Observability

NVIDIA's Observability team is seeking a Senior/Staff Engineer to compose and build the next-generation, multi-region... while supporting high-volume workloads (AI/ML, HPC clusters, GPU infrastructure) Embedding security guidelines into observability...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 10 Dec 2025

Senior System Software Engineer, Firmware

architect for a Senior System Engineer role for system bringup and datacenter applications. Be a key player to the most exciting.... You will interact with HPC, OS, GPU compute, and systems specialist to architect, develop and bring up large scale performance platforms...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 05 Dec 2025

Senior Software Engineer - Storage

some of the world’s most advanced computing workloads. We are seeking a Software Engineer to join our MARS team at NVIDIA... improvements in system reliability, performance, and observability to meet exascale standards. Partner with security, networking...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 21 Feb 2026