Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: AI Inference Performance Engineer, Location: Santa Clara, CA

Page: 4

Senior Deep Learning Software Engineer

We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference and deployment... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 31 Jan 2026

Senior Software Engineer, Metropolis Vision AI

of a strategic platform with high visibility and real-world impact. As a System Software Engineer for Vision AI, you will develop... and optimize high-performance vision systems that turn massive streams of video, image, and 3D data into actionable insights...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 31 Jan 2026

Staff Software Engineer - AI/ML

Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw.... Design and develop scalable, maintainable, and reusable software components with a strong emphasis on performance...

Company: ServiceNow
Location: Santa Clara, CA
Posted Date: 31 Jan 2026

Sr Staff Software Engineer (DLP)

automated ML pipelines and optimized performance across data ingestion, model serving, and inference workflows Execute on the... of cloud security utilizing big data and analytics. We are looking for an Engineer to join the team that is building our latest...

Location: Santa Clara, CA
Posted Date: 30 Jan 2026

Principal Machine Learning Platform Engineer (Prisma AIRS)

from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve as a technical authority... of our AI platform - ML inference. Beyond individual contribution, you will lead complex technical projects, mentor senior engineers...

Location: Santa Clara, CA
Posted Date: 30 Jan 2026

Machine Learning Engineer, GeForce G-Assist

to runtime performance. Optimize local inference using llama.cpp, including quantization, memory usage, and performance tuning.... Read, write, and optimize C/C++ code in performance-critical paths. Design and integrate retrieval-augmented generation...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 30 Jan 2026

Senior Software Test Development Engineer - Deep Learning

We are looking for a Software Test development engineer in NVIDIA’s Deep Learning SWQA team. The position is in NVIDIA... and measure the performance of NVIDIA‘s Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 30 Jan 2026

Senior Deep Learning Algorithm Engineer

NVIDIA’s GPU Workload Efficiency (GWE) team is looking for a skilled Senior Engineer to enhance performance in training... understanding of training and inference constraints. Proven ability in GPU performance analysis and profiling, with hands...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Jan 2026

Senior Developer Technology Engineer - Windows AI Platform

world. As a Developer Technology Engineer, you will be at the forefront of innovation, working with leading industry... resulting in suboptimal runtime performance. Conduct hands-on trainings, develop sample code and host presentations to give...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Jan 2026

Senior Staff Software Engineer - AI/ML

Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw... scalable, secure, and high-performance AI solutions. Build high-quality, clean, scalable and reusable code by enforcing...

Company: ServiceNow
Location: Santa Clara, CA
Posted Date: 24 Jan 2026

Senior Deep Learning Framework Communications Engineer

GPUs to inference down at microsecond latency. Communication performance between the GPUs has a direct impact...NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Jan 2026

Distinguished Engineer, Switch Architect

and packet processing Scaling networks in AI training and inference fabrics Proven hands-on experience with performance... shapes product definition, behavioral modeling, device performance modeling, and architectural validation. We work at the...

Company: Marvell
Location: Santa Clara, CA
Posted Date: 23 Jan 2026

Software Engineer

and change any time whether you accept cookies or choose to opt out of cookies to improve website's performance, as well... Jobs Job Description Software Engineer Job Location: Santa Clara, California Location Flexibility: Multiple Locations in Country Req Id: 3848...

Company: Fujitsu
Location: Santa Clara, CA
Posted Date: 22 Jan 2026

AI Cluster Validation Engineer

with running inference workloads in AI clusters with different inference frameworks like vLLM, SGLang. Running performance... your career. THE ROLE: AMD is looking for an AI Cluster Validation Engineer who is passionate about complex AI solutions...

Posted Date: 21 Jan 2026

Senior Deep Learning Compiler Engineer

compiler must deliver leading inference performance, fast build time, reduced memory footprints, and ease of use in the forms... Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning Compiler (DLC) team. Academic and commercial...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 16 Jan 2026

Senior GenAI Algorithms Engineer

etc) and related performance analysis and tuning Hands-on experience with inference and deployment environments would be an asset...We are now looking for a Senior Gen AI Algorithms Engineer! NVIDIA is seeking engineers to design, develop and optimize...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 15 Jan 2026

Senior Deep Learning Algorithm Engineer

We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... Engineer with hands-on experience optimizing and deploying Large Language Models (LLMs) and Vision-Language Models (VLMs...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 15 Jan 2026

Senior System Software Engineer - Dynamo

improving performance of AI inference systems. Background with deep learning algorithms and frameworks. Especially experience...We are now looking for a Senior System Software Engineer to work on . NVIDIA is hiring software engineers for its GPU...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 14 Jan 2026

Senior Robotics Systems Software Engineer - ROS

We are looking for a Senior Systems Software Engineer for our Robotics Team working on . Modern robot development... inference. NVIDIA’s ISAAC platform binds together high-fidelity visual and physical simulation, a high-quality development...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 14 Jan 2026

Technical Marketing Engineer - AI Networking

accurate and impactful Drive the performance characterization of complex training and inference workloads on world-class..., and PyTorch, focused on validating and demonstrating distributed training and inference performance over NCCL, RoCE, and RDMA...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 13 Jan 2026