Job Search Results

Senior Software Engineer, TensorRT-LLM

We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its... TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 04 Mar 2026

Senior Software Engineer – TensorRT Edge-LLM

-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack... with popular LLM frameworks and libraries such as TensorRT, TensorRT-LLM, vLLM, SGLang, MLC-LLM, or FlashInfer. A track record...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 13 Feb 2026

Principal Software Engineer – Large-Scale LLM Memory and Storage Systems

, this platform enables efficient, resilient deployment of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer... serving engines (such as vLLM, SGLang, TensorRT-LLM), with a focus on KV-cache offload, reuse, and remote sharing...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 23 Dec 2025

Senior Deep Learning Software Engineer, Inference and Model Optimization

and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 27 Feb 2026

Senior System Software Engineer - Dynamo-Triton Inference Server

We are looking for a Senior System Software Engineer to work on . NVIDIA is hiring software engineers for its GPU...-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 20 Feb 2026

Senior Deep Learning Software Engineer

We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference and deployment... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 31 Jan 2026

Senior Deep Learning Software Engineer, Inference and Model Optimization

and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 23 Jan 2026

Senior GenAI Algorithms Engineer — Post-Training Optimizations

's AI software stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Construct and curate large problem specific...—within NVIDIA’s ecosystem (TensorRT Model Optimizer, Megatron-LM, Megatron-Bridge, Nvidia-NeMo, NeMo-AutoModel, TensorRT-LLM...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 01 Feb 2026

Senior Deep Learning Algorithm Engineer

We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang. As NVIDIA makes inroads into the Datacenter business...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 15 Jan 2026

Senior Developer Technology Engineer - Windows AI Platform

. Improve Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of OSS software... LLM and GenAI software. Experience with CUDA and NVIDIA's Nsight GPU profiling and debugging suite. Some travel...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 24 Jan 2026

Principal Machine Learning Engineer (Prisma AIRS)

from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research on cutting-edge areas...., vLLM, SGLang, TensorRT-LLM) is required. Open-source contributions in these areas are a significant plus. Experience...

Apply Now

Company: Palo Alto Networks

Location: Santa Clara, CA

Posted Date: 22 Feb 2026

Principal Machine Learning Platform Engineer (Prisma AIRS)

with modern LLM inference engines (e.g., vLLM, SGLang, TensorRT-LLM) is required. Open-source contributions in these areas... from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve as a technical authority...

Apply Now

Company: Palo Alto Networks

Location: Santa Clara, CA

Posted Date: 30 Jan 2026

Find your dream job now!

Keywords: Senior Software Engineer, TensorRT-LLM, Location: Santa Clara, CA

Page: 1

Senior Software Engineer, TensorRT-LLM

Senior Software Engineer – TensorRT Edge-LLM

Principal Software Engineer – Large-Scale LLM Memory and Storage Systems

Senior Deep Learning Software Engineer, Inference and Model Optimization

Senior System Software Engineer - Dynamo-Triton Inference Server

Senior Deep Learning Software Engineer

Senior Deep Learning Software Engineer, Inference and Model Optimization

Senior GenAI Algorithms Engineer — Post-Training Optimizations

Senior Deep Learning Algorithm Engineer

Senior Developer Technology Engineer - Windows AI Platform

Principal Machine Learning Engineer (Prisma AIRS)

Principal Machine Learning Platform Engineer (Prisma AIRS)