Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Senior Software Engineer, TensorRT-LLM, Location: Santa Clara, CA

Page: 1

Senior Software Engineer, TensorRT-LLM

We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its... TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 16 Aug 2025

Senior Deep Learning Software Engineer, LLM Performance

We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep... Learning Engineer passionate about analyzing and improving the performance of LLM inference! NVIDIA is rapidly growing...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 31 Jul 2025

Senior Deep Learning Software Engineer, Inference and Model Optimization

and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 18 Oct 2025

Senior Systems Software Engineer, TAO Machine Learning Data Modeling

NVIDIA is hiring a Senior Systems Software Engineer for machine learning data modeling to join the TAO Toolkit ML Data... and familiar with deep learning architectures and tools like NVIDIA TensorRT-LLM, Multimodal-LLM, and Triton Server. NVIDIA...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 12 Oct 2025

Senior Software Engineer, Machine Learning Inference

the industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the... TensorRT team, you will be responsible for designing and implementing inference software optimizations to power AI applications...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 01 Oct 2025

Senior System Software Engineer, AI Infrastructure

and motivated System Software Engineer who is passionate about AI Infrastructure. You will collaborate with engineering, product..., and distributed training methods. Understanding of CPU/GPU architecture plus CUDA, cuDNN, TensorRTLLM, Triton, NCCL Excellent...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 07 Aug 2025
Salary: $120000 - 189750 per year

Senior DevOps and Build Systems Engineer

We are now seeking a Senior DevOps and Build Systems Engineer for NVIDIA AI TensorRT-LLM team. This is a unique... infrastructure from first principles needed to deliver TensorRT LLM Maintain CI/CD pipelines to automate the build, test...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 21 Oct 2025

Senior GenAI Algorithms Engineer — Model Optimizations for Inference

design to integration—within NVIDIA’s ecosystem (TensorRT Model Optimizer, NeMo/Megatron, TensorRT-LLM) and open-source... stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Deploy optimized models into leading OSS inference...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Sep 2025

Senior GenAI Algorithms Engineer — Post-Training Optimizations

's AI software stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Construct and curate large problem specific...—within NVIDIA’s ecosystem (TensorRT Model Optimizer, Megatron-LM, Megatron-Bridge, Nvidia-NeMo, NeMo-AutoModel, TensorRT-LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 19 Sep 2025

Senior Deep Learning Algorithm Engineer

We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang. As NVIDIA makes inroads into the Datacenter business...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 08 Aug 2025

Senior DL Algorithms Engineer - Cosmos

We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... AI applications. Convert, deploy, and optimize models for efficient inference using frameworks such as TensorRT, TensorRT-LLM, vLLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 06 Aug 2025

Software Architect, NIM Factory

crowd: Hands-on with LLM inference stacks (Triton Inference Server, TensorRT-LLM, vLLM). Experience optimizing large...NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Software Architect...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 16 Sep 2025