Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Senior Software Development Engineer - LLM Kernel & Inference Systems, Location: Santa Clara, CA

Page: 1

Senior Software Development Engineer - LLM Kernel & Inference Systems

GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the... inference systems (e.g., FasterTransformer), with demonstrated performance tuning. * GPU Kernel Development Proven experience...

Posted Date: 19 Dec 2025

Senior Software Development Engineer – LLM Inference Framework

your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building.... This role sits at the intersection of inference engines, distributed systems, and GPU runtime and kernel backends. THE PERSON...

Posted Date: 19 Dec 2025

Senior Software Development Engineer – SGLang and Inference Stack

, and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node systems... engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux...

Posted Date: 12 Feb 2026

Senior Software Development Engineer - SGLang and Inference Stack

and tune large-scale training and inference models for optimal performance on AMD hardware. GPU Kernel Development: Design..., and enabling training and inference at scale across multi-GPU and multi-node systems. You will collaborate across internal GPU...

Posted Date: 20 Dec 2025

Principal Software Engineer - AI Inference

NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference... to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Feb 2026

Senior AI Inference Compiler Engineer

of deep learning models, algorithms and frameworks, such as PyTorch, XLA etc. Understanding of LLM inference optimizations.... Today, we are increasingly known as “the AI computing company”. We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 26 Feb 2026

Senior Software Engineer – TensorRT Edge-LLM

-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack... development for critical transformer components such as attention, GEMM, and MoE. Benchmark, profile, and optimize inference...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 14 Feb 2026

Senior GenAI Algorithms Engineer — Post-Training Optimizations

focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference... architecture search, and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 01 Feb 2026

Senior Deep Learning Framework Communications Engineer

, and Inference Engines such as TRT-LLM, vLLM, SGLang Rapid prototyping and development with Python, C++, CUDA or related DSLs... engineer to bring advanced communication technologies into AI stacks, including PyTorch, TRT-LLM, vLLM, SGLang, JAX...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Jan 2026

Principal Machine Learning Engineer (Prisma AIRS)

from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research on cutting-edge areas... understanding of attention mechanisms and related knowledge is a plus. Demonstrated expertise with modern LLM inference engines (e.g...

Location: Santa Clara, CA
Posted Date: 22 Feb 2026

Principal Machine Learning Platform Engineer (Prisma AIRS)

from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve as a technical authority... of our AI platform - ML inference. Beyond individual contribution, you will lead complex technical projects, mentor senior engineers...

Location: Santa Clara, CA
Posted Date: 29 Jan 2026