. Today, we are increasingly known as “the AI computing company”. We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring... backbone of NVIDIA’s inference engine, spanning across data centers, personal devices, automotive, and robotics. The compiler...
We are now looking for a Senior Machine Learning Applications and Compiler Engineer! NVIDIA is seeking engineers... to develop algorithms and optimizations for our inference and compiler stack. You will work at the intersection of large-scale...
Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning Compiler (DLC) team. Academic and commercial... compiler must deliver leading inference performance, fast build time, reduced memory footprints, and ease of use in the forms...
We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact... in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially...
. This role is deeply focused on LLM inference stacks, including vLLM, SGLang, and internal inference platforms. You will work... GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the...
your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building... architecture-level improvements in inference platforms. KEY RESPONSIBILITIES: Inference Framework & Runtime Architect...
with SGLang or similar LLM inference frameworks is highly preferred. Compiler and GPU Architecture Knowledge: Background..., and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node systems...
inference parallelism and collective communication strategies. Graph Compiler Integration: Integrate and optimize runtime... or similar LLM inference frameworks is highly preferred. Compiler and GPU Architecture Knowledge: Background in compiler...
to do their best work. Come join the team and see how you can make a lasting impact on the world. We are looking for outstanding Senior... High Performance AI Engineer to build groundbreaking multi-agent systems for the CUDA ecosystem. We build innovative...
platforms by shifting complexity from silicon into software. We design and maintain the hardware abstraction layers, core system... libraries, and runtime components that allow compiler teams and data center operators to safely and efficiently execute...
engineer to bring advanced communication technologies into AI stacks, including PyTorch, TRT-LLM, vLLM, SGLang, JAX... GPUs to inference down at microsecond latency. Communication performance between the GPUs has a direct impact...
to develop the best solution for partners working on our platforms. What you'll be doing: Developing compiler technologies... to accelerate deep learning inference on NVIDIA hardware platforms for Physical AI. Working across a wide range of abstractions...
to develop the best solution for partners working on our platforms. What you'll be doing: Developing compiler technologies... to do their best work. Come join the team and see how you can make a lasting impact on the world. We are looking for outstanding Senior...
with compiler infrastructure for large language model inference. Exposure to robotics or embedded AI pipelines, including...Are you passionate about pushing the limits of real-time large language model inference? Join NVIDIA’s TensorRT Edge...