We are now looking for a Senior Deep Learning Architect, LLM Inference! NVIDIA is at the forefront of the generative... with the rapid pace of Generative AI. Contribute to deep learning software projects, such as PyTorch, TRT-LLM, vLLM...
your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building...) and will be upstreamed into open-source inference frameworks such as vLLM and SGLang to make AMD a first-class platform for LLM serving...
, and inference, all on a large scale! We are seeking a hands-on Solutions Architect with deep expertise in backend infrastructure... environments (e.g., AWS, Azure, GCP, on-prem). Accelerate inference pipelines using NVIDIA NIM, TensorRT-LLM, vLLM, SGLang...
from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve as a technical authority... of our AI platform - ML inference. Beyond individual contribution, you will lead complex technical projects, mentor senior engineers...