We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its... TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning...
We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep... Learning Engineer passionate about analyzing and improving the performance of LLM inference! NVIDIA is rapidly growing...
and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...
NVIDIA is hiring a Senior Systems Software Engineer for machine learning data modeling to join the TAO Toolkit ML Data... and familiar with deep learning architectures and tools like NVIDIA TensorRT-LLM, Multimodal-LLM, and Triton Server. NVIDIA...
the industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the... TensorRT team, you will be responsible for designing and implementing inference software optimizations to power AI applications...
and motivated System Software Engineer who is passionate about AI Infrastructure. You will collaborate with engineering, product..., and distributed training methods. Understanding of CPU/GPU architecture plus CUDA, cuDNN, TensorRT‑LLM, Triton, NCCL Excellent...
We are now seeking a Senior DevOps and Build Systems Engineer for NVIDIA AI TensorRT-LLM team. This is a unique... infrastructure from first principles needed to deliver TensorRT LLM Maintain CI/CD pipelines to automate the build, test...
design to integration—within NVIDIA’s ecosystem (TensorRT Model Optimizer, NeMo/Megatron, TensorRT-LLM) and open-source... stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Deploy optimized models into leading OSS inference...
's AI software stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Construct and curate large problem specific...—within NVIDIA’s ecosystem (TensorRT Model Optimizer, Megatron-LM, Megatron-Bridge, Nvidia-NeMo, NeMo-AutoModel, TensorRT-LLM...
We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang. As NVIDIA makes inroads into the Datacenter business...
We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... AI applications. Convert, deploy, and optimize models for efficient inference using frameworks such as TensorRT, TensorRT-LLM, vLLM...
crowd: Hands-on with LLM inference stacks (Triton Inference Server, TensorRT-LLM, vLLM). Experience optimizing large...NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Software Architect...