We are now looking for a Senior GPU Architect, Profiling System! NVIDIA’s invention of the GPU in 1999 sparked the... to contribute to the design of our proprietary profiler subsystem, the apparatus embedded in every GPU that enables our profiling...
We are now looking for a Senior Kernel Performance Architect for Deep Learning Software! NVIDIA is seeking... company. What you will be doing: Craft GPU-accelerated system architectures that push the boundaries of deep learning...
...large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU... kernels and compilers, drive industry benchmarks, and scale workloads across multi-GPU, multi-node, and multi-cloud...
...like a single system at datacenter scale. As large language models rapidly outgrow the memory and compute budget of any single GPU... across multi-node distributed environments. Built in Rust for performance and Python for extensibility, Dynamo orchestrates GPU...