and opportunities at the kernel level. Develop, optimize, and evaluate both hand-tuned and compiler-generated kernels for inference... and compiler teams to iterate on kernel fusion, auto tuning, and sophisticated GPU programming techniques. Benchmark performance...
Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning Compiler (DLC) team. Academic and commercial... compiler must deliver leading inference performance, fast build time, reduced memory footprints, and ease of use in the forms...
methods, and validate improvements. If you’re passionate about system-level performance, compiler IR, and GPU kernel...We're now seeking a Senior AI Software Engineer, in our LLM Inference Performance Analysis and Optimization team...
BLAS or deep learning library kernel code CUDA/OpenCL GPU programming Numerical methods and linear algebra LLVM, TVM...We are now looking for a Senior Performance Software Engineer for Deep Learning Libraries! Do you enjoy tuning parallel...
: Strong programming skills in C/C++ and Python. Experience with GPU kernel programming using CUDA, HIP or OpenCL. Experience..._ THE ROLE: As a Forward Deployed Software Engineer, you will work closely with our most strategic partners as a hands...