intersection of distributed systems, kernel-level performance engineering, and large-scale model training. The ideal candidate can..., distributed training frameworks, or performance-critical kernels. Prior experience collaborating directly with hardware vendors...