of software and hardware. KEY RESPONSIBILITIES: Support AMD’s RCCL, an open source, GPU-accelerated communication collective... working with Linux Kerner, Device drivers and network drivers. Experience designing and building GPU Networks for Large Scale...
and hardware. KEY RESPONSIBILITIES: Support AMD’s RCCL, an open source, GPU-accelerated communication collective middleware... clusters and debug complex system level issues that could span across different layers of the software stack: gpu kernel...