leadership in performance attribution, hardware/software observability, and defect localization across GPU compute stacks... PERSON: Are you a hands-on architect in areas like GPU/accelerator or HPC performance engineering, microarchitecture...
Responsibilities: Architect solutions to scale and optimize Monitoring and Repair for components like GPU, CPU, Network, Storage... Job Category: Product Development. Job Description: Our team is the GPU Availability and Monitoring team in the...