those involving compound AI. Experience with processor and system-level performance modeling. GPU programming experience with CUDA... boundaries of what's possible with LLMs by improving the algorithmic performance and efficiency of systems that represent them...
, and drive proactive mitigations. Apply AI to identify emerging usage trends, performance hotspots, and workload irregularities... analysis to improve live-site reliability and engineering velocity. Contribute to hotfixes, performance tuning, and reliability...
/memory optimization, high-performance GPU programming, compiler-based optimization, fine-grained parallel library and runtime...., communication optimization, network architecture design, network programming). Experience on high performance computing (e.g., cache...
of hyperscale infrastructure and AI, shaping the operational backbone of one of the largest GPU clusters in private deployment... of a founding engineering team, this is the place to do it. Responsibilities: Design, deploy, and maintain large-scale GPU...
in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. Our invention, the GPU, serves...As one of the technology industry's most desirable employers, NVIDIA is an industry leader in high-performance...
AI/Machine Learning (ML) inference, Graphics Processing Unit (GPU) clusters, or High-Performance Computing (HPC) systems... developments that will improve the availability, reliability, efficiency, observability, and performance of products...
to software. Evaluate and make recommendations that advance Azure infrastructure for AI and other GPU-based workloads. Leads... performance and maintainability, effectiveness, and return on investment (ROI). Applies metrics to drive the quality and stability...
potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self...: System/board level debug and validation of NVIDIA's products in fields like GPU cards, MGX servers, automotive, robotics...
pipelines and infra. Performance Optimization: Analyze system performance and scalability, optimize resource utilization... (compute, GPU clusters, storage, networking). Automation & Tooling: Build automation for deployments, incident response...
distributed systems in Kubernetes and containerized environments using Docker. Familiarity with GPU resource management, including... NVIDIA GPU Operator and device plugin lifecycle. Experience with CI/CD workflows and infrastructure automation tools...
NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics..., and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU...
an inclusive and high-performance culture. Contribute to engineering practices and continuous improvement. Bachelor's Degree...., COM, gRPC). Experience in debugging, including telemetry and performance measurement. Ability to diagnose complex issues...
, debug qual fails, generate daily qual summary and final reports Experience with characterizing CPU, GPU, Memory..., Multimedia, Audio, Thermal and Power performance of SOC/module is required. Strong understanding of software automation...
across multiple hardware generations. Own the end-to-end lifecycle of GPU drivers (NVIDIA, amdgpu, ROCm), including:Integration of out... debugging across the full stack - from kernel to device firmware - to resolve performance, stability, or bring-up issues. Drive...
Develop compute-intensive and performance optimized test applications running on the MAIA hardware systems, focusing... for functional and performance scenarios of the system. Collaborate across organizations with AI frameworks software, silicon...
stress and performance scenarios to meet test plan goals Actively participate in chip bring up and write test firmware... in pre-silicon validation with a proven track record of delivering high performance Network switches/accelerators, Central...
, Virtualization and Security Experience analyzing CPU, GPU or System-level Micro-Architectural features to identify performance... your career. THE ROLE: AMD is looking for an engineering leader passionate about driving the best Power, Performance, Area...
of delivering high performance Central Processing Unit (CPU), Vector processors and Graphics Processing Unit (GPU's) or relevant..., creating stress and performance scenarios to meet test plan goals Actively participate in chip bring up and write test firmware...
potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self... from model fine-tuning and quantization to low-level kernel development and performance optimization. Develop workflows...
distributed systems in Kubernetes and containerized environments using Docker. Familiarity with GPU resource management, including... NVIDIA GPU Operator and device plugin lifecycle. Experience with CI/CD workflows and infrastructure automation tools...