: Define and architect innovative features for next-generation GPU memory and on-chip interconnect subsystems. Develop... architect to contribute to the development of future high-performance GPU computing systems. Ideal candidates...
architecture and micro-architecture features to improve the state-of-the-art in GPU memory systems, optimizing along the axes...NVIDIA is seeking a world-class computer architect to contribute to the development of future high-performance...
a motivated system architect to define future aspects of our GPU through employing pioneering technologies. Your role... and power efficiency, area, yield, effort, and schedule. Benchmark GPU configurations (core count, memory and interconnect...
A key part of NVIDIA's strength is to innovate how we architect and develop our GPU for the changing AI and accelerated.... We are looking for a Principal System Architect with a wealth of experience in shaping and bringing to fruition high-performance, high-volume System...
purpose computation on the GPU. Our team delivers features and improvements to better realize the potential of NVIDIA hardware... capabilities. To accomplish this, the CUDA driver interacts with GPU hardware, kernel mode drivers, and the operating system...
NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the... PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep...
like a single system at datacenter scale. As large language models rapidly outgrow the memory and compute budget of any single GPU... and evolve a unified memory layer that spans GPU memory, pinned host memory, RDMA-accessible memory, SSD tiers, and remote file...
for all of Intel’s processor platforms, both CPU and GPUs. Position Overview We’re hiring a Senior Compiler Architect to own the... runtime architecture for Intel’s open‑source compiler stack (CPU + GPU/accelerators). You will set technical direction, design...
in one or more of the following: CPU or GPU, Memory sub-systems, Fabrics, CPU/GPU coherency, Multimedia, I/O subsystems, Clocks, Resets... switch fabrics (coherent and non-coherent) Experience with modern heterogenous systems including CPU, GPU...
& Analysis Maintain deep knowledge of AI platform architectures (GPUs, TPUs, custom accelerators), focusing on compute, memory... counterpart to senior engineering leaders at hyperscale customers. Build credibility and long-term relationships by engaging...
We are hiring senior engineers to work on the CUDA driver and runtime, core components of our platform for accelerating... general purpose computation on the GPU. Our team analyzes performance of applications, investigates bottlenecks in software...
, generative AI fabrics, and global-scale networks that demand uncompromising performance. As a Senior Distinguished Engineer... growth, delivering the networking technology that fuels large-scale GPU clusters and AI fabrics. Our scale-out architectures...
architect for a Senior System Engineer role for system bringup and datacenter applications. Be a key player to the most exciting.... You will interact with HPC, OS, GPU compute, and systems specialist to architect, develop and bring up large scale performance platforms...
Always-On Profiling (AON) team as a Senior Software Architect, where you'll be pivotal in designing, implementing, and leading...Are you ready to innovate GPU performance analysis for Machine Learning workloads?! Join our Developer Tools...
HW acceleration engines and software verification test plans. Knowledge of CPU, GPU architectures, memory coherence... of our platform for accelerating general purpose computation on the GPU. Our team delivers features and improvements to better realize...
, sensor, or rendering software. Familiarity with GPU processing and rendering pipelines, synchronization, GPU memory... open-source framework for sensor AI, enabling developers to build, optimize, and deploy GPU-accelerated pipelines...
large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU...., PyTorch) and inference engines (e.g., vLLM and SGLang). Familiarity with GPU programming and performance: CUDA, memory...
testplans. Knowledge of CPU, GPU architectures, memory coherence and consistency models Some familiarity with kernel mode...We are hiring senior engineers to work on the CUDA driver, a core component of our platform for accelerating general...