Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: AI Performance Architect, Training and Inference, Location: Santa Clara, CA

Page: 1

AI Performance Architect, Training and Inference

in inference, fine tuning and/or training MS with years of related experience or PhD with years of related experience in Computer... of developers pushing the boundaries of efficiency and performance to enable and optimize the software ecosystem for the...

Posted Date: 11 Sep 2025

Senior Solutions Architect, Generative AI Inference and Deployment

? You will become a trusted technical advisor with our customers and work on exciting projects and proof-of-concepts focused on inference... for Generative AI and Large Language Models (LLMs). You will also collaborate with a diverse set of internal teams on performance...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 26 Jul 2025

Principal Software Engineer - Inference as a Service

, ensure service stability, and deliver high-performance, low-latency inference at a massive scale. What you'll be doing...-time observability, performance profiling, and debugging of inference services. Drive architectural decisions...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 22 Aug 2025

Performance Modeling Architect- Data Center GPU

performance gains in both training and inference pipelines through innovative system design and optimization. You will champion... and optimization for multi-trillion parameter LLM training/inference including Dense, Mixture of Experts (MoE) with multiple modalities...

Posted Date: 18 Sep 2025

Senior Deep Learning Performance Architect, Compute Energy Efficiency

We are now looking for a Senior Deep Learning Performance Architect for Compute Energy Efficiency! NVIDIA is seeking... Computing and parallel programming models such as CUDA Experience with deep neural network training, inference...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 17 Aug 2025

Senior Deep Learning Performance Architect

We are now looking for a Senior Deep Learning Performance Architect! NVIDIA is seeking outstanding Performance.... Background in GPU or Deep Learning ASIC architecture evaluation for training and/or inference. Strong programming skills...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 03 Aug 2025

Senior Deep Learning Performance Architect

We are now looking for a Senior Deep Learning Performance Architect! NVIDIA is seeking outstanding Performance... with deep neural network training, inference and optimization in leading frameworks (e.g. Pytorch, JAX) Intelligent machines...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 18 Jul 2025

Senior Solutions Architect, GPU - Cloud Service Providers

performance aspects related to tasks like large scale LLM training and inference. Conducting regular technical customer meetings...) experience. Hands-on experience building performance benchmarks for data center systems, including large scale AI training...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 01 Oct 2025

Senior Solutions Architect - GPU

performance aspects related to tasks like large scale LLM training and inference. Conducting regular technical customer meetings...) experience. Hands-on experience building performance benchmarks for data center systems, including large scale AI training...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 30 Sep 2025

Senior Software Architect, Agentic AI

, robotics, gaming, virtual reality, and high-performance computing. We are now looking for a senior SW AI architect to help... to solve real-world engineering problems. Experience with training/fine-tuning custom models, building multi-agent systems...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 20 Sep 2025

Senior Architect, AI-assisted Code Development

-performance computing. We are now looking for a senior software architect with expertise in AI-assisted coding to help improve.... Experience with training/fine-tuning custom models, building multi-agent systems, retrieval augmented generation (RAG) pipelines...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 20 Sep 2025

Solutions Architect, Applied AI

profiling and optimization for AI training and inference workloads. Ability to learn fast and quickly adapt to change. Clear... training foundational models Experience on high-performance NVIDIA GPU computing clusters. Extensive engineering...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 04 Sep 2025

Solutions Architect, DGX Cloud

? We are looking for a hardworking Solution Architect (SA) to join the DGX Cloud SA Segment Team. The mission of the DGX Cloud Segment team is to guide... user experience. Additionally, we will collaborate with internal teams to scale expertise and knowledge through training...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 03 Sep 2025

Senior Solutions Architect, GPU - Cloud Service Providers

performance aspects related to tasks like large scale LLM training and inference. Conducting regular technical customer meetings...) experience. Hands-on experience building performance benchmarks for data center systems, including large scale AI training...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 23 Aug 2025

Senior Solutions Architect, Generative AI

AI software libraries and GPUs. Experience with profiling and optimizing model training/inference performance on GPUs...NVIDIA is looking for an AI Solutions Architect with hands-on experience in efficient AI model training...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 11 Aug 2025

Solutions Architect - Accelerated Computing

and application deployment as well as optimization towards large-scale AI training and inference and HPC Build custom product...We are looking for an Accelerated Computing Solutions Architect! NVIDIA is searching for a Solutions Architect...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 13 Jul 2025

Senior Datacenter System Software Architect - DGX Cloud

. On this team, you will do full stack deployment including hardware architecture, workload orchestration and application performance... your software integrates seamlessly from the hardware all the way up to the AI training applications. What we need...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 17 Aug 2025

Senior Storage and Networking Product Engineer

-performance infrastructure. This role is vital for the flawless operation of NVIDIA’s innovative compute platforms, integrating..., this is the opportunity to redefine data movement, system resilience, and automation! What you'll be doing: Architect, deploy...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 27 Sep 2025

Senior Staff Software Development Engineer- GPU/AI/ML

: Architect and Drive the AI Software Stack: You will establish best practices and optimize performance from the lowest-level GPU..._ THE ROLE: AMD is looking for an influential software engineer who is passionate about improving the performance of key...

Posted Date: 20 Sep 2025

Senior Staff Software Development Engineer- GPU, LLM, AI

: Architect and Drive the AI Software Stack: You will establish best practices and optimize performance from the lowest-level GPU..._ THE ROLE: AMD is looking for an influential software engineer who is passionate about improving the performance of key...

Posted Date: 13 Sep 2025