Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: ML Engineer - Inference Serving, Location: Palo Alto, CA

Page: 1

ML Engineer - Inference Serving

serving deployment pipeline for a custom vendor Integrate our inference stack into an online reinforcement learning pipeline... lifetime of any inference workload Tech stack Must have Python Redis S3-compatible Storage Model serving...

Company: Luma AI
Location: Palo Alto, CA
Posted Date: 23 Jan 2026

Senior Software Engineer, Inference Platform

, and multi-tenant service design Familiar with concepts in ML model serving and inference runtimes, even if not directly...About the Role We're looking for a Senior Engineer to help build the next-generation inference platform that supports...

Company: MongoDB
Location: Palo Alto, CA
Posted Date: 09 Jan 2026

Staff Software Engineer - AI/ML Infra

an exceptional Senior ML Platform Engineer to build and scale our machine learning infrastructure with a focus on Large Language... stores for ML model training and inference pipelines Build and optimize LLM inference systems using frameworks like vLLM...

Company: GEICO
Location: Palo Alto, CA
Posted Date: 26 Nov 2025

Software Engineer - Reliability

GPU clusters for AI/ML workloads (training or inference). Familiarity with job management systems based on Kubernetes...-principles engineer who is fluent in Linux, comfortable operating close to the metal, and capable of architecting systems for the...

Company: Luma AI
Location: Palo Alto, CA
Posted Date: 07 Dec 2025

Principal Cloud Backend Engineer

for our inference serving and monetization platform. Design systems that are fault-tolerant, highly available, and can scale to meet... platforms for serving, scaling, and managing AI models (e.g., inference servers, model deployment pipelines). What You'll...

Company: SambaNova
Location: Palo Alto, CA
Posted Date: 28 Nov 2025

Cloud Platform Engineer

computational problems. The Role As a Cloud Platform Engineer, you will be specializing in our AI Inferencing Service... and operations, applying an engineering mindset to solve operational challenges. Your primary focus will be ensuring our inference...

Company: SambaNova
Location: Palo Alto, CA
Posted Date: 23 Nov 2025

Senior Cloud Platform Engineer

computational problems. The Role As a Senior Cloud Site Reliability Engineer (SRE) specializing in our AI Inferencing Service... and operations, applying an engineering mindset to solve operational challenges. Your primary focus will be ensuring our inference...

Company: SambaNova
Location: Palo Alto, CA
Posted Date: 23 Nov 2025