Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Software Engineer, Inference Deployment, Location: USA

Page: 6

Senior Deep Learning Software Engineer, PyTorch - TensorRT Performance

We are now looking for a Senior Deep Learning Software Engineer, PyTorch-TensorRT Performance! NVIDIA is seeking... an experienced Deep Learning Engineer passionate about analyzing and improving the performance of Torch inference with TensorRT...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 13 Dec 2025

AI Software Engineer - Full Stack (Junior to Mid)

AI Full-Stack Software Engineer (Junior to Mid) The Human Resources Research Organization (HumRRO) is a non-profit..., and cost-efficient inference strategies. Own features end-to-end: frontend to backend to cloud deployment Collaborate...

Posted Date: 12 Dec 2025
Salary: $70000 - 110000 per year

Senior Software Engineer - Model Inferencing

models, and deliver innovative apps. The OpenShift AI team seeks a Software Engineer with Kubernetes and Model Inference... Runtimes experience to join our rapidly growing engineering team. Our team focuses on making machine learning model deployment...

Company: Red Hat
Location: Raleigh, NC
Posted Date: 07 Dec 2025
Salary: $116270 - 191840 per year

Software Engineer, Full Stack – Scale GP

retrieval, inference, evaluation, and more. We are seeking a strong Full-Stack Engineer to help us build, scale, and refine... our rapidly growing product. The ideal candidate is deeply grounded in software engineering best practices and experienced...

Posted Date: 07 Dec 2025

Senior Software Engineer, Full-Stack – Scale GP

retrieval, inference, evaluation, and more. We are seeking a strong Senior Full-Stack Engineer to help us build, scale..., and refine our rapidly growing product. The ideal candidate is deeply grounded in software engineering best practices...

Posted Date: 06 Dec 2025

MLOps Software Engineer (TS/SCI with Poly)

MLOps Software Engineer Job Type: Full-Time Clearance Required: Active Top Secret with Favorable Polygraph Location...? Our client is seeking a Software Engineer with experience in machine learning operations (MLOps), Kubernetes, and high...

Company: Staffed4U
Posted Date: 28 Nov 2025

Software Engineer - TS/SCI with Poly

Software Engineer $160k- $230k Location: Annapolis Junction, MD Clearance Required: TS/SCI with Polygraph... Position Summary: We are seeking a skilled and mission-driven Software Engineer to support the ML Frameworks team...

Company: Staffed4U
Posted Date: 28 Nov 2025

Software Engineer (Data & ML) - Mountain View, CA

looking for an engineer who can own data collection, scalable data systems, MLOps workflows and ML deployment. You'll work cross-functionally... performance; support CUDA kernels, TensorRT integration, and inference optimization. Build and maintain CI/CD and automated MLOps...

Company: Aeva
Location: Mountain View, CA
Posted Date: 27 Nov 2025
Salary: $154900 - 209600 per year

Staff Software Engineer - AI/ML Infra

an exceptional Senior ML Platform Engineer to build and scale our machine learning infrastructure with a focus on Large Language... stores for ML model training and inference pipelines Build and optimize LLM inference systems using frameworks like vLLM...

Company: GEICO
Location: Chevy Chase, MD
Posted Date: 27 Nov 2025

Lead Forward Deployed Software Engineer

AI companies in the world? THE ROLE: As a Forward Deployment Software Engineer, you will work closely with our most strategic... business value. This role is a unique blend of customer relationship skills and elite software engineer; you will work side...

Posted Date: 26 Nov 2025

Staff Software Engineer - AI/ML Infra

an exceptional Senior ML Platform Engineer to build and scale our machine learning infrastructure with a focus on Large Language... stores for ML model training and inference pipelines Build and optimize LLM inference systems using frameworks like vLLM...

Company: GEICO
Location: Palo Alto, CA
Posted Date: 26 Nov 2025

Sr. Full Stack Software Engineer

continue fulfilling this mission in the years to come! POSITION We are seeking a Senior Full Stack Software Engineer... and other frameworks) into production software for real-time inference. Collaborate with hardware engineers on sensor integration...

Posted Date: 20 Nov 2025

Lead Software Engineer

Job Category: Security Engineering Job Description: As Lead Security Engineer, you will design and optimize large... Generation (RAG) systems, managing vector databases and embedding models. Build and maintain scalable, secure inference...

Company: JPMorgan Chase
Location: Plano, TX
Posted Date: 13 Feb 2026

IT Software Engineer

to Have skills: Ability to create model inference systems with advanced deployment methods that integrate with other MLOps... operationalization and reliability of all models. We are searching for a driven and highly skilled MLOps Engineer to join our MLOps...

Company: Aditi Consulting
Location: Chicago, IL
Posted Date: 13 Feb 2026

IT Software Engineer

to create model inference systems with advanced deployment methods that integrate with other MLOps components like MLFlow... operationalization and reliability of all models. We are searching for a driven and highly skilled MLOps Engineer to join our MLOps...

Company: Aditi Consulting
Location: Chicago, IL
Posted Date: 13 Feb 2026

Staff Software Engineer, Machine Learning

Learning Engineer who operates at the boundary of multiple domains and partners closely with the Director of Engineering... Inference (Revenue or Growth) Strong hands-on engineering skills, with the ability to write, review, and debug production...

Company: Match Group
Location: Palo Alto, CA
Posted Date: 30 Jan 2026

Principal Software Engineer – Large-Scale LLM Memory and Storage Systems

, this platform enables efficient, resilient deployment of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer...NVIDIA Dynamo is a high-throughput, low-latency inference framework for serving generative AI and reasoning models...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Dec 2025

Staff Software Engineer, Infrastructure - Machine Learning

ML Infrastructure: Build and scale distributed systems for ML training, serving, and inference. Design and implement... distributed systems tailored for efficient ML training and seamless operational deployment. Feature Engineering Enhancement...

Company: Ryder
Location: USA
Posted Date: 13 Feb 2026

Software Engineer I, Monetization ML

and processing pipelines that support both real-time inference and batch training workflows Explore and build infrastructure... in Computer Science, Engineering, a related field, or equivalent experience - 1+ years of professional software development...

Company: Amazon
Location: San Francisco, CA
Posted Date: 12 Feb 2026

Software Engineer, Monetization ML

data ingestion and processing pipelines that support both real-time inference and batch training workflows - Explore... Qualifications - Bachelor's degree in Computer Science, Engineering, or a related field - 1+ years of professional software...

Company: Amazon
Location: Seattle, WA
Posted Date: 11 Feb 2026