serving deployment pipeline for a custom vendor. Integrate our inference stack into an online reinforcement learning pipeline... lifetime of any inference workload. Tech stack: Must have: Python, Redis, S3-compatible Storage, Model serving...
, and multi-tenant service design. Familiar with concepts in ML model serving and inference runtimes, even if not directly... About the Role: We're looking for a Senior Engineer to help build the next-generation inference platform that supports...
an exceptional Senior ML Platform Engineer to build and scale our machine learning infrastructure with a focus on Large Language... stores for ML model training and inference pipelines Build and optimize LLM inference systems using frameworks like vLLM...
GPU clusters for AI/ML workloads (training or inference). Familiarity with job management systems based on Kubernetes...-principles engineer who is fluent in Linux, comfortable operating close to the metal, and capable of architecting systems for the...
for our inference serving and monetization platform. Design systems that are fault-tolerant, highly available, and can scale to meet... platforms for serving, scaling, and managing AI models (e.g., inference servers, model deployment pipelines). What You'll...
computational problems. The Role: As a Cloud Platform Engineer, you will be specializing in our AI Inferencing Service... and operations, applying an engineering mindset to solve operational challenges. Your primary focus will be ensuring our inference...
computational problems. The Role: As a Senior Cloud Site Reliability Engineer (SRE) specializing in our AI Inferencing Service... and operations, applying an engineering mindset to solve operational challenges. Your primary focus will be ensuring our inference...