and AI. Were seeking Senior Infrastructure Engineers to build and optimize distributed training and inference systems that power... and cross-team impact. Bonus Points Experience with large-scale ML training and inference workloads. Exposure to service...
) and bring a combination of experience in both economics and machine learning. More senior candidates... causal inference. ABOUT THE JOB You will help design and build end-to-end machine learning solutions and optimization...
Learning workloads, especially Large Language Model inference Processor architecture, including ISA design & microarchitecture...., is consulted by senior leadership to make key decisions). Most tasks do not have defined steps; simultaneous use of multiple...
inference (TensorRT, CoreML, TVM, ONNX, TFLite, WebGPU, or similar). Familiarity with CTV / ACR pipelines (frame grabbers...
inference (TensorRT, CoreML, TVM, ONNX, TFLite, WebGPU, or similar). Familiarity with CTV / ACR pipelines (frame grabbers...
inference (TensorRT, CoreML, TVM, ONNX, TFLite, WebGPU, or similar). Familiarity with CTV / ACR pipelines (frame grabbers...
innovation in large-scale AI. Our mission is to enable state-of-the-art large language model (LLM) training and inference through...) serving technologies, including distributed execution and inference optimization. Strong problem-solving skills...
computing Contribute and advance open source , , Solve large-scale, end-to-end AI training and inference challenges... with AI Frameworks (e.g. PyTorch, JAX, Ray), and/or inference and deployment environments (e.g. TRTLLM, vLLM, SGLang). Proficient...
experience. Experience using or developing Machine Learning training or inference software Experience with cross-team...
deployment software libraries and stacks (e.g. NVIDIA TensorRT, Triton Inference Server, OnnxRuntime) Experience deploying...
deployment software libraries and stacks (e.g. NVIDIA TensorRT, Triton Inference Server, OnnxRuntime) Experience deploying...
Strong programming skills in Pytorch and C/C++ Experience working with distributed training, post-training, or inference pipelines...
and inference jobs. You will craft software services to deliver functionality to NVIDIA's internal platforms and our external-facing...
, and edge training/inference. Elicit functional requirements from end users and data science teams, utilizing methods... CI/CD pipelines for models in cloud environments, including batch, online, streaming, and edge training/inference. Elicit...
computing Contribute and advance open source and Solve large-scale, end-to-end AI training and inference challenges... with AI Frameworks (e.g. PyTorch, JAX), and/or inference and deployment environments (e.g. TRTLLM, vLLM, SGLang). Proficient in Python...
, deployment, inference optimization Python and C++ experience Hands-on experience with building a perception stack...
and build benchmarks for MLPerf Inference, the industry-leading benchmark suite for inference system-level performance, as well... be doing: Design and implement highly efficient inference systems for large-scale deployments of generative AI models...
Summary: Qualcomm Cloud System Hardware Engineering team develops cutting-edge rack level AI inference solutions... integration, and PCB definition/design. Participate in the design and development of rack-level inference solutions, ensuring...
infrastructure, including containerized training/inference and workflow orchestration (e.g., Airflow). Partner with Technology (IT... with an RDBMS Experience supporting both batch and real time inference (REST/gRPC, streaming). Proficiency in Python and Bash...
infrastructure, including containerized training/inference and workflow orchestration (e.g., Airflow). Partner with Technology (IT... with an RDBMS Experience supporting both batch and real time inference (REST/gRPC, streaming). Proficiency in Python and Bash...