's innovation around Generative AI applications, focusing on distributed AI inference and agent-based infrastructure. This role...) implementations for distributed inference and agent-based systems, ensuring they are operational and ready for the next phase. Work...
, optimize, and scale LLM deployments. As a Principal Machine Learning Engineer focused on distributed infrastructure in the... distributed inference infrastructure leveraging Kubernetes APIs, operators, and the Gateway Inference Extension API for scalable...
, and scale LLM deployments. As a Principal Machine Learning Engineer focused on vLLM, you will be at the forefront... LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational...
Job Summary: The Red Hat Ecosystems Engineering group is seeking a Senior Principal Software Engineer in our Boston... AI applications. Architect and lead implementation of scalable solutions with distributed computing capabilities to deploy, train...