to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed vLLM infrastructure... challenges in scalable inference systems and Kubernetes-native deployments. Your work with machine learning, distributed systems...
, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed infrastructure in the project... and/or Rust to integrate with the vLLM project and manage distributed inference workloads. Design and implement KV cache-aware...
. Red Hat’s Global Engineering team is looking for a Principal Software Engineer to join the Agentic and AI Engineering.... Experience with AI and Machine Learning platforms, tools, and frameworks, such as MLFlow, Llama Stack, LangChain, PyTorch, LLaMA...