engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux... performance goals. Initiate and assist with codegen optimizations at different levels. Contribute to SGLang Development: Support...
and align kernel-level optimizations with full-stack performance goals. Contribute to SGLang Development: Support... contributions that benefit AMD’s AI software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical...
This role is deeply focused on LLM inference stacks, including vLLM, SGLang, and internal inference platforms. You will work... RESPONSIBILITIES: Optimize LLM Inference Frameworks: Drive performance improvements in LLM inference frameworks such as vLLM, SGLang...
PREFERRED EXPERIENCE: Inference Stack Knowledge: Hands-on understanding of vLLM, SGLang, or similar inference stacks...) and will be upstreamed into open-source inference frameworks such as vLLM and SGLang to make AMD a first-class platform for LLM serving...
and Python. Contribute to the development of disaggregated serving for Dynamo-supported inference engines (vLLM, SGLang, TRT... enthusiastic about building the next generation of scalable AI systems. As a Principal Software Engineer on the Dynamo project...
co-design. Your work will span multiple layers of the AI software stack—ranging from algorithm design to integration... with strong foundations in both machine learning and software systems/architecture who are eager to make a broad impact across the AI stack...