Job Search Results

Software Development Engineer- SGLang and Inference Stack

engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux... performance goals. Initiate and help with different level codegen optimizations. Contribute to SGLang Development: Support...

Apply Now

Company: Advanced Micro Devices

Location: Santa Clara, CA

Posted Date: 12 Feb 2026

Senior Software Development Engineer – SGLang and Inference Stack

engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux... performance goals. Initiate and help with different level codegen optimizations. Contribute to SGLang Development: Support...

Apply Now

Company: Advanced Micro Devices

Location: Santa Clara, CA

Posted Date: 12 Feb 2026

Senior Software Development Engineer - SGLang and Inference Stack

and align kernel-level optimizations with full-stack performance goals. Contribute to SGLang Development: Support... contributions that benefit AMD’s AI software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical...

Apply Now

Company: Advanced Micro Devices

Location: Santa Clara, CA

Posted Date: 20 Dec 2025

Senior Software Development Engineer - LLM Kernel & Inference Systems

. This role is deeply focused on LLM inference stacks, including vLLM, SGLang, and internal inference platforms. You will work... RESPONSIBILITIES Optimize LLM Inference Frameworks Drive performance improvements in LLM inference frameworks such as vLLM, SGLang...

Apply Now

Company: Advanced Micro Devices

Location: Santa Clara, CA

Posted Date: 20 Dec 2025

Senior Software Development Engineer – LLM Inference Framework

PREFERRED EXPERIENCE: Inference Stack Knowledge Hands-on understanding of vLLM, SGLang, or similar inference stacks...) and will be upstreamed into open-source inference frameworks such as vLLM and SGLang to make AMD a first-class platform for LLM serving...

Apply Now

Company: Advanced Micro Devices

Location: Santa Clara, CA

Posted Date: 20 Dec 2025

Principal Software Engineer - Dynamo

and Python. Contribute to the development of disaggregated serving for Dynamo-supported inference engines (vLLM, SGLang, TRT... enthusiastic about building the next generation of scalable AI systems. As a Principal Software Engineer on the Dynamo project...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 02 Jan 2026

Senior GenAI Algorithms Engineer — Post-Training Optimizations

co-design. Your work will span multiple layers of the AI software stack—ranging from algorithm design to integration... with strong foundations in both machine learning and software systems/architecture who are eager to make a broad impact across the AI stack...

Apply Now

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 01 Feb 2026

Find your dream job now!

Keywords: Software Development Engineer- SGLang and Inference Stack, Location: Santa Clara, CA

Page: 1

Software Development Engineer- SGLang and Inference Stack

Senior Software Development Engineer – SGLang and Inference Stack

Senior Software Development Engineer - SGLang and Inference Stack

Senior Software Development Engineer - LLM Kernel & Inference Systems

Senior Software Development Engineer – LLM Inference Framework

Principal Software Engineer - Dynamo

Senior GenAI Algorithms Engineer — Post-Training Optimizations