Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Software Development Engineer- SGLang and Inference Stack, Location: Santa Clara, CA

Page: 1

Software Development Engineer- SGLang and Inference Stack

engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux... performance goals. Initiate and help with different level codegen optimizations. Contribute to SGLang Development: Support...

Posted Date: 12 Feb 2026

Senior Software Development Engineer – SGLang and Inference Stack

engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux... performance goals. Initiate and help with different level codegen optimizations. Contribute to SGLang Development: Support...

Posted Date: 12 Feb 2026

Senior Software Development Engineer - SGLang and Inference Stack

and align kernel-level optimizations with full-stack performance goals. Contribute to SGLang Development: Support... contributions that benefit AMD’s AI software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical...

Posted Date: 20 Dec 2025

Senior Software Development Engineer - LLM Kernel & Inference Systems

. This role is deeply focused on LLM inference stacks, including vLLM, SGLang, and internal inference platforms. You will work... RESPONSIBILITIES Optimize LLM Inference Frameworks Drive performance improvements in LLM inference frameworks such as vLLM, SGLang...

Posted Date: 20 Dec 2025

Senior Software Development Engineer – LLM Inference Framework

PREFERRED EXPERIENCE: Inference Stack Knowledge Hands-on understanding of vLLM, SGLang, or similar inference stacks...) and will be upstreamed into open-source inference frameworks such as vLLM and SGLang to make AMD a first-class platform for LLM serving...

Posted Date: 20 Dec 2025

Principal Software Engineer - Dynamo

and Python. Contribute to the development of disaggregated serving for Dynamo-supported inference engines (vLLM, SGLang, TRT... enthusiastic about building the next generation of scalable AI systems. As a Principal Software Engineer on the Dynamo project...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 02 Jan 2026

Senior GenAI Algorithms Engineer — Post-Training Optimizations

co-design. Your work will span multiple layers of the AI software stack—ranging from algorithm design to integration... with strong foundations in both machine learning and software systems/architecture who are eager to make a broad impact across the AI stack...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 01 Feb 2026