on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving... boundaries of what's possible in large-scale ML serving. Recent shares: https://github.com/aws-neuron/upstreaming-to-vllm...
DESCRIPTION The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used... on the customer AWS Trainium and Inferentia silicon and servers. Strong software development using Python, system-level...