of large-scale RL training frameworks to accelerate the development of multi-modal AV foundation models. Design, build... and enhancing simulation quality and refining the reward function for RL training. Ensure the reliability and performance...
Policy Optimization (PPO), and reward modeling to improve agent performance. Launch and support fine-tuned models... with applied AI/ML teams to translate state-of-the-art research in agentic reasoning, planning, and tool use into reliable...
and facilitate training of deep learning models Integrating machine learning into digital products and services by working cross... of shared responsibility and shared reward Interest in early research and development through to product roll-out in the...
operations and service training and education. This Managerial position regularly engages in business planning and analysis... for People Management processes including but not limited to selection, training, performance, operational results, cost...
About this role As a Machine Learning Research Engineer, you'll drive research that teaches models what great feels... or ML research engineering, especially in post-training/fine-tuning large models (SFT, RLHF, DPO). Experience with LLM...