Create evaluations and establish evaluation frameworks to measure both technical/practical performance and non-deterministic performance like EQ Research & implement novel prompting techniques Spin the data flywheel to extract insights from...
Leverage expertise to measure the performance of Copilot, identify failure modes and novel mitigation strategies, including data mining, prompt engineering, LLM as a judge, and classifier training. Creative problem solving, navigating compl...
Build and lead a team of backend and machine learning engineers, including driving project planning, prioritization of work, and designing features. Guides teams and leads identification of dependencies and the development of design documen...
Deeply understand the pipeline of collecting data, training and then serving language models and multimodal models Have experience in working side-by-side with researchers and engineers Thrive in a fast-paced, innovative environment Are pas...
Own the Model Feedback Lifecycle: Design and manage processes for collecting, triaging, and analyzing feedback from production usage. Build dashboards and reporting systems to surface insights and drive data-informed improvements to Copilot...
Design and build large-scale Copilot memory and personalization systems leveraging search, embeddings, retrieval, ranking, and Retrieval-Augmented Generation (RAG). Apply subject-matter expertise in cross-product features, collaborating wit...
Design and build large-scale Copilot memory and personalization systems leveraging search, embeddings, retrieval, ranking, and Retrieval-Augmented Generation (RAG). Apply subject-matter expertise in cross-product features, collaborating wit...
Own and manage the back-end infrastructure for the Copilot Voice & Vision stack, applying technical expertise in modern web stacks, networking, and large language model (LLM) serving. Serve and operate live voice and vision endpoints, inclu...
Leverage expertise in AI safety to uncover potential risks and develop novel mitigation strategies, including alignment techniques, constitutional AI approaches, RLHF, and robustness improvements for large language models. Create and implem...
Works with appropriate stakeholders to determine user requirements for a set of features. Contributes to the identification of dependencies, and the development of design documents for a product area with little oversight. Creates and imple...
Partners with appropriate stakeholders to determine user requirements across different scenarios. Leads identification of dependencies and development of design documents for the application and service. Assists others to produce extensible...
Software development in C++, Python, and other languages for specialized accelerators and host systems. Software design, development and optimization to execute AI workloads at the cloud scale Co-design with hardware partner teams on HW/SW ...
Guides partnership with appropriate stakeholders (e.g., project manager, technical lead) to determine user requirements within and across teams. Guides teams and leads identification of dependencies and the development of design documents f...
Lead post-training data collection (e.g. human evals, training data), including vendor selection, contract negotiation, and management. Act as primary data operations POC for key research efforts and pillars, including Multimodality (images...
Provides technical leadership related to building large language model-driven Generative Search projects, and prototype, develop and ship these projects. Partners with stakeholders to determine user requirements for a set of scenarios. Lead...
Leads by example, contributing hands-on design work while guiding a small team. Creates compelling experience concepts with a high standard of craftsmanship and interactivity. Actively integrates AI tools and methods into the design process...
Ship world-class AI-powered experiences that delight advertisers, publishers, sellers and consumers with an emphasis on craft, quality, trust, and positive user and societal impact Own a product area and be responsible for understanding use...
As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where e...
Training pipelines that scale elegantly - Design and implement robust training infrastructure that handles everything from data ingestion to model versioning, making it trivial for ML engineers to experiment and deploy with confidence The d...
Inform future directions of Bing's A/B experimentation platform, and drive improvements to it. Establish long-term A/B experimentation capabilities. Understand A/B online experiments, online user engagement and produce actionable, trustwort...