Job Description
Overview
Microsoft’s Applied Sciences Group is seeking a visionary and hands-on Principal Applied Scientist to lead research and development in multimodal AI, with a dual focus on image understanding and autoregressive generation across language and vision. This role is ideal for candidates passionate about building real-world systems that unify visual and textual modalities to power next-generation user experiences across devices and platforms.
As a senior member of the team, you will drive innovation across model architecture, training, and deployment, especially for scalable autoregressive models that handle both language and image generation in a unified framework. You will also play a key role in converting cutting-edge research into practical applications and experiences for users across the globe.Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
1. Design and prototype unified token-based architectures that treat text and image data as sequences for coherent multimodal gener...