Microsoftβs mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Within AI Platform, the Multimodal Intelligence team empowers developers and data scientists around the world and of all skill levels to easily add multimodal AI capabilities to their apps. We are looking for a research Scientist to work on exciting challenges in Document Understanding and Computer Vision. #aiplatform
We are particularly interested in candidates with background in Computer Vision, Natural Language Processing and/or Artificial Intelligence, including topics like Video Understanding, multi-page multi-document question answering, novel ways of leveraging large language models for document understanding and solving problems inherent to large language models (grounding, retrieval-based generation, etc.) and other Multimodal topics. Familiarity with modern large language models is a plus, but not required.