ScreenAI: A visual language model for UI and visually-situated language understanding
Posted by Srinivas Sunkara and Gilles Baechler, Software Engineers, Google Research
Screen user interfaces (UIs) and infographics, such as charts,...
SCIN: A new resource for representative dermatology images
Posted by Pooja Rao, Research Scientist, Google Research
Health datasets play a crucial role in research and medical education, but...
MELON: Reconstructing 3D objects from images with unknown poses
Posted by Mark Matthews, Senior Software Engineer, and Dmitry Lagun, Research Scientist, Google Research
A person’s prior experience and understanding...
Google at APS 2024
Posted by Kate Weber and Shannon Leon, Google Research, Quantum AI Team
Today the 2024 March Meeting of the American...
AMIE: A research AI system for diagnostic medical reasoning and conversations
Posted by Alan Karthikesalingam and Vivek Natarajan, Research Leads, Google Research
The physician-patient conversation is a cornerstone of medicine, in...
Can large language models identify and correct their mistakes?
Posted by Gladys Tyen, Intern, Google Research
LLMs are increasingly popular for reasoning tasks, such as multi-turn QA, task completion,...