Screenshot Parsing as Pretraining for Visual Language Understanding
- Paper
- Oct 7, 2022
- #ComputerScience #Cognitivescience
Visually-situated language is ubiquitous -- sources range from textbooks with diagrams to web pages with images and tables, to mobile apps with buttons and forms. Perhaps due to thi...
Show More