ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

Hsiao, Yu-Chung; Zubach, Fedir; Wang, Maria; Chen, Jindong

Computer Science > Computation and Language

arXiv:2209.08199 (cs)

[Submitted on 16 Sep 2022 (v1), last revised 22 Feb 2024 (this version, v2)]

Title:ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

Authors:Yu-Chung Hsiao, Fedir Zubach, Maria Wang, Jindong Chen

View PDF HTML (experimental)

Abstract:We present a new task and dataset, ScreenQA, for screen content understanding via question answering. The existing screen datasets are focused either on structure and component-level understanding, or on a much higher-level composite task such as navigation and task completion. We attempt to bridge the gap between these two by annotating 86K question-answer pairs over the RICO dataset in hope to benchmark the screen reading comprehension capacity.

Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2209.08199 [cs.CL]
	(or arXiv:2209.08199v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2209.08199

Submission history

From: Yu-Chung Hsiao [view email]
[v1] Fri, 16 Sep 2022 23:49:00 UTC (10,966 KB)
[v2] Thu, 22 Feb 2024 08:07:33 UTC (11,029 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2209

Change to browse by:

cs
cs.CV
cs.HC

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators