Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.
Published April 15, 2019 | Version 2019-01-25
Dataset Open

Webis-Web-Errors-19

  • 1. Bauhaus-Universität Weimar
  • 2. Leipzig University

Description

The Webis-Web-Errors-19 comprises various annotations for the 10,000 web page archives of the Webis-Web-Archive-17. The annotations are whether the page is (1) mostly advertisement, (2) cut off, (3) still loading, (4) pornographic; and whether it shows (not/a bit/ very) (5) pop-ups, (6) CAPTCHAs, or (7) error messages. If you use this dataset in your research, please cite it using this paper.

Files

annotation-interface.png

Files (681.6 kB)

Name Size Download all
md5:7aaa146411dbc0d770dbd319f00cd864
263.5 kB Preview Download
md5:ed0aa6eb30370b38f69265f463c59d5c
101.7 kB Preview Download
md5:862ffd4469c832922af95007ca4b3a44
4.1 kB Preview Download
md5:098d08ef7c0e0c69023b27951277caa1
312.3 kB Preview Download

Additional details

Related works