WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics

Project homepage: https://uimodeling.github.io/

This repository contains the code and download scripts for the following papers:

WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics (CHI 2023 - Best Paper Honorable Mention)
WebUI: A Dataset of Web UIs and Associated Metadata to Support Computational UI Modeling (CHI 2023 Computational UI Workshop)

Description

The WebUI dataset contains 400K web UIs captured over a period of 3 months and cost about $500 to crawl. We grouped web pages together by their domain name, then generated training (70%), validation (10%), and testing (20%) splits. This ensured that similar pages from the same website must appear in the same split. We created four versions of the training dataset. Three of these splits were generated by randomly sampling a subset of the training split: Web-7k, Web-70k, Web-350k. We chose 70k as a baseline size, since it is approximately the size of existing UI datasets. We also generated an additional split (Web-7k-Resampled) to provide a small, higher quality split for experimentation. Web-7k-Resampled was generated using a class-balancing sampling technique, and we removed screens with possible visual defects (e.g., very small, occluded, or invisible elements). The validation and test split was always kept the same.

Repository Structure

Information about each directory:

crawler/ - contains code for the crawler used to collect the WebUI dataset
downloads/ - contains scripts to download datasets and pre-trained models
models/ - contains scripts for training and reproducing the experiments in the paper
notebooks/ - contains example notebooks for running the models
sample/ - a sample data point from the WebUI dataset
scripts/ - data processing, dataset generation, and model export scripts

Getting Started

First, install dependencies with pipenv install and activate a virtual environment with pipenv shell Example inference code is found in the notebooks/ directory. To train, check out the models/ directory. Alternatively, check out web demos of the models (no installation required).

Important Notes

Please see the COPYRIGHT.txt file for information about the data contained within this repository.
Not all data samples have the same number of files (e.g., same number of device screenshots) due to the fact that the crawler used a timeout during collection
The dataset released on HuggingFace was filtered using a list of explicit words and therefore contains fewer samples than the experiments originally used in the paper.

Citation

@inproceedings{wu2023webui,
  title={WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics},
  author={Wu, Jason and Wang, Siyan and Shen, Siman and Peng, Yi-Hao and Nichols, Jeffrey and Bigham, Jeffrey P},
  booktitle={Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems},
  pages={1--14},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
crawler		crawler
downloads		downloads
models		models
notebooks		notebooks
sample/1656554031731		sample/1656554031731
scripts		scripts
COPYRIGHT.txt		COPYRIGHT.txt
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics

Description

Repository Structure

Getting Started

Important Notes

Citation

About

Releases

Packages

Languages

License

CedrickMakhlouf/webui

Folders and files

Latest commit

History

Repository files navigation

WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics

Description

Repository Structure

Getting Started

Important Notes

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages