Project homepage: https://uimodeling.github.io/
This repository contains the code and download scripts for the following papers:
- WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics (CHI 2023 - Best Paper Honorable Mention)
- WebUI: A Dataset of Web UIs and Associated Metadata to Support Computational UI Modeling (CHI 2023 Computational UI Workshop)
The WebUI dataset contains 400K web UIs captured over a period of 3 months; the crawl cost about $500. We grouped web pages by their domain name, then generated training (70%), validation (10%), and testing (20%) splits. This ensures that similar pages from the same website always appear in the same split. We created four versions of the training dataset. Three were generated by randomly sampling a subset of the training split: Web-7k, Web-70k, and Web-350k. We chose 70k as a baseline size, since it is approximately the size of existing UI datasets. We also generated an additional split (Web-7k-Resampled) to provide a small, higher-quality split for experimentation. Web-7k-Resampled was generated using a class-balancing sampling technique, and we removed screens with possible visual defects (e.g., very small, occluded, or invisible elements). The validation and test splits were kept the same across all versions.
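The domain-grouped split described above can be sketched as follows. This is a minimal illustration, not the code used to build the released splits: the function name `split_by_domain`, its parameters, the use of the URL host as the "domain name", and the choice to apply the percentages to the number of domains (rather than the number of pages) are all assumptions.

```python
import random
from collections import defaultdict
from urllib.parse import urlparse


def split_by_domain(urls, train_pct=70, val_pct=10, seed=0):
    """Assign URLs to train/val/test so that no domain spans two splits.

    Hypothetical helper: percentages are applied to the number of
    domains, and whatever remains after train and val goes to test.
    """
    # Group pages by the host part of their URL.
    by_domain = defaultdict(list)
    for url in urls:
        by_domain[urlparse(url).netloc.lower()].append(url)

    # Shuffle the domains (not the individual pages) deterministically.
    domains = sorted(by_domain)
    random.Random(seed).shuffle(domains)

    # Integer arithmetic avoids floating-point rounding at the cut points.
    n = len(domains)
    train_end = n * train_pct // 100
    val_end = train_end + n * val_pct // 100
    groups = {
        "train": domains[:train_end],
        "val": domains[train_end:val_end],
        "test": domains[val_end:],
    }

    # Expand each group of domains back into its pages.
    return {
        name: [url for domain in group for url in by_domain[domain]]
        for name, group in groups.items()
    }
```

Because whole domains are assigned to a split, pages from the same website cannot leak between training and evaluation data. The resulting page counts only approximate 70/10/20 when domains contribute different numbers of pages.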
Information about each directory:
crawler/
- contains code for the crawler used to collect the WebUI dataset

downloads/
- contains scripts to download datasets and pre-trained models

models/
- contains scripts for training and reproducing the experiments in the paper

notebooks/
- contains example notebooks for running the models

sample/
- a sample data point from the WebUI dataset

scripts/
- data processing, dataset generation, and model export scripts
First, install dependencies with pipenv install, then activate a virtual environment with pipenv shell. Example inference code is found in the notebooks/ directory. To train, check out the models/ directory. Alternatively, check out web demos of the models (no installation required).
- Please see the COPYRIGHT.txt file for information about the data contained within this repository.
- Not all data samples have the same number of files (e.g., the same number of device screenshots) because the crawler used a timeout during collection.
- The dataset released on HuggingFace was filtered using a list of explicit words and therefore contains fewer samples than were used in the experiments in the paper.
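Since samples may not all contain the same files, it can help to check what is present before loading. The sketch below assumes a layout of one directory per sample under a common root. That layout, the helper names `files_per_sample` and `missing_files`, and any file names passed as `expected` are hypothetical and should be adapted to the downloaded data.

```python
from pathlib import Path


def files_per_sample(root):
    """Return {sample directory name: sorted file names inside it}.

    Assumes (hypothetically) one sub-directory per sample under root.
    """
    return {
        d.name: sorted(p.name for p in d.iterdir() if p.is_file())
        for d in sorted(Path(root).iterdir())
        if d.is_dir()
    }


def missing_files(root, expected):
    """Return {sample name: expected file names that are absent}.

    Samples that contain every expected file are left out of the report.
    """
    report = {}
    for name, present in files_per_sample(root).items():
        missing = sorted(set(expected) - set(present))
        if missing:
            report[name] = missing
    return report
```

A loader can use the report to skip incomplete samples or to fall back to whichever files a sample does contain.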
@inproceedings{wu2023webui,
title={WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics},
author={Wu, Jason and Wang, Siyan and Shen, Siman and Peng, Yi-Hao and Nichols, Jeffrey and Bigham, Jeffrey P},
booktitle={Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems},
pages={1--14},
year={2023}
}