This repository contains web crawling and analysis code for the IMC 2024 paper "Browsing without Third-Party Cookies: What Do You See?".
Only Ubuntu 20.04 LTS is officially supported.
To create the cookie
conda environment, execute:
conda env create -f environment.yml
Activate the environment with:
conda activate cookie
To start a crawl, execute:
python3 sbatch_main.py --jobs <number of slurm jobs>
If you do not have Slurm, you can start a single job using main.py
.
After crawling, use extract_differences.py
to compute differences in extracted features.
To analyze the differences, use classification_algo.ipynb
.