Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
configs		configs
data		data
data_utils		data_utils
experiment		experiment
figs		figs
model		model
utils		utils
.gitignore		.gitignore
README.md		README.md
eval.py		eval.py
loss.py		loss.py
requirements.txt		requirements.txt
test.SBATCH		test.SBATCH
test_DDP.py		test_DDP.py
train.SBATCH		train.SBATCH
train_DDP.py		train_DDP.py

Repository files navigation

EgoPAT3Dv2: Predicting 3D Action Target from 2D Egocentric Vision for Human-Robot Interaction.

Irving Fang*, Yuzhong Chen*, Yifan Wang*, Jianghan Zhang+, Qiushi Zhang+, Jiali Xu+, Xibo He, Weibo Gao, Hao Su, Yiming Li, Chen Feng

Project Website

Please visit our project website for more information, such as a video presentation.

Environment Setup

The project was developed on Python 3.11.5 and PyTorch 2.1.1 with CUDA 11.8.0 binaries. For more details about the required packages, please take a look at requirements.txt

Dataset

Please visit our Hugging Face repo to access and prepare the dataset.

Training

We used the configuration files in configs to control the hyperparameters during our experiments. For more details on the hyperparameters, please refer to the README for configuration files.

To train a model with specified hyperparameters, please run

python train_DDP.py --config_file [your configuration file]

Note: Our training code is written for distributed training on NYU HPC with PyTorch DDP and singularity container. Please modify the train_DDP.py code to suit your needs. You can refer to the train.SBATCH file for more info on our distributed training setup.

Testing

We used the configuration files in configs to control the hyperparameters during our experiments. For more details on the hyperparameters, please refer to the README for configuration files.

To test a model with specified hyperparameters, please run

python test_DDP.py --config_file [your configuration file]

Note: Slightly different from the training code, our testing code is written for single-card inference. Similarly, you can check out the test.SBATCH file for more info on our HPC testing setup.

Testing Results Visualization

To produce aggregated results similar to what's presented in Table I of the paper, please run

python eval.py --model_name [model name in the config file]

You can also use the --mode parameter to dictate whether the test results are created for the seen or unseen test set.

Note: We are currently updating this file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EgoPAT3Dv2: Predicting 3D Action Target from 2D Egocentric Vision for Human-Robot Interaction.

Project Website

Environment Setup

Dataset

Training

Testing

Testing Results Visualization

About

Releases

Packages

Contributors 3

Languages

ai4ce/EgoPAT3Dv2

Folders and files

Latest commit

History

Repository files navigation

EgoPAT3Dv2: Predicting 3D Action Target from 2D Egocentric Vision for Human-Robot Interaction.

Project Website

Environment Setup

Dataset

Training

Testing

Testing Results Visualization

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages