Question on Training Datasets for One-Stage Training in TA-TiTok #79

yingShen-ys · 2025-01-24T23:40:16Z

Thank you for the amazing work on TA-TiTok!

I’m curious about the training process—specifically, what datasets were used during the one-stage training for TA-TiTok? Is it a single dataset or a combination of several public datasets?

If it’s the latter, could you share details about the specific datasets involved?

Looking forward to your response!

TACJu · 2025-01-25T00:29:37Z

Hi,

Thank you for your interest in our work. Regarding the training datasets used for TA-TiTok, as detailed in Table 7 of our paper, we exclusively used DataComp-1B with a resolution filter.

yingShen-ys · 2025-01-25T06:16:16Z

Thank you for the prompt response!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on Training Datasets for One-Stage Training in TA-TiTok #79

Question on Training Datasets for One-Stage Training in TA-TiTok #79

yingShen-ys commented Jan 24, 2025

TACJu commented Jan 25, 2025

yingShen-ys commented Jan 25, 2025

Question on Training Datasets for One-Stage Training in TA-TiTok #79

Question on Training Datasets for One-Stage Training in TA-TiTok #79

Comments

yingShen-ys commented Jan 24, 2025

TACJu commented Jan 25, 2025

yingShen-ys commented Jan 25, 2025