For training your newly released checkpoint that supports semantic-level segmentation, you added a special token "[semantic]" before the input prompt in the training data. I was wondering what difference this token makes. You have not modified the model architecture, which still gives a one-to-one mapping between prediction and prompt, and the training strategy for the referring-segmentation and semantic-segmentation datasets remains the same except for this token. Why would adding a "[semantic]" token improve the model's ability? Would training without it yield similar performance?
Thank you very much in advance!
When I use semantic-segmentation datasets such as ADE20K, I add '[semantic]' because in those datasets a category name refers to all instances of that category within the image. This is in contrast to the original referring datasets, where the prompt refers to a single instance. The special token resolves this conflict between the two supervision signals, making the model behave better during joint training. By incorporating semantic-segmentation datasets, the model can segment multiple objects, as well as stuff categories, when prompted with '[semantic]'.
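
In practice this amounts to a small change in how training prompts are built. Below is a minimal sketch of what that preprocessing might look like; the function name, dataset flag, and example prompts are hypothetical illustrations, not code from the repository:

```python
# Hypothetical sketch of prompt construction during training data prep.
# Semantic-segmentation samples get the "[semantic]" prefix so the model
# can distinguish "segment every instance of this category (incl. stuff)"
# from "segment the single instance this expression refers to".

SEMANTIC_TOKEN = "[semantic]"

def build_prompt(text: str, is_semantic_dataset: bool) -> str:
    """Prefix prompts that come from semantic-segmentation datasets
    (e.g. ADE20K); leave referring-expression prompts unchanged."""
    if is_semantic_dataset:
        return f"{SEMANTIC_TOKEN} {text}"
    return text

# Referring-segmentation sample: the prompt denotes one instance.
print(build_prompt("the man in the red jacket", is_semantic_dataset=False))
# -> "the man in the red jacket"

# Semantic-segmentation sample: the category covers all matching pixels.
print(build_prompt("car", is_semantic_dataset=True))
# -> "[semantic] car"
```

The same prefix would then be used at inference time whenever the user wants category-level masks rather than a single referred instance.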