Hi All,
Thanks for the nice work.
I have a question regarding the depiction in Figure 1. Why do you compute the consistency loss after sharpening the predictions, rather than minimizing a form of KL divergence between the model features or raw predictions? Did you observe that the sharpened form led to better training, or what was the rationale?
Thanks!
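For context, here is a minimal sketch of what I mean by "sharpening," assuming the step in Figure 1 is the usual temperature sharpening (raising class probabilities to a power 1/T and renormalizing); the function name and temperature value are my own illustration, not taken from your code:

```python
import numpy as np

def sharpen(p, T=0.5):
    """Temperature sharpening: raise probabilities to the power 1/T
    and renormalize. Lower T pushes mass toward the argmax, making
    the target distribution more confident (lower entropy)."""
    p = np.asarray(p, dtype=float)
    powered = p ** (1.0 / T)
    return powered / powered.sum(axis=-1, keepdims=True)

# A fairly flat raw prediction becomes noticeably more peaked,
# so a consistency loss against it behaves more like entropy
# minimization than a plain KL term on the raw prediction would.
raw = np.array([0.4, 0.35, 0.25])
sharp = sharpen(raw, T=0.5)
```

My question is essentially whether using `sharp` rather than `raw` (or raw logits/features under a KL term) as the consistency target was empirically better, or motivated by something else.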