ELBO scaling #1799

InfProbSciX · 2021-10-26T12:42:25Z


I noticed that for a GPLVM, the numerical value of the ELBO (VariationalELBO) barely changes with the number of data points. Is the ELBO scaled internally so that it is the mean across the data?

wjmaddox · 2021-10-26T12:44:48Z

Collaborator

Yes, in general it's averaged across the data. See the type of scaling in

gpytorch/gpytorch/mlls/_approximate_mll.py, line 57 in fc2053b:

    log_likelihood = self._log_likelihood_term(approximate_dist_f, target, **kwargs).div(num_batch)
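
To make the effect of that division concrete, here is a minimal pure-Python sketch, not the library code. The function `scaled_elbo` and its arguments are hypothetical names for illustration. The division of the summed log-likelihood by the batch size mirrors the quoted line; dividing the KL term by the dataset size is an assumption made here so that the whole objective is on a per-data-point scale.

```python
def scaled_elbo(log_lik_terms, kl, num_data, beta=1.0):
    """Hypothetical per-data-point ELBO (illustration only).

    log_lik_terms: expected log-likelihood of each point in the minibatch
    kl:            KL divergence between variational posterior and prior
    num_data:      total number of training points
    beta:          weight on the KL term (assumed, defaults to 1)
    """
    num_batch = len(log_lik_terms)
    # Sum over the minibatch, then divide by the batch size (as in the quoted line).
    log_likelihood = sum(log_lik_terms) / num_batch
    # Assumption: the KL term is divided by the dataset size.
    kl_term = beta * kl / num_data
    return log_likelihood - kl_term


# Same per-point log-likelihood and KL, very different dataset sizes.
small = scaled_elbo([-1.0] * 100, kl=5.0, num_data=100)
large = scaled_elbo([-1.0] * 10_000, kl=5.0, num_data=10_000)
print(small, large)
```

Under these assumptions both values stay close to the per-point log-likelihood of -1.0, which matches the observation that the reported ELBO hardly moves as the number of data points grows.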


InfProbSciX · 2021-10-26T12:54:29Z

Author

Thank you - why is this the case? Also, would you mind a PR from me adding this detail to the documentation at https://docs.gpytorch.ai/en/latest/_modules/gpytorch/mlls/variational_elbo.html?

1 reply

gpleiss Oct 26, 2021
Maintainer

A PR would be great!

wjmaddox · 2021-10-26T12:56:38Z

Collaborator

Sure. Averaging across data points is also done for the MLL, and I believe it's to keep everything on the same scale as NN losses in PyTorch (which are also traditionally averaged across the batch). The end goal is that torch.optim learning rates, etc., stay easy to interpret.
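
A small sketch of why averaging helps with learning rates, using a toy squared-error loss rather than anything from GPyTorch (the helper names `grad_sum` and `grad_mean` are made up for this example). With a summed loss the gradient grows with the number of points, so a fixed learning rate takes larger steps on larger datasets. With an averaged loss the gradient stays on the same scale.

```python
def grad_sum(w, ys):
    """Gradient w.r.t. w of the summed loss sum_i (w - y_i)^2."""
    return sum(2.0 * (w - y) for y in ys)


def grad_mean(w, ys):
    """Gradient w.r.t. w of the averaged loss (1/N) sum_i (w - y_i)^2."""
    return grad_sum(w, ys) / len(ys)


w = 0.0
for n in (10, 1000):
    ys = [1.0] * n
    # The summed gradient scales with n; the averaged gradient does not.
    print(n, grad_sum(w, ys), grad_mean(w, ys))
```

In this toy setting, a learning rate tuned on 10 points would overshoot by a factor of 100 on 1000 points if the loss were summed, whereas the averaged loss gives the same step in both cases.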

