Optimal Variance Control of the Score-Function Gradient Estimator for Importance Weighted Bounds

Research output: Contribution to journal › Conference article › Research › peer-review

Standard

Optimal Variance Control of the Score-Function Gradient Estimator for Importance Weighted Bounds. / Liévin, Valentin; Dittadi, Andrea; Christensen, Anders; Winther, Ole.

In: Advances in Neural Information Processing Systems, Vol. 2020-December, 2020.

Research output: Contribution to journal › Conference article › Research › peer-review

Harvard

Liévin, V, Dittadi, A, Christensen, A & Winther, O 2020, 'Optimal Variance Control of the Score-Function Gradient Estimator for Importance Weighted Bounds', Advances in Neural Information Processing Systems, vol. 2020-December. <https://papers.nips.cc/paper/2020/hash/c15203a83f778ce8934d0efaf2d5c6f3-Abstract.html>

APA

Liévin, V., Dittadi, A., Christensen, A., & Winther, O. (2020). Optimal Variance Control of the Score-Function Gradient Estimator for Importance Weighted Bounds. Advances in Neural Information Processing Systems, 2020-December. https://papers.nips.cc/paper/2020/hash/c15203a83f778ce8934d0efaf2d5c6f3-Abstract.html

Vancouver

Liévin V, Dittadi A, Christensen A, Winther O. Optimal Variance Control of the Score-Function Gradient Estimator for Importance Weighted Bounds. Advances in Neural Information Processing Systems. 2020;2020-December.

Author

Liévin, Valentin ; Dittadi, Andrea ; Christensen, Anders ; Winther, Ole. / Optimal Variance Control of the Score-Function Gradient Estimator for Importance Weighted Bounds. In: Advances in Neural Information Processing Systems. 2020 ; Vol. 2020-December.

Bibtex

@inproceedings{30bdedeca4ba43a080665a860e0ade89,

title = "Optimal Variance Control of the Score-Function Gradient Estimator for Importance Weighted Bounds",

abstract = "This paper introduces novel results for the score function gradient estimator of the importance weighted variational bound (IWAE). We prove that in the limit of large K (number of importance samples) one can choose the control variate such that the Signal-to-Noise ratio (SNR) of the estimator grows as pK. This is in contrast to the standard pathwise gradient estimator where the SNR decreases as 1/pK. Based on our theoretical findings we develop a novel control variate that extends on VIMCO. Empirically, for the training of both continuous and discrete generative models, the proposed method yields superior variance reduction, resulting in an SNR for IWAE that increases with K without relying on the reparameterization trick. The novel estimator is competitive with state-of-the-art reparameterization-free gradient estimators such as Reweighted Wake-Sleep (RWS) and the thermodynamic variational objective (TVO) when training generative models.",

author = "Valentin Li{\'e}vin and Andrea Dittadi and Anders Christensen and Ole Winther",

note = "Publisher Copyright: {\textcopyright} 2020 Neural information processing systems foundation. All rights reserved.; 34th Conference on Neural Information Processing Systems, NeurIPS 2020 ; Conference date: 06-12-2020 Through 12-12-2020",

year = "2020",

language = "English",

volume = "2020-December",

journal = "Advances in Neural Information Processing Systems",

issn = "1049-5258",

publisher = "Morgan Kaufmann Publishers, Inc",

}

RIS

TY - GEN

T1 - Optimal Variance Control of the Score-Function Gradient Estimator for Importance Weighted Bounds

AU - Liévin, Valentin

AU - Dittadi, Andrea

AU - Christensen, Anders

AU - Winther, Ole

PY - 2020

Y1 - 2020

N2 - This paper introduces novel results for the score function gradient estimator of the importance weighted variational bound (IWAE). We prove that in the limit of large K (number of importance samples) one can choose the control variate such that the Signal-to-Noise ratio (SNR) of the estimator grows as pK. This is in contrast to the standard pathwise gradient estimator where the SNR decreases as 1/pK. Based on our theoretical findings we develop a novel control variate that extends on VIMCO. Empirically, for the training of both continuous and discrete generative models, the proposed method yields superior variance reduction, resulting in an SNR for IWAE that increases with K without relying on the reparameterization trick. The novel estimator is competitive with state-of-the-art reparameterization-free gradient estimators such as Reweighted Wake-Sleep (RWS) and the thermodynamic variational objective (TVO) when training generative models.

AB - This paper introduces novel results for the score function gradient estimator of the importance weighted variational bound (IWAE). We prove that in the limit of large K (number of importance samples) one can choose the control variate such that the Signal-to-Noise ratio (SNR) of the estimator grows as pK. This is in contrast to the standard pathwise gradient estimator where the SNR decreases as 1/pK. Based on our theoretical findings we develop a novel control variate that extends on VIMCO. Empirically, for the training of both continuous and discrete generative models, the proposed method yields superior variance reduction, resulting in an SNR for IWAE that increases with K without relying on the reparameterization trick. The novel estimator is competitive with state-of-the-art reparameterization-free gradient estimators such as Reweighted Wake-Sleep (RWS) and the thermodynamic variational objective (TVO) when training generative models.

UR - http://www.scopus.com/inward/record.url?scp=85100493425&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:85100493425

VL - 2020-December

JO - Advances in Neural Information Processing Systems

JF - Advances in Neural Information Processing Systems

SN - 1049-5258

T2 - 34th Conference on Neural Information Processing Systems, NeurIPS 2020

Y2 - 6 December 2020 through 12 December 2020

ER -

ID: 276208639