TY - GEN
T1 - Subspace Inference in SDE-Net for Bayesian Deep Neural Networks
AU - Wang, Yong Guang
AU - Yao, Shu Zhen
AU - Tan, Huo Bin
N1 - Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
N2 - Bayesian inference was once a gold standard for uncertainty estimation with neural networks. However, Bayesian inference is inefficient in high-dimensional parameter spaces of Deep Neural Networks (DNNs). A novel Non-Bayesian Neural Stochastic Differential Equation (SDE-Net) model quantifies epistemic uncertainty of DNNs from the perspective of dynamical systems. In this paper, we propose a subspace inference procedure in Non-Bayesian SDE-Net method for uncertainty estimation from Bayesian perspectives. This procedure contains two steps, first of all, the low-dimensional parameter subspaces of SDE-Net model are constructed to generate the first principal components from Stochastic Gradient Descent (SGD) trajectories, which involve various high-performance models. Secondly, Variational Inference (VI) and Elliptical Slice Sampling (ESS) methods are implemented in the constructed subspace to explore in the full parameter spaces for uncertainty estimation. The experimental results of Bayesian average on the derived posterior in the generated principal subspaces of model parameters show that accurate and well-calibrated results can be obtained for regression and classification tasks.
AB - Bayesian inference was once a gold standard for uncertainty estimation with neural networks. However, Bayesian inference is inefficient in high-dimensional parameter spaces of Deep Neural Networks (DNNs). A novel Non-Bayesian Neural Stochastic Differential Equation (SDE-Net) model quantifies epistemic uncertainty of DNNs from the perspective of dynamical systems. In this paper, we propose a subspace inference procedure in Non-Bayesian SDE-Net method for uncertainty estimation from Bayesian perspectives. This procedure contains two steps, first of all, the low-dimensional parameter subspaces of SDE-Net model are constructed to generate the first principal components from Stochastic Gradient Descent (SGD) trajectories, which involve various high-performance models. Secondly, Variational Inference (VI) and Elliptical Slice Sampling (ESS) methods are implemented in the constructed subspace to explore in the full parameter spaces for uncertainty estimation. The experimental results of Bayesian average on the derived posterior in the generated principal subspaces of model parameters show that accurate and well-calibrated results can be obtained for regression and classification tasks.
KW - Elliptical Slice Sampling
KW - Neural Stochastic Differential Equation
KW - Subspace Inference
KW - Uncertainty Estimation
KW - Variational Inference
UR - https://www.scopus.com/pages/publications/85128107053
U2 - 10.1109/CAC53003.2021.9727592
DO - 10.1109/CAC53003.2021.9727592
M3 - 会议稿件
AN - SCOPUS:85128107053
T3 - Proceeding - 2021 China Automation Congress, CAC 2021
SP - 5464
EP - 5469
BT - Proceeding - 2021 China Automation Congress, CAC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 China Automation Congress, CAC 2021
Y2 - 22 October 2021 through 24 October 2021
ER -