arXiv Analytics

arXiv:2411.00273 [cs.LG]

Efficient Model Compression for Bayesian Neural Networks

Diptarka Saha, Zihe Liu, Feng Liang

Published 2024-11-01 (Version 1)

Model compression has recently drawn much attention within the deep learning community. Compressing a dense neural network offers many advantages, including lower computation cost, deployability on devices with limited storage and memory, and resistance to adversarial attacks. Compression may be achieved via weight pruning or by fully discarding certain input features. Here we demonstrate a novel strategy that emulates principles of Bayesian model selection in a deep learning setup. Given a fully connected Bayesian neural network with spike-and-slab priors trained via a variational algorithm, we obtain the posterior inclusion probability for every node, a quantity that is typically lost. We employ these probabilities for pruning and feature selection on a range of simulated and real-world benchmark datasets and find evidence of better generalizability of the pruned model in all our experiments.
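
For intuition, the sketch below shows one way posterior inclusion probabilities could drive node pruning. It is a minimal illustration under assumed details, not the authors' implementation: the SpikeSlabLayer class, the PIP-scaled forward pass, and the 0.5 pruning threshold are all hypothetical choices made for demonstration.

import torch
import torch.nn as nn

class SpikeSlabLayer(nn.Module):
    """Hidden layer whose output nodes carry Bernoulli spike-and-slab inclusion variables."""
    def __init__(self, in_features, out_features):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)
        # Variational logits of the node-wise inclusion variables (assumed parameterization).
        self.incl_logit = nn.Parameter(torch.randn(out_features))

    def pip(self):
        # Posterior inclusion probability q(z_j = 1) under the mean-field variational posterior.
        return torch.sigmoid(self.incl_logit)

    def forward(self, x, hard_mask=None):
        h = self.linear(x)
        # During training each node is scaled by its PIP (the variational expectation of z_j);
        # after pruning, a hard 0/1 mask takes its place.
        scale = self.pip() if hard_mask is None else hard_mask
        return h * scale

def prune_by_pip(layer, threshold=0.5):
    """Keep only the nodes whose posterior inclusion probability exceeds `threshold`."""
    with torch.no_grad():
        return (layer.pip() > threshold).float()

# After variational training, threshold the PIPs to drop low-probability nodes.
layer = SpikeSlabLayer(20, 50)
x = torch.randn(8, 20)
mask = prune_by_pip(layer, threshold=0.5)
out = layer(x, hard_mask=mask)   # forward pass through the pruned layer
print(f"kept {int(mask.sum())} of {mask.numel()} nodes")

The same thresholding idea applies to the input layer, where dropping a node with low inclusion probability corresponds to discarding the associated feature.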

Related articles:
arXiv:2405.17522 [cs.LG] (Published 2024-05-27)
Efficient Model Compression for Hierarchical Federated Learning
arXiv:2207.10702 [cs.LG] (Published 2022-07-21)
Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing
arXiv:2307.09994 [cs.LG] (Published 2023-07-19)
Impact of Disentanglement on Pruning Neural Networks