Rivasplata, O;
Kuzborskij, I;
Szepesvári, C;
Shawe-Taylor, J;
(2020)
PAC-Bayes analysis beyond the usual bounds.
In:
Advances in Neural Information Processing Systems 33 (NeurIPS 2020).
Neural Information Processing Systems (NeurIPS): Vancouver, Canada.
Preview |
Text
PAC_Bayes_Analysis_Beyond_the_Usual_Bounds___NeurIPS_2020_full.pdf - Accepted Version Download (449kB) | Preview |
Abstract
We focus on a stochastic learning model where the learner observes a finite set of training examples and the output of the learning process is a data-dependent distribution over a space of hypotheses. The learned data-dependent distribution is then used to make randomized predictions, and the high-level theme addressed here is guaranteeing the quality of predictions on examples that were not seen during training, i.e. generalization. In this setting the unknown quantity of interest is the expected risk of the data-dependent randomized predictor, for which upper bounds can be derived via a PAC-Bayes analysis, leading to PAC-Bayes bounds. Specifically, we present a basic PAC-Bayes inequality for stochastic kernels, from which one may derive extensions of various known PAC-Bayes bounds as well as novel bounds. We clarify the role of the requirements of fixed ‘data-free’ priors, bounded losses, and i.i.d. data. We highlight that those requirements were used to upper-bound an exponential moment term, while the basic PAC-Bayes theorem remains valid without those restrictions. We present three bounds that illustrate the use of data-dependent priors, including one for the unbounded square loss.
Type: | Proceedings paper |
---|---|
Title: | PAC-Bayes analysis beyond the usual bounds |
Event: | 34th Conference on Neural Information Processing Systems |
Open access status: | An open access version is available from UCL Discovery |
Publisher version: | https://papers.nips.cc/paper/2020/hash/c3992e9a68c... |
Language: | English |
Additional information: | This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions. |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Mathematics |
URI: | https://discovery.ucl.ac.uk/id/eprint/10137497 |
Archive Staff Only
View Item |