Vector-valued Distribution Regression - Keep It Simple and Consistent

Szabo, Z; Sriperumbudur, B; Poczos, B; Gretton, A; (2015) Vector-valued Distribution Regression - Keep It Simple and Consistent. Presented at: CSML reading group, Department of Statistics, University of Oxford, Oxford, United Kingdom. Green open access

Zoltan_Szabo_invited_talk_University_of_Warwick_PPT.pdf

Abstract

We tackle the distribution regression problem (DRP): regressing from probability measures to vector-valued outputs in the two-stage sampled case, where the input distributions are only available through samples. Numerous important and challenging machine learning and statistical tasks fit into this problem family, such as multi-instance learning and point estimation. Although the literature contains a vast number of heuristics for the DRP, to the best of our knowledge the only existing technique with performance guarantees requires density estimation (which often scales poorly in practice) and requires the distributions to have densities on compact Euclidean domains. In this talk, I present a simple alternative for solving the DRP: embed the input distributions into a reproducing kernel Hilbert space, then apply ridge regression from the embeddings to the outputs. We prove that the proposed approach is consistent: we derive finite-sample excess risk bounds that hold with high probability and establish explicit convergence rates as a function of the problem difficulty and the sample numbers. Specifically, we justify the applicability of set kernels in regression, which was a 15-year-old open question, and construct alternative kernels on the embedded distributions. The studied scheme is viable under mild conditions, on separable topological domains endowed with kernels. We demonstrate the efficiency of the method in two applications: supervised entropy learning and aerosol optical depth prediction based on multispectral satellite images.
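
The two-step scheme described in the abstract (embed each sampled distribution into an RKHS, then run ridge regression on the embeddings) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation. It assumes a Gaussian kernel on the samples and a linear kernel on the mean embeddings, which yields a set kernel (the average of pairwise kernel values between two bags). The function names, the bandwidth, the `n * lam` regularization scaling, and the toy data are all assumptions introduced here for illustration.

```python
import numpy as np

def gaussian_gram(X, Z, bandwidth=1.0):
    """Gaussian kernel matrix between the rows of X and the rows of Z."""
    sq = (np.sum(X**2, axis=1)[:, None]
          + np.sum(Z**2, axis=1)[None, :]
          - 2.0 * X @ Z.T)
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth**2))

def set_kernel(bag_a, bag_b, bandwidth=1.0):
    """Inner product of the empirical mean embeddings of two bags:
    the average of all pairwise kernel values (a set kernel)."""
    return gaussian_gram(bag_a, bag_b, bandwidth).mean()

def fit(bags, Y, lam=1e-3, bandwidth=1.0):
    """Kernel ridge regression from bags to vector-valued targets Y (n x d).
    The n * lam scaling of the ridge term is an assumed convention."""
    n = len(bags)
    K = np.array([[set_kernel(a, b, bandwidth) for b in bags] for a in bags])
    return np.linalg.solve(K + n * lam * np.eye(n), Y)

def predict(train_bags, alpha, test_bags, bandwidth=1.0):
    """Predict outputs for new bags from the fitted coefficients alpha."""
    K_test = np.array([[set_kernel(t, b, bandwidth) for b in train_bags]
                       for t in test_bags])
    return K_test @ alpha

# Toy data (hypothetical): each bag holds 50 samples from N(m, 1);
# the vector-valued target is (m, m^2).
rng = np.random.default_rng(0)

def make_bags(means):
    return [rng.normal(m, 1.0, size=(50, 1)) for m in means]

train_means = rng.uniform(-2.0, 2.0, size=40)
test_means = rng.uniform(-2.0, 2.0, size=5)
train_bags, test_bags = make_bags(train_means), make_bags(test_means)
Y_train = np.column_stack([train_means, train_means**2])

alpha = fit(train_bags, Y_train)
pred = predict(train_bags, alpha, test_bags)  # shape (5, 2)
```

Here each input is observed only through its bag of samples, matching the two-stage sampled setting. Replacing the linear kernel on embeddings with a nonlinear one would correspond to the alternative kernels on embedded distributions mentioned in the abstract, and would require changing `set_kernel` only.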

Type: Conference item (Presentation)
Title: Vector-valued Distribution Regression - Keep It Simple and Consistent
Event: CSML reading group, Department of Statistics, University of Oxford
Location: Oxford, United Kingdom
Dates: 01 May 2015
Open access status: An open access version is available from UCL Discovery
Publisher version: http://www.gatsby.ucl.ac.uk/~szabo/talks/invited_t...
Language: English
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Gatsby Computational Neurosci Unit
URI: https://discovery.ucl.ac.uk/id/eprint/1468353