UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

A continuous binning for discrete, sparse and concentrated observations

Prieto Curiel, R; Cabrera Arnau, C; Torres Pinedo, M; González Ramírez, H; Bishop, SR; (2020) A continuous binning for discrete, sparse and concentrated observations. MethodsX , 7 , Article 100709. 10.1016/j.mex.2019.10.020. Green open access

[thumbnail of 1-s2.0-S2215016119302857-main.pdf]
Preview
Text
1-s2.0-S2215016119302857-main.pdf - Published Version

Download (1MB) | Preview

Abstract

Discrete observations from data which are obtained from sparse, and yet concentrated events are often observed (e.g. road accidents or murders). Traditional methods to compute summary statistics often include placing the data in discrete bins but for this type of data this approach often results in large numbers of empty bins for which no function or summary statistic can be computed. Here, a method for dealing with sparse and concentrated observations is constructed, based on a sequence of non-overlapping bins of varying size, which gives a continuous interpolation of data for computing summary statistics of the values for the data, such as the mean. The method presented here overcomes the problem which sparsity and concentration present when computing functions to represent the data. Implementation of the method presented here is facilitated via open access to the code. •A new method for computing functions over sparse and concentrated data is constructed.•The method allows straightforward functions to be computed over partitions of the data, such as the mean, but also more complicated functions, such as coefficients, ratios, correlations, regressions and others.

Type: Article
Title: A continuous binning for discrete, sparse and concentrated observations
Location: Netherlands
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.mex.2019.10.020
Publisher version: https://doi.org/10.1016/j.mex.2019.10.020
Language: English
Additional information: This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
Keywords: Continuous binning, Discrete data, Smooth functions evaluated in concentrated observations, Sparse data
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Mathematics
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of the Built Environment
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of the Built Environment > Centre for Advanced Spatial Analysis
URI: https://discovery.ucl.ac.uk/id/eprint/10105512
Downloads since deposit
49Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item