UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Statistical models for point-counting data

Vermeesch, P; (2018) Statistical models for point-counting data. Earth and Planetary Science Letters , 501 pp. 112-118. 10.1016/j.epsl.2018.08.019. Green open access

[thumbnail of Vermeesch_Statistical models for point-counting data_AAM.pdf]
Preview
Text
Vermeesch_Statistical models for point-counting data_AAM.pdf - Accepted Version

Download (487kB) | Preview

Abstract

Point-counting data are a mainstay of petrography, micropalaeontology and palynology. Conventional statistical analysis of such data is fraught with problems. Commonly used statistics such as the arithmetic mean and standard deviation may produce nonsensical results when applied to point-counting data. This paper makes the case that point-counts represent a distinct class of data that requires different treatment. Point-counts are affected by a combination of (1) true compositional variability and (2) multinomial counting uncertainties. The relative magnitude of these two sources of dispersion can be assessed by a chi-square statistic and test. For datasets that pass the chi-square test for homogeneity, the ‘pooled’ composition is shown to represent the optimal estimate for the underlying population. It is obtained by simply adding together the counts of all samples and normalising the resulting values to unity. However, more often than not, point-counting datasets fail the chi-square test. The overdispersion of such datasets can be captured by a random effects model that combines a logistic normal population with the usual multinomial counting uncertainties. This gives rise to the concept of a ‘central’ composition as a more appropriate way to average overdispersed data. Two- or three-component datasets can be displayed on radial plots and ternary diagrams, respectively. Higher dimensional datasets may be visualised and interpreted by Correspondence Analysis (CA). This is a multivariate ordination technique that is similar in purpose to Principal Component Analysis (PCA). CA and PCA are both shown to be special cases of Multidimensional Scaling (MDS). Generalising this insight to multiple datasets allows point-counting data to be combined with other data types such as chemical compositions by means of 3-way MDS. All the techniques introduced in this paper have been implemented in the provenance R-package, which is available from http://provenance.london-geochron.com.

Type: Article
Title: Statistical models for point-counting data
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.epsl.2018.08.019
Publisher version: https://doi.org/10.1016/j.epsl.2018.08.019
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: statistics, point-counting, heavy mineral analysis, petrography, micropalaeontology, palynology
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Earth Sciences
URI: https://discovery.ucl.ac.uk/id/eprint/10059377
Downloads since deposit
221Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item