Geochemical data are typically reported as compositions, in the form of some proportions such as weight percents, parts per million, etc., subject to a constant sum (e.g. 100%, 1,000,000 ppm). This latter implies that such data are “closed”; that is, for a composition of D-components, only D-1 components are required. The statistical analysis of compositional data has been a major issue for more than 100 years. The problem of spurious correlation, introduced by Karl Pearson in 1897, affects all data measuring parts of some whole, which are by definition, constrained; and such type of measurements are present in all fields of geochemical research. The use of the log-ratio transform was introduced by John Aitchison to overcome these constraints by opening the data into the real number space, within which standard statistical methods can be applied. However, many statisticians and users of statistics in the field of geochemistry are unaware of the problems affecting compositional data, as well as solutions that overcome these problems. A look into the ISI Web of Science and Scopus databases shows that most papers where compositional data are the core of a geochemical research continue to ignore methods to correctly manage constrained data. A key question is how we can demonstrate that the interpretation of the behaviour of chemical species in natural environment and in geochemical processes is improved when the compositional constraint of geochemical data is taken into account through the use of new methods. In order to achieve this aim, this special issue of the Journal of Geochemical Exploration focuses on the correct statistical analysis of compositional data. Applications in exploration, monitoring and environments by considering several geological matrices are presented and discussed illustrating that several paths can be followed to understand how geochemical processes work.

Compositional data analysis in geochemistry: Are we sure to see what really occurs during natural processes? / Buccianti A.; Grunsky E.. - In: JOURNAL OF GEOCHEMICAL EXPLORATION. - ISSN 0375-6742. - STAMPA. - 141:(2014), pp. 1-5. [10.1016/j.gexplo.2014.03.022]

Compositional data analysis in geochemistry: Are we sure to see what really occurs during natural processes?

BUCCIANTI, ANTONELLA;
2014

Abstract

Geochemical data are typically reported as compositions, in the form of some proportions such as weight percents, parts per million, etc., subject to a constant sum (e.g. 100%, 1,000,000 ppm). This latter implies that such data are “closed”; that is, for a composition of D-components, only D-1 components are required. The statistical analysis of compositional data has been a major issue for more than 100 years. The problem of spurious correlation, introduced by Karl Pearson in 1897, affects all data measuring parts of some whole, which are by definition, constrained; and such type of measurements are present in all fields of geochemical research. The use of the log-ratio transform was introduced by John Aitchison to overcome these constraints by opening the data into the real number space, within which standard statistical methods can be applied. However, many statisticians and users of statistics in the field of geochemistry are unaware of the problems affecting compositional data, as well as solutions that overcome these problems. A look into the ISI Web of Science and Scopus databases shows that most papers where compositional data are the core of a geochemical research continue to ignore methods to correctly manage constrained data. A key question is how we can demonstrate that the interpretation of the behaviour of chemical species in natural environment and in geochemical processes is improved when the compositional constraint of geochemical data is taken into account through the use of new methods. In order to achieve this aim, this special issue of the Journal of Geochemical Exploration focuses on the correct statistical analysis of compositional data. Applications in exploration, monitoring and environments by considering several geological matrices are presented and discussed illustrating that several paths can be followed to understand how geochemical processes work.
2014
141
1
5
Buccianti A.; Grunsky E.
File in questo prodotto:
File Dimensione Formato  
Buccianti-Grunsky 2014.pdf

Accesso chiuso

Tipologia: Versione finale referata (Postprint, Accepted manuscript)
Licenza: Tutti i diritti riservati
Dimensione 318.87 kB
Formato Adobe PDF
318.87 kB Adobe PDF   Richiedi una copia

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/850701
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 102
  • ???jsp.display-item.citation.isi??? 106
social impact