With the growing spread of big data, as a potential source of low-cost and timely data, there is a greater interest in whether and how it is possible to use data from non-probability samples to make inference. The main problem when the data generating mechanism is unknown and/or is presumably very different from random sampling is that estimators of population characteristics must be assumed to be biased, unless convincing evidence to the contrary is provided. Therefore, it becomes necessary to identify indicators/measures of the potential risk of non-random selection bias and to adopt predictive inference methods able to remove this bias. Users registered in the public library system of the city of Florence are a non-random subset of the resident population. Among these, the respondents to a web survey for the evaluation of the libraries’ services are the result of a further process of self-selection. The representativeness of the respondents’ data is investigated in order to evaluate the possibility to make inference on the whole Florentine population.

Assessing the representativeness of non-probability surveys: The case of public library users' survey in Florence / Emilia Rocco, Chiara Bocci, Alessandra Petrucci. - ELETTRONICO. - (2019), pp. 215-215. (Intervento presentato al convegno 12th International Conference of the ERCIM Working Group on Computational and Methodological Statistics (CMStatistics 2019) tenutosi a Londra nel 14 – 16 dicembre 2019).

Assessing the representativeness of non-probability surveys: The case of public library users' survey in Florence

Emilia Rocco
;
Chiara Bocci;Alessandra Petrucci
2019

Abstract

With the growing spread of big data, as a potential source of low-cost and timely data, there is a greater interest in whether and how it is possible to use data from non-probability samples to make inference. The main problem when the data generating mechanism is unknown and/or is presumably very different from random sampling is that estimators of population characteristics must be assumed to be biased, unless convincing evidence to the contrary is provided. Therefore, it becomes necessary to identify indicators/measures of the potential risk of non-random selection bias and to adopt predictive inference methods able to remove this bias. Users registered in the public library system of the city of Florence are a non-random subset of the resident population. Among these, the respondents to a web survey for the evaluation of the libraries’ services are the result of a further process of self-selection. The representativeness of the respondents’ data is investigated in order to evaluate the possibility to make inference on the whole Florentine population.
2019
Book of Abstracts of the CFE 2019 & ERCIM 2019 Conference
12th International Conference of the ERCIM Working Group on Computational and Methodological Statistics (CMStatistics 2019)
Londra
Emilia Rocco, Chiara Bocci, Alessandra Petrucci
File in questo prodotto:
File Dimensione Formato  
Rocco_Bocci_Petrucci_ERCIM_CMS2019abstract.pdf

Accesso chiuso

Tipologia: Pdf editoriale (Version of record)
Licenza: Tutti i diritti riservati
Dimensione 552.97 kB
Formato Adobe PDF
552.97 kB Adobe PDF   Richiedi una copia

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1181765
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact