Biases on variances estimated on large data-sets - HAL-SHS - Sciences de l'Homme et de la Société Accéder directement au contenu
Autre Publication Scientifique Documents de travail du Centre d'Économie de la Sorbonne Année : 2021

Biases on variances estimated on large data-sets

Résumé

The inverse dependency of the estimated variances over the sample size throws a fundamental question on the validity of the usual statistical methodology, since any hypothesis on the value of a coefficient can be tested negatively by increasing the size of the data-set. I suppose that large data-sets are characterized by a concentration of information on homogenous sub-populations, a spatial autocorrelation of the error terms and the covariates may bias the estimation of variances. Using the corrections of variances under spatial autocorrelation, we obtain variances comparable to an estimation on sub-samples (named efficient sub-samples) the sizes of which are sufficient to contain the information which gives rise to similar estimates to those obtained on the whole population. Moreover, the estimation on efficient data-sets does not necessitate the specification of the spatial autocorrelations which are supposed to bias the estimated variances.
Fichier principal
Vignette du fichier
21022.pdf (669.61 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

halshs-03325118 , version 1 (24-08-2021)

Identifiants

  • HAL Id : halshs-03325118 , version 1

Citer

François Gardes. Biases on variances estimated on large data-sets. 2021. ⟨halshs-03325118⟩
280 Consultations
56 Téléchargements

Partager

Gmail Facebook X LinkedIn More