For a Cautious Use of Big Data and Computation

Abstract : The so-called big data revolution resides as much in the availability of large datasets of novel and various types as in the always increasing available computational power. Although the computational shift (Arthur (2015)) is central for a science aware of complexity and is undeniably the basis of future modeling practices in geography as Banos (2013) points out, we argue that both data deluge and computational potentialities are dangerous if not framed into a proper theoretical and formal framework. The first may bias research directions towards available datasets (as e.g. numerous twitter mobility studies) with the risk to disconnect from a theoretical background, whereas the second may overshadow preliminaries analytical resolutions essential for a consistent use of simulations. We illustrate this idea with an example on well-studied geographical objects that are interactions between networks and territories. We compute static correlations between indicators of urban form and indicators of road network topology, using open datasets of european population density (EUROSTAT (2014)) and OpenStreetMap. A mathematical derivation of the link between spatial covariance at fixed time and dynamical covariance for spatio-temporal stochastic processes, combined with a theory of city-transportation interactions within evolutive urban systems on long times (Bretagnolle (2009)), allows to infer knowledge on involved geographical processes from empirical static correlations. In particular we show the regional nature of dynamic interactions, confirming the non-ergodicity of urban systems (Pumain (2012)). We argue that the two conditions for this result are indeed the ones endangered by incautious big-data enthusiasm, concluding that a main challenge for future Geocomputation is a wise integration of novel practices within the existing body of knowledge.
Document type :
Conference papers
Complete list of metadatas

https://halshs.archives-ouvertes.fr/halshs-01358440
Contributor : Juste Raimbault <>
Submitted on : Wednesday, August 31, 2016 - 5:41:34 PM
Last modification on : Wednesday, July 17, 2019 - 2:34:45 AM

Identifiers

  • HAL Id : halshs-01358440, version 1

Citation

Juste Raimbault. For a Cautious Use of Big Data and Computation. Royal Geographical Society - Annual Conference 2016 - Session : Geocomputation, the Next 20 Years (1), Aug 2016, London, United Kingdom. ⟨halshs-01358440⟩

Share

Metrics

Record views

407

Files downloads

92