Clustering Flood Events from Water Quality Time-Series using Latent Dirichlet Allocation Model

Alice Aubert 1, *, Romain Tavenard 2, Rémi Emonet 3, Alban De Lavenne 1, Simon Malinowski 4, Thomas Guyet 4, René Quiniou 4, Jean-Marc Odobez 3, Philippe Mérot 1, Chantal Gascuel-Odoux 1
* Corresponding author
2 LETG - Rennes - Littoral, Environnement, Télédétection, Géomatique
LETG - Littoral, Environnement, Télédétection, Géomatique UMR 6554
4 DREAM - Diagnosing, Recommending Actions and Modelling
Abstract: To improve hydro-chemical modeling and forecasting, there is a need to better understand flood-induced variability in water chemistry and the processes controlling it in watersheds. In the literature, assumptions are often made, for instance, that stream chemistry reacts differently to rainfall events depending on the season; however, methods to verify such assumptions are not well developed. Often, few floods are studied at a time and chemicals are used as tracers. Grouping similar events from large multivariate datasets using principal component analysis and clustering methods helps to explain hydrological processes; however, these methods currently have some limits (definition of flood descriptors, linear assumption, for instance). Most clustering methods have been used in the context of regionalization, focusing more on mapping results than on understanding processes. In this study, we extracted flood patterns using the probabilistic Latent Dirichlet Allocation (LDA) model, its first use in hydrology, to our knowledge. The LDA method allows multivariate temporal datasets to be considered without having to define explanatory factors beforehand or select representative floods. We analyzed a multivariate dataset from a long-term observatory (Kervidy-Naizin, western France) containing data for four solutes monitored daily for 12 years: nitrate, chloride, dissolved organic carbon, and sulfate. The LDA method extracted four different patterns that were distributed by season. Each pattern can be explained by seasonal hydrological processes. Hydro-meteorological parameters help explain the processes leading to these patterns, which increases understanding of flood-induced variability in water quality. Thus, the LDA method appears useful for analyzing long-term datasets.
Document type:
Journal article
Water Resources Research, American Geophysical Union, 2013, 49 (12), pp.8187-8199. 〈10.1002/2013WR014086〉

Cited literature: 37 references
Contributor: Romain Tavenard
Submitted on: Friday, November 29, 2013 - 12:14:35
Last modified on: Thursday, February 7, 2019 - 17:35:37
Document(s) archived on: Monday, March 3, 2014 - 14:26:28


Files produced by the author(s)



Alice Aubert, Romain Tavenard, Rémi Emonet, Alban De Lavenne, Simon Malinowski, et al. Clustering Flood Events from Water Quality Time-Series using Latent Dirichlet Allocation Model. Water Resources Research, American Geophysical Union, 2013, 49 (12), pp.8187-8199. 〈10.1002/2013WR014086〉. 〈halshs-00906292〉


