Clustering Flood Events from Water Quality Time-Series using Latent Dirichlet Allocation Model

To improve hydro-chemical modeling and forecasting, there is a need to better understand flood-induced variability in water chemistry and the processes controlling it in watersheds. In the literature, assumptions are often made, for instance, that stream chemistry reacts differently to rainfall events depending on the season; however, methods to verify such assumptions are not well developed. Often, few floods are studied at a time and chemicals are used as tracers. Grouping similar events from large multivariate datasets using principal component analysis and clustering methods helps to explain hydrological processes; however, these methods currently have some limits (definition of flood descriptors, linear assumption, for instance). Most clustering methods have been used in the context of regionalization, focusing more on mapping results than on understanding processes. In this study, we extracted flood patterns using the probabilistic Latent Dirichlet Allocation (LDA) model, its first use in hydrology, to our knowledge. The LDA method allows multivariate temporal datasets to be considered without having to define explanatory factors beforehand or select representative floods. We analyzed a multivariate dataset from a long-term observatory (Kervidy-Naizin, western France) containing data for four solutes monitored daily for 12 years: nitrate, chloride, dissolved organic carbon, and sulfate. The LDA method extracted four different patterns that were distributed by season. Each pattern can be explained by seasonal hydrological processes. Hydro-meteorological parameters help explain the processes leading to these patterns, which increases understanding of flood-induced variability in water quality. Thus, the LDA method appears useful for analyzing long-term datasets.

Mots clés

temporal pattern extraction time series analysis probabilistic model flood events Water quality

Domaines

Géographie Sciences de l'information et de la communication

Liste complète des métadonnées

Format du dépôt	Fichier
Type de dépôt	Article dans une revue
Titre	en Clustering Flood Events from Water Quality Time-Series using Latent Dirichlet Allocation Model
Résumé	en To improve hydro-chemical modeling and forecasting, there is a need to better understand flood-induced variability in water chemistry and the processes controlling it in watersheds. In the literature, assumptions are often made, for instance, that stream chemistry reacts differently to rainfall events depending on the season; however, methods to verify such assumptions are not well developed. Often, few floods are studied at a time and chemicals are used as tracers. Grouping similar events from large multivariate datasets using principal component analysis and clustering methods helps to explain hydrological processes; however, these methods currently have some limits (definition of flood descriptors, linear assumption, for instance). Most clustering methods have been used in the context of regionalization, focusing more on mapping results than on understanding processes. In this study, we extracted flood patterns using the probabilistic Latent Dirichlet Allocation (LDA) model, its first use in hydrology, to our knowledge. The LDA method allows multivariate temporal datasets to be considered without having to define explanatory factors beforehand or select representative floods. We analyzed a multivariate dataset from a long-term observatory (Kervidy-Naizin, western France) containing data for four solutes monitored daily for 12 years: nitrate, chloride, dissolved organic carbon, and sulfate. The LDA method extracted four different patterns that were distributed by season. Each pattern can be explained by seasonal hydrological processes. Hydro-meteorological parameters help explain the processes leading to these patterns, which increases understanding of flood-induced variability in water quality. Thus, the LDA method appears useful for analyzing long-term datasets.
Auteur(s)	Alice Aubert ¹ , Romain Tavenard ² , Rémi Emonet ³ , Alban de Lavenne ¹ , Simon Malinowski ⁴ , Thomas Guyet ⁴ , René Quiniou ⁴ , Jean-Marc Odobez ³ , Philippe Mérot ¹ , Chantal Gascuel ¹ 1 SAS - Sol Agro et hydrosystème Spatialisation ( 138970 ) - UMR 1069, Sol Agro et Hydrosystème Spatialisation, Batiment 13, Agrocampus Ouest, 65 rue de Saint Brieuc CS 84215 35042 Rennes CEDEX - France Institut National de la Recherche Agronomique UMR1069 ( 92114 ) ; AGROCAMPUS OUEST ( 108028 ) 2 LETG - Rennes - Littoral, Environnement, Télédétection, Géomatique ( 3177 ) - Maison de la Recherche Place du Recteur Henri Le Moal 35043 RENNES CEDEX - France Littoral, Environnement, Télédétection, Géomatique UMR 6554 ( 14266 ) ; Université de Caen Normandie ( 7127 ) ; Normandie Université ( 455934 ) ; Université d'Angers ( 74911 ) ; École Pratique des Hautes Études ( 110691 ) ; Université Paris Sciences et Lettres ( 564132 ) ; Université de Brest ( 300314 ) ; Université de Rennes 2 ( 406201 ) ; Centre National de la Recherche Scientifique UMR 6554 ( 441569 ) ; Institut de Géographie et d'Aménagement Régional de l'Université de Nantes ( 530572 ) ; Université de Nantes 93263 ( 93263 ) 3 IDIAP Research Institute ( 74654 ) - Centre du Parc Rue Marconi 19 PO Box 592 CH - 1920 Martigny Switzerland - Suisse 4 DREAM - Diagnosing, Recommending Actions and Modelling ( 2516 ) - Campus de Beaulieu 35042 Rennes cedex - France Inria Rennes – Bretagne Atlantique ( 419153 ) ; Institut National de Recherche en Informatique et en Automatique ( 300009 ) ; GESTION DES DONNÉES ET DE LA CONNAISSANCE ( 419370 ) ; Institut de Recherche en Informatique et Systèmes Aléatoires ( 105128 ) ; Université de Rennes ( 105160 ) ; Institut National des Sciences Appliquées - Rennes ( 117606 ) ; Institut National des Sciences Appliquées ( 301232 ) ; Université de Bretagne Sud ( 172265 ) ; École normale supérieure - Rennes ( 247362 ) ; Institut National de Recherche en Informatique et en Automatique ( 300009 ) ; Télécom Bretagne ( 301262 ) ; CentraleSupélec ( 411575 ) ; Centre National de la Recherche Scientifique UMR6074 ( 441569 )
Numéro	12
Page/Identifiant	8187–8199
Nom de la revue	Water Resources Research (ISSN : 0043-1397, ISSN électronique : 1944-7973) American Geophysical Union Publié par American Geophysical Union https://www.agu.org/Publish-with-AGU/Publish/AGU-Publications-Scientific-Ethics-and-Integrity
Date de production/écriture	2013
Vulgarisation	Non
Langue du document	Anglais
Volume	49
Date de publication	2013
Audience	Internationale
Comité de lecture	Oui
Public visé	Scientifique
Sous-type de document pour les Articles	Research article
Domaine(s)	Sciences de l'Homme et Société/Géographie Sciences de l'Homme et Société/Sciences de l'information et de la communication
Indexation contrôlée	qualité de l'eau analyse de série temporelle extraction de motifs modèle probabiliste inondation
Mots-clés	en temporal pattern extraction, time series analysis, probabilistic model, flood events, Water quality
DOI	10.1002/2013WR014086
ProdINRA	292987
UT key WOS	000329929100025

Fichier principal

aubert_wrr_final.pdf ( 1.45 Mo )

Origine : Fichiers produits par l'(les) auteur(s)

Romain Tavenard : Connectez-vous pour contacter le contributeur

https://shs.hal.science/halshs-00906292

Soumis le : vendredi 29 novembre 2013 à 12:14:35

Dernière modification le : vendredi 19 avril 2024 à 16:18:56

Archivage à long terme le : lundi 3 mars 2014 à 14:26:28

Dates et versions

halshs-00906292, version 1 (29-11-2013)

Identifiants

HAL Id : halshs-00906292 , version 1
DOI : 10.1002/2013WR014086
PRODINRA : 292987
WOS : 000329929100025

Citer

Alice Aubert, Romain Tavenard, Rémi Emonet, Alban de Lavenne, Simon Malinowski, et al.. Clustering Flood Events from Water Quality Time-Series using Latent Dirichlet Allocation Model. Water Resources Research, 2013, 49 (12), pp.8187-8199. ⟨10.1002/2013WR014086⟩. ⟨halshs-00906292⟩

Exporter

BibTeX TEI Dublin Core DC Terms EndNote Datacite

Collections

UNIV-BREST UNIV-NANTES INSTITUT-TELECOM EC-PARIS EPHE UNIV-RENNES1 UR2-HB CNRS INRIA UNIV-ANGERS INSA-RENNES INRA IRISA LETG LETG-COSTEL UNAM IRISA-D7 COMUE-NORMANDIE INRIA2 PSL UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES2 UNIV-RENNES UNICAEN IGARUN INRAE UR1-MATH-NUM UMR-SAS NANTES-UNIVERSITE INSTITUT-AGRO-RENNES-ANGERS-UMR-IRISA INSTITUT-AGRO-RENNES-ANGERS

1091 Consultations

711 Téléchargements

Dernière date de mise à jour le 20/04/2024