Skip to Main content Skip to Navigation
Conference papers

Clustering of textual networks: analysing open-ended questions in text data of the perception of minerality in wine

Abstract : Open-ended questions are commonly used in sensory analyses, and are usually dealt with by correspondence analysis (CA) of the term-respondent matrix. CA is apt in detecting strong associations between terms and groups of respondents, but less so when the questions are interpreted differently among respondents and, thus, seem to open a polysemic space for the answers. Also, CA offers little flexibility in filtering out irrelevant textual structure, or in controlling the relative contribution of rare versus frequent terms in the overall analysis. This contribution presents methodological extensions of CA together with application on a survey of 1900 responses bearing upon the understanding of the term "minerality" in wine, whose ambiguity is well attested. Clusters of terms, associated to different meanings of "minerality", are successfully retrieved and visualized. Technically, term-respondent matrix generates a weighted undirected network of positive definite edge weights between terms, interpretable as Markov associativities between terms, whose marginals define term weights. Its eigen-structure is intimately related to spectral clustering, as well as to K-means clustering and MDS visualization of chi2 dissimilarities between terms. The associativities can be renormalized by multiplying edge weights by powers of term weights, enabling the analyst to control the contribution of term weights. Also, modularity maximisation, popular for its efficient yet arguably instable clustering properties, is shown to correspond to a variant of spectral clustering for some power of renormalized associativities.
Document type :
Conference papers
Complete list of metadata
Contributor : MSH Dijon Maison des Sciences de l'Homme Connect in order to contact the contributor
Submitted on : Thursday, December 3, 2015 - 5:28:40 PM
Last modification on : Friday, March 12, 2021 - 2:14:01 PM


  • HAL Id : halshs-01237789, version 1


Laurent Gautier, Yves Y. Le Fur, François Bavaud. Clustering of textual networks: analysing open-ended questions in text data of the perception of minerality in wine. 12th Sensometrics 2014 : Data that works in the city that works, The Sensometric Society, Jul 2014, Chicago, United States. ⟨halshs-01237789⟩



Record views