Visual Salience and Perceptual Grouping in Multimodal Interactivity

Frédéric Landragin 1 Nadia Bellalem 1 Laurent Romary 1
1 LANGUE ET DIALOGUE - Human-machine dialogue with a significant language component
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper deals with the pragmatic interpretation of multimodal referring expressions in man-machine dialogue systems. We show the importance of building up a structure of the visual context at a semantic level, in order to enrich the significant possibilities of interpretations and to make possible the fusion of this structure with the ones obtained from the linguistic and gesture semantic analyses. Visual salience and perceptual grouping are two notions that guide such a structuring. We thus propose a hierarchy of salience criteria linked to an algorithm that detects salient objects, as well as guidelines for grouping algorithms. We show how the integration of the results of all these algorithms is a complex problem. We propose simple heuristics to reduce this complexity and we conclude on the usability of such heuristics in actual systems.
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download

https://halshs.archives-ouvertes.fr/inria-00100576
Contributor : Frédéric Landragin <>
Submitted on : Tuesday, November 7, 2006 - 4:47:52 PM
Last modification on : Tuesday, July 23, 2019 - 4:14:09 PM
Long-term archiving on : Tuesday, April 6, 2010 - 9:48:04 PM

Identifiers

  • HAL Id : inria-00100576, version 2

Collections

Citation

Frédéric Landragin, Nadia Bellalem, Laurent Romary. Visual Salience and Perceptual Grouping in Multimodal Interactivity. First International Workshop on Information Presentation and Natural Multimodal Dialogue, 2001, pp.151-155. ⟨inria-00100576v2⟩

Share

Metrics

Record views

575

Files downloads

382