Visual Salience and Perceptual Grouping in Multimodal Interactivity

Frédéric Landragin; Nadia Bellalem; Laurent Romary

Conference paper, Year: 2001

Frédéric Landragin

Role: Author
PersonId : 5570
IdHAL : frederic-landragin
IdRef : 071347321

Human-machine dialogue with a significant language component

Nadia Bellalem

Role: Author

Human-machine dialogue with a significant language component

Laurent Romary

Role: Author
PersonId : 307
IdHAL : laurentromary
ORCID : 0000-0002-0756-0508
IdRef : 060702494

Human-machine dialogue with a significant language component

Abstract

This paper deals with the pragmatic interpretation of multimodal referring expressions in man-machine dialogue systems. We show the importance of structuring the visual context at a semantic level, both to enrich the range of meaningful interpretations and to allow this structure to be fused with those obtained from the linguistic and gesture semantic analyses. Visual salience and perceptual grouping are two notions that guide this structuring. We therefore propose a hierarchy of salience criteria linked to an algorithm that detects salient objects, as well as guidelines for grouping algorithms. We show that integrating the results of all these algorithms is a complex problem. We propose simple heuristics to reduce this complexity, and we conclude by discussing the usability of such heuristics in actual systems.

Keywords

Multimodal interaction, context modeling, visual perception, visual salience, perceptual grouping, Gestalt theory

Domains

Information and communication sciences; Human-computer interaction [cs.HC]; Multimedia [cs.MM]; Computer science

Full metadata list

Deposit format	File
Deposit type	Conference paper
Title (en)	Visual Salience and Perceptual Grouping in Multimodal Interactivity
Author(s)	Frédéric Landragin ¹, Nadia Bellalem ¹, Laurent Romary ¹. ¹ LANGUE ET DIALOGUE - Human-machine dialogue with a significant language component (2351) - France: INRIA Lorraine (2496); Institut National de Recherche en Informatique et en Automatique (300009); Laboratoire Lorrain de Recherche en Informatique et ses Applications (466633); Institut National de Recherche en Informatique et en Automatique (300009); Université Henri Poincaré - Nancy 1 (300291); Université Nancy 2 (300292); Institut National Polytechnique de Lorraine (300293); Centre National de la Recherche Scientifique UMR7503 (441569)
Publication date	2001
Pages/Identifier	151-155
Conference title	First International Workshop on Information Presentation and Natural Multimodal Dialogue
Conference start date	2001
City	Verona
Country	Italy
Document language	English
Date of production/writing	2001
Popular science	No
Peer-reviewed	Yes
Invited	No
Audience	Not specified
Proceedings	Yes
Domain(s)	Humanities and Social Sciences/Information and communication sciences; Computer Science [cs]/Human-Computer Interaction [cs.HC]; Computer Science [cs]/Multimedia [cs.MM]; Cognitive science/Computer science
Keywords (en)	Multimodal interaction, context modeling, visual perception, visual salience, perceptual grouping, Gestalt theory

Main file

landragin.pdf (88.32 KB)

https://shs.hal.science/inria-00100576

Submitted on: Tuesday, 7 November 2006 at 16:47:52

Last modified on: Friday, 24 March 2023 at 14:52:48

Long-term archiving on: Tuesday, 6 April 2010 at 21:48:04

Dates and versions

inria-00100576, version 1 (26-09-2006)

inria-00100576, version 2 (07-11-2006)

Identifiers

HAL Id: inria-00100576, version 2

Cite

Frédéric Landragin, Nadia Bellalem, Laurent Romary. Visual Salience and Perceptual Grouping in Multimodal Interactivity. First International Workshop on Information Presentation and Natural Multimodal Dialogue, 2001, Verona, Italy. pp. 151-155. ⟨inria-00100576v2⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA
