Observational and reinforcement pattern-learning: An exploratory study

Understanding how individuals learn in an unknown environment is an important problem in economics. We model and examine experimentally behavior in a very simple multi-armed bandit framework in which participants do not know the inter-temporal payoff structure. We propose a baseline reinforcement learning model that allows for pattern-recognition and change in the strategy space. We also analyse three augmented versions that accommodate observational learning from the actions and/or payoffs of another player. The models successfully reproduce the distributional properties of observed discovery times and total payoffs. Our study further shows that when one of the pair discovers the hidden pattern, observing another's actions and/or payoffs improves discovery time compared to the baseline case.

Keywords

Multi-armed bandit, Reinforcement learning, Payoff patterns, Observational learning

Domains

Economics and finance

Full metadata list

Deposit format	File
Deposit type	Journal article
Author(s)	Nobuyuki Hanaki (1, 2, 3), Alan Kirman (4, 5, 6), Paul Pezanis-Christou (7, 8)
	1 GREDEG - Groupe de Recherche en Droit, Economie et Gestion (185786) - GREDEG - Bâtiment 2 - Campus Azur du CNRS - 250 rue Albert Einstein - CS 10269 - F 06905 SOPHIA ANTIPOLIS Cedex - France; Université Nice Sophia Antipolis (1965 - 2019) (117617); Centre National de la Recherche Scientifique UMR7321 (441569); Université Côte d'Azur UMR7321 (1039632)
	2 CNRS - Centre National de la Recherche Scientifique (441569) - France
	3 UniCA - Université Côte d'Azur (1039632) - Parc Valrose, 28, avenue Valrose 06108 Nice Cedex 2 - France
	4 EHESS - École des hautes études en sciences sociales (99539) - 54, boulevard Raspail 75006 Paris - France
	5 CAMS - Centre d'Analyse et de Mathématique sociales (1318) - 54 boulevard Raspail 75006 Paris - France; École des hautes études en sciences sociales (99539); Centre National de la Recherche Scientifique UMR8557 (441569)
	6 AMU - Aix Marseille Université (198056) - Aix-Marseille Université Jardins du Pharo 58 Boulevard Charles Livon 13284 Marseille cedex 7 - France
	7 BETA - Bureau d'Économie Théorique et Appliquée (93745) - Université de Lorraine, UFR Droit Sciences Economiques et Gestion, 13 place Carnot CO 70026, 54035 Nancy Cedex; Université de Strasbourg, Faculté des Sciences Economiques et de Gestion, 61 avenue de la Forêt Noire 67085 Strasbourg Cedex - France; Institut National de la Recherche Agronomique UMR1443 (92114); Université de Strasbourg (199013); Université de Lorraine (413289); Centre National de la Recherche Scientifique UMR7522 (441569)
	8 University of Adelaide (116469) - Adelaide, South Australia, 5005 Australia
Journal name	European Economic Review (ISSN: 0014-2921), published by Elsevier, https://www.sciencedirect.com/journal/european-economic-review
Document language	English
Page/Identifier	1-21
Volume	104
Publication date	2018-05
Audience	International
Peer-reviewed	Yes
Popular science	No
Keywords (JEL)	D - Microeconomics/D.D8 - Information, Knowledge, and Uncertainty/D.D8.D81 - Criteria for Decision-Making under Risk and Uncertainty; D - Microeconomics/D.D8 - Information, Knowledge, and Uncertainty/D.D8.D83 - Search • Learning • Information and Knowledge • Communication • Belief • Unawareness
Domain(s)	Humanities and Social Sciences/Economics and finance
ANR project(s)	Fondations comportementales et cognitives de la modélisation multi-agents (BECOA - ANR-11-FRJA-0002, CHORUS - 2011); Analyses comportementales et expérimentales en macro-finance (BEAM - ANR-15-ORAR-0004, ORAR - 2015); INITIATIVE D'EXCELLENCE AIX MARSEILLE UNIVERSITE (Amidex - ANR-11-IDEX-0001, IDEX - 2011); Idex UCA JEDI (UCA JEDI - ANR-15-IDEX-0001, IDEX - 2015)
Collaboration/Project	CODIREM
Funding	Australian Research Council (DP140102949)
DOI	10.1016/j.euroecorev.2018.01.009

Main file

Manuscript.pdf (3.84 MB)

Origin: Files produced by the author(s)

Contributor: Nobuyuki Hanaki

https://shs.hal.science/halshs-01723513

Submitted on: Monday, 5 March 2018 at 15:34:33

Last modified on: Monday, 18 March 2024 at 10:24:06

Long-term archiving on: Wednesday, 6 June 2018 at 15:53:24

Dates and versions

halshs-01723513, version 1 (05-03-2018)

Identifiers

HAL Id: halshs-01723513, version 1
DOI: 10.1016/j.euroecorev.2018.01.009

Cite

Nobuyuki Hanaki, Alan Kirman, Paul Pezanis-Christou. Observational and reinforcement pattern-learning: An exploratory study. European Economic Review, 2018, 104, pp. 1-21. ⟨10.1016/j.euroecorev.2018.01.009⟩. ⟨halshs-01723513⟩

Collections

CNRS UNIV-AMU EHESS INRA GREDEG UNIV-LORRAINE UNIV-COTEDAZUR IDEX-UNIV-COTEDAZUR SITE-ALSACE BETA AMIDEX INRAE ANR

366 views

597 downloads

Last updated on 20/04/2024