Skip to main navigation Skip to search Skip to main content

αPOMDP: POMDP-based user-adaptive decision-making for social robots

  • Gonçalo S. Martins
  • , Hend Al Tair
  • , Luís Santos
  • , Jorge Dias
  • University of Coimbra, Institute of Systems and Robotics

Research output: Contribution to journalArticlepeer-review

19 Scopus citations

Abstract

In this work we present αPOMDP: a User-Adaptive Decision-Making technique for social robots. This technique is based on the classical POMDP formulation which we extend with novel aspects inspired by Reward Shaping and Model-Based Reinforcement Learning. Our technique innovates in two main ways: by applying a novel set of rewarding schemes based on the state of the user and by employing a novel execution loop that enables the system to learn the impact of its actions on the user on-the-fly. Our technique has been tested with multiple POMDP solvers and reward formulations in simulations and with real users through the GrowMu social robot. Results show that our technique is able to correctly decide which actions to take, maintaining the user in positive states which interacting with the robot and methodically exploring and learning their characteristics, activities and behaviors.

Original languageBritish English
Pages (from-to)94-103
Number of pages10
JournalPattern Recognition Letters
Volume118
DOIs
StatePublished - 1 Feb 2019

Keywords

  • Automated planning
  • Decision making
  • Machine learning
  • POMDPs
  • Social robots

Fingerprint

Dive into the research topics of 'αPOMDP: POMDP-based user-adaptive decision-making for social robots'. Together they form a unique fingerprint.

Cite this