Collaborate
Central to this research is the ability for humans and robots to collaborate safely and effectively in shared physical endeavours. In this theme we focus on complex human robot interactions towards completing joint tasks. By design, our Flagships require us to address open issues in human-robot collaboration, including physical interaction, and how synergy in robot and human capabilities can be created and exploited towards a common goal. Drawing on all the capabilities developed in other themes, the focus here is the question of how to integrate uncertain or learned models of human states – including affect and trust – and robot actions into planning approaches capable of actively shaping interactions and providing formal guarantees over joint behaviour.
Rigter M, Lacerda B, & Hawes N
Conference on Neural Information Processing Systems (NeurIPS 2021)
In this work, we consider the problem of risk-averse decision-making in an unknown environment. We pose this problem as optimising the conditional value at risk (CVaR) of the total return in Bayes-adaptive Markov decision processes (MDPs). We show that a policy optimising CVaR in this setting is risk-averse to both the epistemic uncertainty due to the prior distribution over MDPs, and the aleatoric uncertainty due to the inherent stochasticity of MDPs.
![Reinforcementlearningimage reinforcementlearningimage](https://embodiedintelligence.web.ox.ac.uk/sites/default/files/styles/mt_image_medium/public/embodiedintelligence/images/media/reinforcementlearningimage.png?itok=hVqcKlrD)
We reformulate the problem as a two-player stochastic game and propose an approximate algorithm based on Monte Carlo tree search and Bayesian optimisation. Our experiments demonstrate that our approach significantly outperforms baseline approaches for this problem.
![Activeoperatormodelimage activeoperatormodelimage](https://embodiedintelligence.web.ox.ac.uk/sites/default/files/styles/mt_image_medium/public/embodiedintelligence/images/media/activeoperatormodelimage.png?itok=pJ0sjgza)