Quentin Labourey - Multimodal fusions for human detection by a mobile robot

Organized by: 
Quentin Labourey
Quentin Labourey
Detailed information: 

Venue : at the Images and Signal Department of GIPSA-lab, Ampère site, Building D, Room Mont-Blanc. The adress is 11 rue des Mathématiques, 38402 Saint-Martin d'Hères.

Members of the Jury :

  • Véronique BERGE-CHERFAOUI - Maître de conférences - Heudiasyc - Rapporteur
  • Laurent TRASSOUDAINE - Professeur - Institut Pascal - Rapporteur
  • Simon LACROIX - Directeur de recherche - LAAS - Examinateur
  • Didier COQUIN - Professeur - LISTIC - Examinateur
  • Olivier AYCARD - Maitre de conférences - LIG - Directeur de thèse
  • Denis PELLERIN - Professeur - GIPSA-lab - Co-directeur de thèse



In this work, we consider the case of mobile robot that aims at detecting and positioning itself with respect to humans in its environment. In order to fulfill this mission, the robot is equipped with various sensors (RGB-Depth, microphones, laser telemeter). This thesis contains contributions of various natures :

- Sound classification in indoor environments : A small taxonomy is proposed in a classification method destined to enable a robot to detect human presence. Uncertainty of classification is taken into account through the use of belief functions, allowing us to label a sound as "unknown".

- Speaker tracking thanks to audiovisual data fusion : The robot is witness to a social interaction and tracks the successive speakers with probabilistic audiovisual data fusion. The proposed method was tested on videos extracted from the robot’s sensors.

- Navigation dedicated to human detection thanks to a multimodal fusion : The robot autonomously navigates in a known environment to detect humans thanks to heterogeneous sensors. The data is fused to create a multimodal perception grid. This grid enables the robot to chose its destinations, depending on the priority of perceived information. This system was implemented and tested on a Q.bo robot.

- Credibilist modelization of the environment for navigation : The creation of the multimodal perception grid is improved by the use of credibilist fusion. This enables the robot to maintain an evidential grid in time, containing the perceived information and its uncertainty. This system was implemented in simulation first, and then on a Q.bo robot.