Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos

Qiao, Tanqiu; Men, Qianhui; Li, Frederick W.B.; Kubotani, Yoshiki; Morishima, Shigeo; Shum, Hubert P.H.

doi:10.1007/978-3-031-19772-7_28

Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos

Qiao, Tanqiu; Men, Qianhui; Li, Frederick W.B.; Kubotani, Yoshiki; Morishima, Shigeo; Shum, Hubert P.H.

Authors

Tanqiu Qiao tanqiu.qiao@durham.ac.uk
PGR Student Doctor of Philosophy

Qianhui Men

Dr Frederick Li frederick.li@durham.ac.uk
Associate Professor

Yoshiki Kubotani

Shigeo Morishima

Dr Hubert Shum hubert.shum@durham.ac.uk
Associate Professor

Abstract

Human-Object Interaction (HOI) recognition in videos is important for analysing human activity. Most existing work focusing on visual features usually suffer from occlusion in the real-world scenarios. Such a problem will be further complicated when multiple people and objects are involved in HOIs. Consider that geometric features such as human pose and object position provide meaningful information to understand HOIs, we argue to combine the benefits of both visual and geometric features in HOI recognition, and propose a novel Two-level Geometric feature-informed Graph Convolutional Network (2G-GCN). The geometric-level graph models the interdependency between geometric features of humans and objects, while the fusion-level graph further fuses them with visual features of humans and objects. To demonstrate the novelty and effectiveness of our method in challenging scenarios, we propose a new multi-person HOI dataset (MPHOI-72). Extensive experiments on MPHOI-72 (multi-person HOI), CAD-120 (single-human HOI) and Bimanual Actions (two-hand HOI) datasets demonstrate our superior performance compared to state-of-the-arts.

Citation

Qiao, T., Men, Q., Li, F. W., Kubotani, Y., Morishima, S., & Shum, H. P. (2022). Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos. . https://doi.org/10.1007/978-3-031-19772-7_28

Conference Name	Computer Vision - ECCV 2022
Conference Location	Tel Aviv, Israel
Start Date	Oct 23, 2022
End Date	Oct 27, 2022
Acceptance Date	Jul 8, 2022
Online Publication Date	Oct 28, 2022
Publication Date	2022
Deposit Date	Jul 19, 2022
Publicly Available Date	Oct 29, 2023
Publisher	Springer Verlag
Pages	474-491
Series Title	Lecture Notes in Computer Science
Series ISSN	0302-9743
DOI	https://doi.org/10.1007/978-3-031-19772-7_28
Public URL	https://durham-repository.worktribe.com/output/1136540