The 2nd International Workshop on Human-centric Multimedia Analysis
20-24 October 2021
Cheng Du, China
View on ACM MM 2021


2021/10/14: The full technical program is realeased. Link
2021/10/14: Our workshop will be held in Langbo Room B, The Langbo Chengdu, in The Unbound Collection by Hyatt. Online Zoom Meeting Room ID: 864 1291 4191
2021/10/10: We invited Prof. Yihong Gong, Prof. Junwei Han, and Dr. Chuang Gan as Keynote Speaker. Link
2021/08/03: Excellent papers will be recommended to ACM TOMM Special Issue
2021/08/03: The Workshop paper submission deadline is extended to Aug. 17, 2021 Link
2021/05/15: The Workshop paper submission deadline is Aug. 10, 2021 Link
2021/05/15: Paper submission details is available Link
2021/03/26: The website is available


Human-centric multimedia analysis is one of the fundamental problems in multimedia understanding. It is a very challenging problem that involves multiple tasks such as face detection and recognition, human pose estimation, human action detection, human-object interaction, person tracking, person re-identification, and so on. Today, ubiquitous multimedia sensors and large-scale computing infrastructures are producing at a rapid velocity a wide variety of big multi-modality data for human-centric analysis, which provides rich knowledge to tackle these challenges. Researchers have strived to push the limits of human-centric multimedia analysis in various applications, such as intelligent surveillance, retailing, fashion design, and services. Therefore, the purpose of this workshop is to: 1) bring together the state-of-the-art research on human-centric multimedia analysis; 2) call for a coordinated effort to understand the opportunities and challenges emerging in human-centric multimedia analysis; 3) identify key tasks and evaluate the state-of-the-art methods; 4) showcase innovative methodologies and ideas; 5) introduce interesting real-world human-centric multimedia analysis systems or applications; and 6) propose new real-world datasets and discuss future directions. We solicit original contributions in all fields of human-centric multimedia analysis that explore the multi-modality data to understand the behavior of humans. We believe this workshop will offer a timely collection of research updates to benefit researchers and practitioners in the broad multimedia communities. To this end, we solicit original research and survey papers in (but not limited to) the following topics:

  • Face detection, recognition, face anti-spoofing, face landmark detection and parsing.
  • Human detection, pose estimation, human parsing, and pose tracking.
  • Human 3D shape estimation and reconstruction.
  • Human gait recognition, person re-identification and person tracking.
  • Human action recognition and detection
  • Human activity recognition using non-visual sensors
  • Huma-computer interaction / Human object interaction
  • Multimedia event detection
  • Anomaly event detection
  • Human crowd analysis

Keynote Speakers

Prof. Yihong Gong

Xi’an Jiaotong University

Prof. Junwei Han

Northwestern Polytechnical University

Dr. Chuang Gan

MIT-IBM Watson AI Lab

More information


Wu Liu

JD AI Research, Beijing, China

Xinchen Liu

JD AI Research, Beijing, China

Jingkuan Song

University of Electronic Science and Technology of China

Dingwen Zhang

Northwestern Polytechnical University

Wenbing Huang

Tsinghua University

Junbo Guo

State Key Laboratory of Communication Content Cognition, People’s Daily Online

John Smith

IBM Research

If you have any questions, feel free to contact liuxinchen1 [at] jd [dot] com

More information