Self-supervised learning for multilevel skeleton-based forgery detection via temporal-causal consistency of actions

Liang Hu*, Dora D. Liu, Qi Zhang*, Usman Naseem, Zhong Yuan Lai

*Corresponding author for this work

Research output: Contribution to journalConference paperpeer-review

Abstract

Skeleton-based human action recognition and analysis have become increasingly attainable in many areas, such as security surveillance and anomaly detection. Given the prevalence of skeleton-based applications, tampering attacks on human skeletal features have emerged very recently. In particular, checking the temporal inconsistency and/or incoherence (TII) in the skeletal sequence of human action is a principle of forgery detection. To this end, we propose an approach to self-supervised learning of the temporal causality behind human action, which can effectively check TII in skeletal sequences. Especially, we design a multilevel skeleton-based forgery detection framework to recognize the forgery on frame level, clip level, and action level in terms of learning the corresponding temporal-causal skeleton representations for each level. Specifically, a hierarchical graph convolution network architecture is designed to learn low-level skeleton representations based on physical skeleton connections and high-level action representations based on temporal-causal dependencies for specific actions. Extensive experiments consistently show state-of-the-art results on multilevel forgery detection tasks and superior performance of our framework compared to current competing methods.

Original languageEnglish
Pages (from-to)844-853
Number of pages10
JournalProceedings of the AAAI Conference on Artificial Intelligence
Volume37
Issue number1
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event37th AAAI Conference on Artificial Intelligence, AAAI 2023 - Washington, United States
Duration: 7 Feb 202314 Feb 2023

Fingerprint

Dive into the research topics of 'Self-supervised learning for multilevel skeleton-based forgery detection via temporal-causal consistency of actions'. Together they form a unique fingerprint.

Cite this