Skip to main navigation Skip to search Skip to main content

Frequency-aware spatio-temporal topology learning for skeleton-based human activity recognition

Yi Xia, Sira Yongchareon*, Raymond Lutui, Quan Z. Sheng

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Skeleton-based human activity recognition (HAR) has made significant progress through graph convolutional networks (GCNs) and Transformer architectures for spatiotemporal modeling. However, existing methods either employ predefined static graph topologies that cannot adapt to heterogeneous skeleton data or learn dynamic topologies based solely on local spatiotemporal features, thereby overlooking the global temporal frequency features of joint movements that are important for discovering semantically meaningful spatial relationships. We propose Frequency-Aware Topology Learning Graph Convolutional Network (FATL-GCN), a novel architecture that integrates frequency-aware temporal context to guide adaptive learning of spatial topology. Our approach leverages Time-to-Vector linear frequency encoding to capture both periodic and non-periodic motion patterns, employs frequency-guided topology learning to generate action-specific graphs through temporal-context-driven attention, and incorporates hierarchical multi-scale fusion for robust feature extraction across scales. Extensive experiments achieved top-1 accuracies of 93.8% (cross-subject) and 97.5% (cross-view) on NTU-60, 91.9% (cross-subject) and 93.1% (cross-setup) on NTU-120, and 51.7% on Kinetics-Skeleton. Ablation studies confirm the critical role of our components, with removing the dynamic graph topology causing a 3.5% accuracy drop and removing frequency-aware encoding causing a 2.1% drop.

Original languageEnglish
Article number113146
Pages (from-to)1-11
Number of pages11
JournalPattern Recognition
Volume175
DOIs
Publication statusPublished - Jul 2026

Bibliographical note

Copyright the Author(s) 2026. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Keywords

  • Skeleton-based activity recognition
  • Graph convolutional networks
  • Spatio-temporal modeling
  • Attention mechanisms
  • Dynamic topology
  • Frequency analysis

Fingerprint

Dive into the research topics of 'Frequency-aware spatio-temporal topology learning for skeleton-based human activity recognition'. Together they form a unique fingerprint.

Cite this