SimFuPulse: a self-similarity supervised model for remote photoplethysmography extraction from facial videos

Hanguang Xiao*, Zhipeng Li, Ziyi Xia, Tianqi Liu, Feizhong Zhou, Alberto Avolio

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Remote Photoplethysmography (rPPG) is a non-contact technique for extracting physiological signals using facial videos, exhibiting broad application prospects in fields such as anti-spoofing face recognition, healthcare, and affective computing. However, extracting rPPG signals from facial video sequences encounters challenges due to subtle color variations and noise interference. Additionally, the presence of phase offset between ground truth and facial videos further complicates this endeavor. To address the issues of weak signals, strong noise, and phase offset, we propose a self-similarity supervised learning approach, named SimFuPulse, to mitigate noise and enhance rPPG representation by fusing original and differential video frames. By employing a 3D convolutional network (ResPhys) with an encoder–decoder architecture, enhanced spatiotemporal features are modeled to extract reliable rPPG signals. Moreover, a self-similarity mechanism is devised to mitigate the impact of phase offset on model training. The proposed method demonstrates superior accuracy over current state-of-the-art approaches across three publicly available datasets.

Original languageEnglish
Article number106736
Pages (from-to)1-13
Number of pages13
JournalBiomedical Signal Processing and Control
Volume98
DOIs
Publication statusPublished - Dec 2024

Keywords

  • Face video
  • Heart rate
  • Remote photoplethysmography
  • Supervised learning

Cite this