Convolutional 3D attention network for video based freezing of gait recognition

Renfei Sun, Zhiyong Wang, Kaylena Ehgoetz Martens, Simon Lewis

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

25 Citations (Scopus)

Abstract

Freezing of gait (FoG) is defined as a brief, episodic absence or marked reduction of forward progression of the feet despite the intention to walk. It is a typical symptom of Parkinson's disease (PD) and has a significant impact on the life quality of PD patients. Generally trained experts need to review the gait of a patient for clinical diagnosis, which is time consuming and subjective. Nowadays, automatic FoG identification from videos provides a promising solution to address these issues by formulating FoG identification as a human action recognition task. However, most existing human action recognition algorithms are limited in this task as FoG is very subtle and can be easily overlooked when being interfered with by irrelevant motion. In this paper, we propose a novel action recognition algorithm, namely convolutional 3D attention network (C3DAN), to address this issue by learning an informative region for more effective recognition. The network consists of two main parts: Spatial Attention Network (SAN) and 3-dimensional convolutional network (C3D). SAN aims to generate an attention region from coarse to fine, while C3D extracts discriminative features. Our proposed approach is able to localize attention region without manual annotation and to extract discriminative features in an end-to-end way. We evaluate our proposed C3DAN method on a video dataset collected from 45 PD patients in a clinical setting for the quantification of FoG in PD. We obtained sensitivity of 68.2%, specificity of 80.8% and accuracy of 79.3%, which outperformed several state-of-the-art human action recognition methods. To the best of our knowledge, our work is one of the first studies detecting FoG from clinical videos.

Original languageEnglish
Title of host publication2018 International Conference on Digital Image Computing
Subtitle of host publicationTechniques and Applications (DICTA)
EditorsMark Pickering, Lihong Zheng, Shaodi You, Ashfaqur Rahman, Manzur Murshed, Md Asikuzzaman, Ambarish Natu, Antonio Robles-Kelly, Manoranjan Paul
Place of PublicationUnited States
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Number of pages7
ISBN (Electronic)9781538666029
DOIs
Publication statusPublished - 16 Jan 2019
Externally publishedYes
Event2018 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2018 - Canberra, Australia
Duration: 10 Dec 201813 Dec 2018

Conference

Conference2018 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2018
Country/TerritoryAustralia
CityCanberra
Period10/12/1813/12/18

Fingerprint

Dive into the research topics of 'Convolutional 3D attention network for video based freezing of gait recognition'. Together they form a unique fingerprint.

Cite this