Robust counting in overcrowded scenes using batch-free normalized deep ConvNet

Sana Zahir, Rafi Ullah Khan, Mohib Ullah, Muhammad Ishaq, Naqqash Dilshad, Amin Ullah*, Mi Young Lee*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)
79 Downloads (Pure)

Abstract

The analysis of overcrowded areas is essential for flow monitoring, assembly control, and security. Crowd counting's primary goal is to calculate the population in a given region,which requires real-time analysis of congested scenes for prompt reactionary actions. The crowd is always unexpected, and the benchmarked available datasets have a lot of variation, which limits the trained models' performance on unseen test data. In this paper, we proposed an end-to-end deep neural network that takes an input image and generates a density map of a crowd scene. The proposed model consists of encoder and decoder networks comprising batch-free normalization layers known as evolving normalization (EvoNorm). This allows our network to be generalized for unseen data because EvoNorm is not using statistics from the training samples. The decoder network uses dilated 2D convolutional layers to provide large receptive fields and fewer parameters, which enables real-time processing and solves the density drift problem due to its large receptive field. Five benchmark datasets are used in this study to assess the proposed model, resulting in the conclusion that it outperforms conventional models.

Original languageEnglish
Pages (from-to)2741-2754
Number of pages14
JournalComputer Systems Science and Engineering
Volume46
Issue number3
DOIs
Publication statusPublished - 2023
Externally publishedYes

Bibliographical note

Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Keywords

  • Artificial intelligence
  • deep learning
  • crowd counting
  • scene understanding

Fingerprint

Dive into the research topics of 'Robust counting in overcrowded scenes using batch-free normalized deep ConvNet'. Together they form a unique fingerprint.

Cite this