TY - GEN
T1 - Image data augmentation and convolutional feature map visualizations in computer vision applications
AU - Lotfi, Fariba
AU - Tohidian, Fatemeh
AU - Jamzad, Mansour
AU - Beigy, Hamid
PY - 2023
Y1 - 2023
N2 - Deep neural networks (DNNs) perform exceptionally well in many vision tasks, including image classification, annotation, and object recognition. However, these networks are like a black box, and high-quality training datasets are required for deep learning models to achieve high performance. Due to the high cost of collecting a vast number of data samples, data augmentation techniques have been employed in many vision applications. Data augmentation aims to increase the dataset size without collecting new data while introducing variability. One of the means of augmenting the image data is by employing image transformations such as flipping, clipping, or rotation. Activation maps, also known as feature maps, illustrate how the filters are applied to the input image. The objective of visualizing a feature map for an input image is to comprehend what input features are captured in the feature maps. In this paper, we apply various transformations on images and investigate their effect on the multiple convolutional layers (at low, middle, and high levels) by employing intermediate feature map visualizations. We use the famous deep learning-based pre-trained network, VGG-16. Finally, we compare the visualization results of the image transformations at multiple levels and analyze their differences to evaluate the validity of these networks.
AB - Deep neural networks (DNNs) perform exceptionally well in many vision tasks, including image classification, annotation, and object recognition. However, these networks are like a black box, and high-quality training datasets are required for deep learning models to achieve high performance. Due to the high cost of collecting a vast number of data samples, data augmentation techniques have been employed in many vision applications. Data augmentation aims to increase the dataset size without collecting new data while introducing variability. One of the means of augmenting the image data is by employing image transformations such as flipping, clipping, or rotation. Activation maps, also known as feature maps, illustrate how the filters are applied to the input image. The objective of visualizing a feature map for an input image is to comprehend what input features are captured in the feature maps. In this paper, we apply various transformations on images and investigate their effect on the multiple convolutional layers (at low, middle, and high levels) by employing intermediate feature map visualizations. We use the famous deep learning-based pre-trained network, VGG-16. Finally, we compare the visualization results of the image transformations at multiple levels and analyze their differences to evaluate the validity of these networks.
KW - Deep neural networks
KW - Data augmentation
KW - Image transformations
KW - Activation maps
KW - Feature maps
KW - Convolutional layers
KW - Visualizations
UR - https://www.scopus.com/pages/publications/85151051459
U2 - 10.1007/978-3-031-26507-5_2
DO - 10.1007/978-3-031-26507-5_2
M3 - Conference proceeding contribution
AN - SCOPUS:85151051459
SN - 9783031265068
T3 - Lecture Notes in Computer Science
SP - 15
EP - 26
BT - Service-Oriented Computing – ICSOC 2022 Workshops
A2 - Troya, Javier
A2 - Mirandola, Raffaela
A2 - Navarro, Elena
A2 - Delgado, Andrea
A2 - Segura, Sergio
A2 - Ortiz, Guadalupe
A2 - Pautasso, Cesare
A2 - Zirpins, Christian
A2 - Fernández, Pablo
A2 - Ruiz-Cortés, Antonio
PB - Springer, Springer Nature
CY - Cham
T2 - Workshops on ASOCA, AI-PA, FMCIoT, WESOACS 2022, held in Conjunction with the 20th International Conference on Service-Oriented Computing, ICSOC 2022
Y2 - 29 November 2022 through 2 December 2022
ER -