Resizing images is a critical pre-processing step in computer vision. Principally, deep learning models train faster on small images. A larger input image requires the neural network to learn from four times as many pixels, and this increase the training time for the architecture. In this work, we presented the evolution of effects of image resizing on model training time and performance. This study is applied on a vehicle dataset. We used You Look Only Once based architectures which include YOLOv2, YOLOv3, YOLOv4, and YOLOv5 with pretrained models to perform object detection. YOLO is designed to detect objects with high accuracy and high speed, which is an advent for real-time applications. Data augmentation method is used in this research to reduce overfitting problems, which approximates the data probability by manipulating the input samples. The experimental results show that if the input image size varies, then it has effects on the training time of the CNN based images classification. Additionally, this research reviewed image resizing and its impacts on the models’ performance in terms of accuracy, precision, and recall.

Impact of Image Resizing on Deep Learning Detectors for Training Time and Model Performance

Saponara S.
Co-primo
;
Elhanashi A.
Co-primo
2022-01-01

Abstract

Resizing images is a critical pre-processing step in computer vision. Principally, deep learning models train faster on small images. A larger input image requires the neural network to learn from four times as many pixels, and this increase the training time for the architecture. In this work, we presented the evolution of effects of image resizing on model training time and performance. This study is applied on a vehicle dataset. We used You Look Only Once based architectures which include YOLOv2, YOLOv3, YOLOv4, and YOLOv5 with pretrained models to perform object detection. YOLO is designed to detect objects with high accuracy and high speed, which is an advent for real-time applications. Data augmentation method is used in this research to reduce overfitting problems, which approximates the data probability by manipulating the input samples. The experimental results show that if the input image size varies, then it has effects on the training time of the CNN based images classification. Additionally, this research reviewed image resizing and its impacts on the models’ performance in terms of accuracy, precision, and recall.
2022
978-3-030-95497-0
978-3-030-95498-7
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11568/1143052
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? ND
social impact