This paper delves into the impact of diverse scenarios regarding training data availability on the accuracy of machine learning methods employed for predicting PV production at the regional level. Specifically, we analyze methods including K-Nearest Neighbors, Support Vector Regression, Gradient Boosting, Kernel Ridge Regression, Random Forest, and an ensemble of these methods. Our main goal is to uncover the dynamics arising from varying data availability conditions, aiming to elucidate the strengths and limitations of each method under such circumstances. The findings contribute not only to theoretical comprehension but also provide practical insights for the effective application of these methods in real-world scenarios with differing levels of training data availability. Additionally, we demonstrate the capability and effectiveness of combining different methods to achieve improved and more resilient results in hourly power forecasting of PV production.

Analyzing the Impact of Training Data Availability on Machine Learning Models Accuracy for Regional Photovoltaic Production Forecast

Taheri N.;Tucci M.
2024-01-01

Abstract

This paper delves into the impact of diverse scenarios regarding training data availability on the accuracy of machine learning methods employed for predicting PV production at the regional level. Specifically, we analyze methods including K-Nearest Neighbors, Support Vector Regression, Gradient Boosting, Kernel Ridge Regression, Random Forest, and an ensemble of these methods. Our main goal is to uncover the dynamics arising from varying data availability conditions, aiming to elucidate the strengths and limitations of each method under such circumstances. The findings contribute not only to theoretical comprehension but also provide practical insights for the effective application of these methods in real-world scenarios with differing levels of training data availability. Additionally, we demonstrate the capability and effectiveness of combining different methods to achieve improved and more resilient results in hourly power forecasting of PV production.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11568/1296569
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact