Filling and validating rainfall data based on statistical techniques and artificial intelligence

  • Camila Bermond Ruezzene Escola de Engenharia de São Carlos. Departamento de Hidráulica e Saneamento. Universidade de São Paulo (USP), Avenida Trabalhador são-carlense, n° 400, CEP: 13566-590, São Carlos, SP, Brazil.
  • Renato Billia de Miranda Gerência de Cursos e Matrizes. Anhanguera Educacional, Alameda Maria Tereza, n° 4266, CEP: 13278-181, Valinhos, SP, Brazil.
  • Talyson de Melo Bolleli Escola de Engenharia de São Carlos. Departamento de Hidráulica e Saneamento. Universidade de São Paulo (USP), Avenida Trabalhador são-carlense, n° 400, CEP: 13566-590, São Carlos, SP, Brazil.
  • Frederico Fábio Mauad Escola de Engenharia de São Carlos. Departamento de Hidráulica e Saneamento. Universidade de São Paulo (USP), Avenida Trabalhador são-carlense, n° 400, CEP: 13566-590, São Carlos, SP, Brazil.

Abstract

The study of the hydric regime of rainfall helps in management analysis and decision-making in hydrographic basins, but a fundamental condition is the need for continuous time series of data. Therefore, this study compared gap filling methods in precipitation data and validated them using robust statistical techniques. Precipitation data from the municipality of Itirapina, which has four monitoring stations, were used. Four gap filling techniques were used, namely: normal ratio method, inverse distance weighting, multiple regression and artificial neural networks, in the period from 1979 to 1989. For validation and performance evaluation, the coefficient of determination (R²), mean absolute error (MAE), mean squared error (RMSE), Nash-Sutcliffe coefficient (Nash), agreement index (D), confidence index were used (C) and through non-parametric techniques with Mann-Witney and Kruskal-Wallis test. Excellent performances of real data were verified in comparison with estimated data, with values above 0.8 of the coefficient of determination (R²) and of Nash. Kruskal-Wallis and Mann-Whitney tests were not significant in Stations C1 and C2, demonstrating that there is a difference between real and estimated data and between the proposed methods. It was concluded that the multiple regression and neural network methods showed the best performance. From this study, efficient tools were found to fill the gap, thus promoting better management and operation of water resources.

Keywords: artificial neural networks, inverse distance weighting, multiple regression, normal ratio method.


Published
14/12/2021
Section
Papers