Development of Trees Management System Using Radial Basis Function Neural Network for Rain Forecast ()
1. Introduction
Trees generally play a critical role across numerous significant aspects in human’s livelihood. In the metropolitan areas, trees can be advantageous in creating an environment that is highly systematic as they contribute to the large production of oxygen as well as the reduction of gaseous pollutants, hence improving the general air quality. However, it is also important to note that, tree planting in the cities requires excellent planning which includes the execution of a well-planned and highly efficient tree management system by the local municipalities. Successful tree growth depends on the ability to manage such factors as pests, diseases, water, fertilizer requirements, etc. Even though the data stemming from the monitoring of tree management activities could only be tracked once trees have been planted but there are some studies which have demonstrated that the data stemming would affected by the tree development [1]. Nevertheless, this is not an overnight process and intensive sources with higher number of datasets are frequently difficult to find and access.
Quantitative forecasting for weather element such as rainfall, humidity, temperature, and others is valuable in agricultural areas [2]. This is because, factoring in the weather element in the tree-management system or planning could substantially drive down the operational costs associated to it. Better scheduling of activities pertaining to the management of trees planted can also be derived if weather forecast is being taken into consideration as an important part of the planning process. This, therefore, has demonstrated that there is a need to innovate the existing system as to incorporate the weather element due to Malaysia’s climatic conditions.
Weather forecast is not only crucial in the urban planning to mitigate the risks stemming from weather that include floods and landslides but also in the general management of trees at the local recreational facilities and parks. Weather forecasting is a challenging activity. The challenging nature of weather forecasting can be derived from the complex atmospheric processes that are usually not presented via a perfectly linear correlation. Comparing the traditional and numerical methods, ANN is found to be the best approach for the weather prediction [3]. According to the research by [4] that compares linear and non-linear technique for rainfall prediction, ANN models are having higher correlations compared to the linear models, indicating better generalization capacity of the non-linear models in predicting by producing lower error and fitting better with the measured data.
Weather forecasting can be done by leveraging on the Numerical Weather Model (NWP) a statistical and Machine Learning-based model [4]. This statement being proved by the studies in [5], shows that the machine learning based model give a better prediction than the linear regression algorithm to predict rainfall. An artificial neural network computational tool is strong and data driven. Its characteristic is self-adaptive, flexible which has the capacity to learn and handle nonlinear and tough underlying characteristics of any physical process with high grade of accuracy [2]. Thanks to its ability in providing a better solution to complex problems that significantly reduce noise and optimize approximations, ANN is getting more attention from a lot of scholars in their attempts to unravel the inner workings of weather forecasting process [6]. Traditional methodologies are often found to encounter constant challenges in addressing non-linear physical, spatial and temporal processes as well as the underlying uncertainties that exist in certain parameters used. The attributes of the ANN model have therefore made it clear about its suitability to be deployed in studies pertaining to weather forecasting.
ANN consists of several techniques such as the feedforward neural networks, backpropagation neural networks and radial basis function neural network (RBFNN). RBFNN structure is similar to typical ANN structure, that consists of one hidden layer that contain three main parameter that is output weights, widths and centered [7]. [8] highlights that RBFNN is the most suitable machine learning technique to be deployed in the study that scrutinizes rainfall forecast in comparison to other techniques including Generic Programming, Support Vector Regression, M5-Rules, M5-Model trees and k-Nearest Neighbor. This argument has been further resonated by [9], in which they conclude that RBFNN provides higher accuracy of results in rainfall forecast as compared to the backpropagation neural network technique. Besides, another study conducted by [10] finds out that the use of multilayer perceptron is suitable for in Atner while RBFNN in Dharni RBFNN.
Thus, we have proposed a model of weather forecasting using Radial Basis Function Neural Networks (ANN) order to predict weather in a very effective and efficient way and to enhance the rainfall prediction in Malaysia by utilizing the RBFNN model. Other than its proven ability to provide a better rainfall prediction, this model also employs a better algorithm that is far more simplistic and with a faster learning capability. Rainfall prediction relies heavily on important weather-related parameters that include air pressure, temperature and wind speed that should certainly be considered in the development of a sound algorithm in rainfall prediction [11]. Therefore, an analysis has been conducted across a few meteorology datasets as data input centered around the important parameters such as temperature, windspeed, humidity as well as air pressure.
2. Data and Research Methodology
2.1. Rainfall Data
Having located near the earth’s equator, Malaysia is one of the countries that is blessed with a tropical climate which implies that it is hot and humid throughout the year. The location chosen for this study is Klang, Selangor and the daily rainfall data for the trend analysis used for this study is sourced directly from the Malaysia’s Department of Irrigation and Drainage with data ranging over the period of 6 years (from 1 January 2015 until 31 December 2020) recorded using the 400 cm2 Casella rain gauge. Three meteorological data parameters have been incorporated to enhance this study; temperature (˚C), humidity (%), air pressure (mbar) dan wind speed (km/hour). For this study specifically, 70% of the daily rainfall data collected is used for training data, while 15% for validation data and the last 15% rainfall data are used during testing process.
2.2. Model Performance Technique
The performance of a rainfall forecasting system is analyzed via two statistical analyses which are Root Mean Square Error (RMSE) as well as accuracy percentage. These statistical methods outline a comparison between the system rainfall prediction values with the measured rainfall data value. A lower error value indicates a better performance of rainfall forecast system. (ε) represents the error percentage. The accuracy percentage and the Root Mean Square Error (RMSE) are calculated via the following formula:
(1)
(2)
(3)
2.3. Radial Basis Function Neural Network
The concept of Artificial Neural Network is a network composed by a collection of processing units that are computer-programmed based on the working of human brain [12]. The human brain is a complex system capable of processing a large amount of information at a time. The neural network is a processor that is made up of artificial neurons as the main the processing element [13]. The application of the neural network has been extensive in the areas of pattern classification, grouping, prediction and optimization among many others. Rainfall is one of the natural phenomena with non-linear attributes, thus, requiring a non-statistical method that is far more complex as to understand the rainfall behavioral patterns just like how the Artificial Neural Network does [13].
RBFNN was first introduced by Broomhead dan Lowe in the year 1988 [14]. This paper use the algorithm of Radial Basis Function Network (RBFN) because RBFN is one of the neural networks with capability to predict non-linear attributes and with faster learning speed. RBFN is also a particularly distinctive artificial neural network as it leverages on the basis function network as the active function. The application of RBFN is used widely particularly for function approximation, time series forecasting and classification. RBFN model that comprises of three layers, the input layer, the hidden layer and the output layer is shown in Figure 1 [15].
Every layer is made up of nodes that connected one another as illustrated by Figure 1. As demonstrated in Figure 1, every neuron is also connected with one another. This information will later enter the hidden layer where the learning process shall commence. This directly implies that the calculations are performed so that system learns the patterns by manipulating the weighted value until it matches the output value. Radial basis function is a function only depends on value of distance from the origin. Thus, the limitation of radial basis function formula must contain only real values. The transformation associated to every node located within the hidden layer using the gaussian function which is derived by the following formula:
(4)
Figure 1. Radial basis function network (RBFN) modelling.
(5)
where zj(x) is the output from the hidden j node and Bias node. z0(x) has a fixed value which is 1. x is an input with n-dimension. σj is the width of receptive field for the hidden j node, μj is the centre of the hidden j node and dj is the Euclidean distance between x and μj. On the output, all nodes are connected with one another. The output value, y(x) RBFN is:
(6)
where yl is output node l and wjl is the weight that has a relationship with the hidden j node and the output l node.
3. Results and Discussion
In this study, statistical analysis that has been used is root mean square error (RMSE) and the percentage of accuracy. The optimal RBFNN structure is determined by manipulating the hidden neuron value between 4 and 99. The RMSE values between the predicted values and measured values for the RNFNN model are illustrated in Table 1. This study discovers that the hidden neuron value has less affect on the performance of the RMSE value which is between 49.92% up to 51.10%. However, in term of the percentage of accuracy, the hidden node value plays a significant role, in which, it leads to higher percentage of accuracy from the hidden nodes between 61% up to 99%. This, therefore, has demonstrated that more hidden nodes will contribute to a better analysis. Nevertheless, it is essential to note that the increase in the number of nodes will add another layer of complexity for a neural network model. Hence, optimum selection of hidden nodes is one of the most important steps that should be taken during a neural network modelling process.
Table 1. The preliminary performance of radial basis function neural network for hidden nodes.
The learning rate that influences the performance of the system is also being factored in. The rate of learning is between 0.001 to 0.1. Table 2 outlines the RMSE as well as the percentage of accuracy. Based on Table 3, even though the rate of learning of 0.1 contributes to a better percentage of accuracy (56.74%), the RMSE value has proven to be higher (63.26%). The rate of learning of 0.1 gives a lower value of RMSE which is 49.92%. Therefore, the RBFNN with the hidden node of 61 and the rate of learning of 0.01 is used during the training, validation as well as testing processes. The optimal weighted value that has been trained by the system will then be used for the testing process.
The second objective of this study is to analyze the influence of the selected data input variables on the performance of the RBFNN model in the rainfall forecasting. [12] highlights that the selection of suitable data input plays a critical role in reducing the time required to train the RBFNN model while improving the overall accuracy of the results. Figures 2(a)-(c) illustrate the relationship patterns between the cumulative rainfall data with parameters such as temperature, humidity as well as air pressure. Figure 2(a) demonstrates that the cumulative daily rainfall in Malaysia is less than 60 mm and the rainfall data shows higher concentration between 30˚C and 33˚C in temperature. In terms of humidity, as shown in Figure 2(b), the cumulative rainfall data shows a comprehensible pattern when the humidity is between 40% to 90% for daily rainfall of less than 60 mm. When illustrating the relationship of cumulative daily rainfall against air pressure, Figure 2(c) proves that the daily rainfall and air pressure are correlated to each other ranging from 1007 mbar up to 1013 mbar for cumulative daily rainfall less than 40 mm.
Second evaluation of the meteorological data revolves around the assessment of the performance of the data input of the network model. A lower value of RMSE indicates a better rainfall forecast, this has been demonstrated by the data outlined in Table 3 where a lower RMSE was recorded when parameters such as temperature, humidity and air pressure were introduced. The percentage of accuracy also increases by 5% thus, has further proven the importance of draw the
Table 2. The radial basis function neural network model performance for learning rate.
Table 3. Meteorological data influence on the radial basis function neural network model.
(a)(b)(c)
Figure 2. (a) Relationship pattern between temperature (˚C) and daily cummulative rainfall (mm); (b) Relationship Pattern between Humidity (%) and cumulative daily rainfall (mm); (c) relationship pattern between air pressure (mbar) and cumulative daily rainfall (mm).
relationships between other parameters in the underlying meteorological data with the rainfall forecast. This study also paves way to future studies to explore other meteorological data parameters that could potentially influence the accuracy of rainfall forecasting such as sea level, minimum as well as maximum temperature similar to a study conducted by [12].
4. Conclusion
Based on the radial basis function neural network scheme, this paper conducted the performance of weather parameters analysis that affects the performance of rain forecasting system. Based on this study, it has been made explicit that the higher value of hidden nodes results shows a better performance of the rainfall forecasting system, but it will add more complexity to the neural network model and to the training process involved. Plus, the overall outcome outlines that the selection of optimal hyperparameters of the RBFNN model as well as accurate meteorological data had impact on the performance of the RBFNN model. The accuracy and the performance of the system can be enhanced further by adding in more factors in subsequent studies. This study will be useful in the application of tree management system as it incorporates the essential weather element.