AIP Publishing Logo

  • Previous Article
  • Next Article

Rainfall prediction using machine learning techniques

  • Split-Screen
  • Article contents
  • Figures & tables
  • Supplementary Data
  • Peer Review
  • Open the PDF for in another window
  • Reprints and Permissions
  • Cite Icon Cite
  • Search Site

S. L. Jany Shabu , J. Refonaa , D. Devi , D. Aishwarya , K. Krishna Babu , K. Purshotham Reddy; Rainfall prediction using machine learning techniques. AIP Conf. Proc. 17 May 2024; 2850 (1): 030004. https://doi.org/10.1063/5.0208435

Download citation file:

  • Ris (Zotero)
  • Reference Manager

India is a farming nation and its economy is to a great extent dependent on rainforest creation. Downpour estimates are vital and fundamental for all ranchers to examine crop yields. Unsurprising rainfall is the capacity to foresee the climate with the assistance of science and innovation. It is essential to know how much rainfall to utilize water assets, horticultural creation and water arranging proficiently. Various strategies for information mining can foresee rainfall. Information extraction is utilized to appraise rainfall. This article features probably the most well-known rainfall forecast calculations. Guileless Bayes, K-Near Neighbour Algorithm, and Certificate Tree are a portion of the calculations contrasted with this record. According to a relative perspective, it is feasible to break down how rainfall is accurately anticipated .

Citing articles via

Publish with us - request a quote.

rainfall prediction using machine learning thesis

Sign up for alerts

  • Online ISSN 1551-7616
  • Print ISSN 0094-243X
  • For Researchers
  • For Librarians
  • For Advertisers
  • Our Publishing Partners  
  • Physics Today
  • Conference Proceedings
  • Special Topics

pubs.aip.org

  • Privacy Policy
  • Terms of Use

Connect with AIP Publishing

This feature is available to subscribers only.

Sign In or Create an Account

  • Open access
  • Published: 07 December 2021

Machine learning techniques to predict daily rainfall amount

  • Chalachew Muluken Liyew   ORCID: orcid.org/0000-0003-4031-8032 1 &
  • Haileyesus Amsaya Melese 1  

Journal of Big Data volume  8 , Article number:  153 ( 2021 ) Cite this article

31k Accesses

55 Citations

7 Altmetric

Metrics details

Predicting the amount of daily rainfall improves agricultural productivity and secures food and water supply to keep citizens healthy. To predict rainfall, several types of research have been conducted using data mining and machine learning techniques of different countries’ environmental datasets. An erratic rainfall distribution in the country affects the agriculture on which the economy of the country depends on. Wise use of rainfall water should be planned and practiced in the country to minimize the problem of the drought and flood occurred in the country. The main objective of this study is to identify the relevant atmospheric features that cause rainfall and predict the intensity of daily rainfall using machine learning techniques. The Pearson correlation technique was used to select relevant environmental variables which were used as an input for the machine learning model. The dataset was collected from the local meteorological office at Bahir Dar City, Ethiopia to measure the performance of three machine learning techniques (Multivariate Linear Regression, Random Forest, and Extreme Gradient Boost). Root mean squared error and Mean absolute Error methods were used to measure the performance of the machine learning model. The result of the study revealed that the Extreme Gradient Boosting machine learning algorithm performed better than others.

Introduction

Based on the distribution of rainfall in Ethiopia, three distinct seasons are identified which are Belg, Kiremt and Bega. According to Ehsan et al. [ 1 ] three seasons are; the ‘short’ rains (belg: February–May), followed by the long rains (kiremt: June–September) and the dry season (Bega: October–January). Kiremt is the main Ethiopian rainy season, and Ethiopia receives a substantial fraction of its annual rainfall during this season, which is very important for its water resources management and agriculture production. The northwestern part of the country at which this research is conducted experiences higher rainfall amounts from June to September that send a flood into the Blue Nile. Droughts and floods have been a major and persistent challenge of the management of water resources, agroeconomic, livestock growth, and food production in Ethiopia. To use the rainfall water efficiently, rainfall prediction is unquestionable research area in Ethiopia.

Rainfall prediction is crucial for increasing agricultural productivity which in turn secures food and quality water supply for citizens of one's country. The scarcity of rainfall has a negative influence on the aquatic ecosystem, quality water supply, and agricultural productivity. Agriculture and water quality depend on the rainfall and water amount on a daily and annual basis [ 2 , 3 , 4 ]. Therefore, accurate prediction of daily rainfall is a challenging task to manage the rainfall water for agriculture and water supply.

Various researchers conducted studies to improve the prediction of daily, monthly and annual rainfall amounts using different countries' meteorology data. Researchers applied data mining techniques [ 2 , 3 , 5 , 6 ] Big Data analysis [ 4 , 7 ], and different machine learning algorithms [ 8 , 9 , 10 , 11 ] to improve the accuracy of daily, monthly and annual rainfall prediction. According to the results of the studies, the prediction process is now shifted from data mining techniques to machine learning techniques. Scholars, for example [ 4 ], confirmed that machine learning algorithms are proved to be better replacing the traditional deterministic method to predict the weather and rainfall. Consequently, this paper analyzed different machine learning algorithms to identify the better machine learning algorithms for accurate rainfall prediction.

Several environmental factors affect the existence of rainfall and its intensity. The temperature, relative humidity, sunshine, pressure, evaporation, etc. are some of the factors that affect the existence of rainfall and its intensity directly or indirectly. The study conducted by Chaudhari and Choudhari [ 12 ] indicated that temperature, wind, and cyclone were important features of the atmosphere over the Indian region to predict rainfall, however, the study did not measure the correlations of each feature to determine the strength of the independent features on the rainfall. On the other hand, a correlation study by Thirumalai et al. [ 13 ] identified the most important features like solar radiation, perceptible water vapor, and diurnal features for rainfall prediction using a linear regression model. Whereas, scholars (for example, [ 10 , 11 , 14 ]) used atmospheric features of temperature, relative humidity, pressure, and wind speed as an important feature to predict rainfall accurately using machine learning such as Artificial Neural Network, Random forest, and multiple linear regression model respectively. Hence, important atmospheric features that have a direct or indirect impact on rainfall should be studied to predict the existence and the intensity of rainfall.

Therefore, this study aimed to identify the relevant atmospheric features that cause rainfall and predict the intensity of daily rainfall using machine learning techniques. The raw data is collected from regional meteorology and preprocessed to make it suitable for the experiment. Each feature of the preprocessed data is correlated with the rainfall variable to identify the relevant features using Pearson correlation. The study then experimented the Radnom forest (RF), MLR and XGBoost machine learning algorithms. The MAE and RMSE values of the XGBoost gradient descent algorithms were 3.58 and 7.85 respectively so that The XGBoost algorithm predicted the rainfall using relevant selected environmental features better than the RF and the MLR.

Related work

The machine learning algorithm called linear regression is used for predicting the rainfall using important atmospheric features by describing the relationship between atmospheric variables that affect the rainfall [ 13 , 15 ]. The correlation study is conducted [ 7 ], and identified solar radiation, perceptible water vapor, and diurnal features are important variables for daily rainfall prediction using a data-driven machine learning algorithm. The future work identified by Manandhar et al. [ 7 ] is studying the impact of using different atmospheric features using a larger data set. The researches address the relationship between independent and dependent features to identify which features impact the rainfall to rain or not to rain. The amount of daily rainfall was not found or addressed in this research,it may reduce the performance of the system. Tharun et al. [ 5 ] performed the accuracy measure of the comparative study of statistical modeling and regression techniques (SVM, RF & DT) for rainfall prediction using environmental features. According to the result of the study, the regression techniques of rainfall prediction outperformed the statistical modeling. The experimental result showed that the RF model performed and predicted accurately than the SVM and DT. Hence, rainfall prediction is accurate, it shows high performance in machine learning models than the traditional models. This research used different machine learning techniques rather than statistical methods to predict daily rainfall amounts.

The study by Arnav Garg and Kanchipuram [ 8 ] shows three machine learning algorithm experiments such as support vector machine (SVM), support vector regression (SVR), and K-nearest neighbor (KNN) using the patterns of rainfall in the year. The SVM algorithm performs best among the three machine learning algorithms. This research did not show the experiment result that which environmental features impact the intensity of rainfall. This paper shows the environmental features that have a positive and negative impact on rainfall and predicts the daily rainfall amount using those features.

Scholars, for example, [ 14 , 16 ] confirmed that the multiple linear regression machine learning algorithm outperforms well to predict rainfall using dependent weather variables of temperature, humidity, moisture, wind speed, and finally the study showed the performance of the rainfall prediction can be improved using deep learning models as future work. According to Sarker [ 17 , 18 ] the performance comparison between deep learning and other machine learning algorithms has been shown in Fig.  1 below, where the deep learning model performance increases when the size of the data is increased. Due to the size of the data that is used in this study, machine learning techniques are appropriate.

figure 1

Performance comparison between Deep learning and most machine learning algorithms depending on the amount of data

Scholars [ 9 , 10 ] studied the deep learning algorithm for rainfall prediction by using different dependent weather variables. To provide an accurate prediction of rainfall, prediction models have been developed and experimented with using machine learning techniques.

Therefore, most researchers did not show the prediction of the daily rainfall amount rather conducting experiments on environmental data to predict whether rain or not rain and predict average annual rainfall amount that is the prediction of daily rainfall amount is a challenging task. All relevant environmental features important for rainfall prediction were not used. this paper examined the machine learning algorithms using data collected from one meteorology station which is relatively small in size and selected the appropriate environmental features that correlate with rainfall positively or negatively to examine the performance of the daily rainfall amount prediction machine learning algorithms using MAE and RMSE.

Machine learning algorithms

To choose the better machine learning algorithms to study the daily rainfall amount prediction, various papers have been reviewed concerning rainfall prediction. To predict the daily rainfall intensity using the real-time environmental data, three algorithms such as MLP, RF, and XGBoost gradient descent were chosen for the experiment. Hence, the three machine learning algorithms were experimented with and compared to report the better algorithms to predict the daily rainfall amount.

Multivariate linear regression (MLR)

Linear regression can be multivariate which has multiple independent variables used as input features and simple linear regression which has only one independent or input feature. Both linear regressions have one dependent variable which can be forecasted or predicted based on the input features. This paper presented the multivariate linear regression because multiple environmental variables or features were used to predict the dependent variable called daily rainfall amount. Linear regression is a supervised machine learning technique used to predict the unknown daily rainfall amount using the known environmental variables. The multivariate linear regression used multiple explanatory or independent variables (X) and single dependent or output variable denoted by Y. Hence, the general equation of the multiple linear regression is given as:

where \(x_{i}^{T} { }\) is transpose of \(x_{i}\) the input or independent variable, \(\beta\) is regression coefficient, \(\varepsilon_{i}\) is error term or noise, \(Y_{i}\) is a dependent variable.

The general multivariate linear regression equation of this paper is given as

The size of the data set collected from the meteorological station for this study was appropriate to use the machine learning algorithms called multivariate linear regression that can estimate the daily amount of rainfall in the region. This algorithm can show how strongly each environmental variable influences the intensity of the daily rainfall.

Random forest (RF)

A Random Forest Regression model is powerful and accurate. It usually performs great on many problems, including features with non-linear relationships. Random forest regression is a supervised machine learning algorithm that uses the ensemble learning method for regression. RF works by building several decision trees during training time and outputting the mean of the classes as the prediction of all the trees. The RF algorithm works on the following steps:

Take at random p data points from the training set

Build a decision tree associated with these p data points

Take the number N of trees to build and repeat a and b steps

For a new data point, make each one of the N tree trees predict the value of y for the data point and assign the new data point to the average of all of the predicted y values.

Random forest algorithm is one of the supervised machine learning algorithms that are selected as the predictive model for daily rainfall prediction using environmental input variables or features. Random forest regression is operated by constructing a multitude of decision trees at the training time and outputting the class that is the mode of mean prediction or regression of the individual trees. According to [ 2 ] the RF algorithm is efficient for large datasets and a good experimental result is obtained using large datasets having a large proportion of the data is missing.

XGBoost gradient descent

XGBoost stands for e X treme G radient B oosting; it is a specific implementation of the Gradient Boosting method which uses more accurate approximations to find the best tree model. XGBoost is implemented for the supervised machine learning problem that has data with multiple features of x i to predict a target variable y i . Most authors use XGBoost for different regression and classification problems due to the speed and prediction accuracy of the algorithm.

Extreme Gradient Boosting (XGBoost) is one of the efficient [ 19 ] algorithms in the gradient descant that has a linear model algorithm and tree learning algorithm. It is faster than other gradient descent algorithms because of the parallel computation on a single machine. This paper chooses the XGBoosting algorithm for experiments to predict the target variable daily rainfall intensity using various input or dependent environmental variables. XGBoost is a powerful algorithm that is fast learning through parallel and distributed computing and offers efficient memory usage that produces a robust solution.

Methodology

Data collection.

For this study, the raw data were collected from the regional meteorological station at Bahir Dar City, Ethiopia. Ten data features such as year, month, date, evaporation, sunshine, maximum temperature, minimum temperature, humidity, wind speed, and rainfall were included. The meteorology station records the values of the environmental variable every day for each year directly from the devices in the station. Then, the data were recorded in the Microsoft Excel file tabular format. The year and the days of the month were arranged in the row of tables related to environmental variables in the column of the table.

The raw data recorded at the station for 20 years (1999–2018) were used for the study.

Data preprocessing

The data preprocessing step included the data conversion, manage missing values, categorical encoding, and splitting dataset for training and testing dataset. A total of 20 years (1999–2018) data were collected from the meteorology office. Since the data were raw, they contained missing values, and wrongly encoded values so that the missing values of the target variable were removed and the other features were filled using the mean of the data.

In the meteorology office, the raw data were also arranged in a year based and the attributes in rows that need to combine and rearrange features in columns. Thus, data were converted from excel data to CSV data.

Encoding the dataset was performed and then the dataset was prepared for the experiment. The important features for rainfall prediction were selected and the dataset splitting as 80% for training and 20% for testing were considered as an input for the model.

In this paper, the rainfall was predicted using a machine learning technique. Three machine learning algorithms such as Multivariate Linear Regression (MLR), Random Forest (RF), and gradient descent XGBoost were analyzed which took input variables having moderately and strongly related environmental variables with rainfall. The better machine learning algorithm was identified and reported based on the performance measure using RMSE and MAE (Fig.  2 ).

figure 2

Machine Learning Model

Measuring performance

Pearson correlation was used to measure the strength of the relationship between two variables. The two variables can be positively or negatively correlated and no relationship between the two variables if the Pearson correlation coefficient is zero. The Pearson correlation coefficient model is mathematically described as:

where r xy is the Pearson correlation coefficient, {(x 1 , y 1 ), (x 2 , y 2 ), …, (x n , y n )} are paired data consisting of n pairs and \(\overline{x} \,and{ }\overline{y}\) are mean of x and y respectively.

To show the relevant features of the environmental variables to predict daily rainfall intensity, the following Pearson coefficient ranges and interpretations are used as shown in Table 1 .

The machine learning algorithms take the input data features which are selected using the Pearson correlation coefficient as relevant features.

The rainfall prediction performance of each machine learning algorithm that was used in this study was measured using Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE) to compare which machine learning algorithms outperform better than others. RMSE and MAE were two of the most common metrics used to measure accuracy for continuous variables. The MAE measures the average magnitude of the errors in a set of forecasts and the corresponding observation, without considering their direction.

The RMSE is a quadratic scoring rule which measures the average magnitude of the error. It’s the square root of the average of squared differences between prediction and actual observation.

RMSE gives a relatively high weight to large errors. This means the RMSE is most useful when large errors are particularly undesirable. The MAE and the RMSE can be used together to diagnose the variation in the errors in a set of forecasts. The RMSE will always be larger or equal to the MAE; the greater difference between them, the greater the variance in the individual errors in the sample. If the RMSE = MAE, then all the errors are of the same magnitude.

The main objective of this study was to identify the relevant atmospheric features that cause rainfall and predict the intensity of daily rainfall using machine learning techniques. Consequently, the research findings are summarized below.

To choose the environmental variables that correlate with the rainfall, the Pearson correlation was analyzed on the environmental variables presented in Table 1 above. Since the dataset is large, the variables that correlate greater than 0.20 with rainfall were considered as the participant environmental features to the experiment for rainfall prediction. Hence, to predict the amount of daily rainfall, the results of environmental attributes relevant to daily rainfall prediction like Evaporation, Relative Humidity, Sunshine, Maximum Daily Temperature, and Minimum Daily Temperature are shown in Table 2 .

The Pearson Correlation coefficient experimental results on the given data showed that the attributes such as year, month, day, and wind speed had no significant impact on the prediction of rainfall. This paper took environmental values which had a correlation coefficient greater than 0.2 and analyzed the rainfall prediction. The highly correlated environmental features for rainfall prediction were relative humidity and the daily sunshine which measured the Pearson coefficient of 0.401 and 0.351 respectively.

The machine learning model used the selected environmental features as an input for the algorithms. The regression models were implemented in python and the performances of the MLR, RF, and XGBoost were measured using MAE and RMSE.

In Table 3 above, the comparison of results of the three algorithms such as the MLR, RF, and XGBoost was made. The performance results indicated that XGBoost Gradient descent outperformed MLR and RF. The MAE and RMSE values of the XGBoost gradient descent algorithms were 3.58 and 7.85 respectively so that The XGBoost algorithm predicted the rainfall using relevant selected environmental features better than the RF and the MLR.

The environmental features used in this study taken from the meteorological station collected by measuring devices are analyzed their relevance on the impact of rainfall and selected the relevant features based on experiment result of Pearson correlation values as shown in Table 2 for the daily rainfall prediction. This paper took environmental features which had a correlation coefficient greater than 0.2 and analyzed the rainfall prediction. Similarly, Manandhar et al. [ 7 ] identifies the five important environmental features such as Temperature, Relative Humidity, Dew Point, Solar Radiation, precipitable water vapor using a degree of correlation among each feature. According to the experiment result of the study, a high negative correlation coefficient of around − 0.9 is observed between Temperature and Relative Humidity. The researcher Prabakaran et al. [ 15 ] used the year, temperature, cloud cover and year attribute for the experiment without analyzing the relationship between environmental features, and Gnanasankaran and Ramaraj, [ 14 ] did not show the impact of environmental features on rainfall rather used the monthly and annual rainfall data to predict the average yearly rainfall.

This study used the relevant environmental feature to train and test the three machine learning models such as RF, MLR, and XGBoost for the daily rainfall amount prediction. The performance of these machine learning models was measured using MAE and RMSE. The RAM of RF, MLR, XGBoost are 4.49, 4.97, and 3.58, and the RMSE is 8.82, 8.61, and 7.85 respectively. Similarly, the researcher Manandhar et al. [ 7 ] used data-driven machine learning algorithms to predict the annual rainfall using the selected relevant environmental features and recorded an overall accuracy of 79.6%. The researcher considered the attributes to predict the amount of yearly rainfall amount by taking the average value of temperature, cloud cover, and rainfall for a year as an input. The correlation analysis between attributes was not assessed. The average error percentage of the yearly rainfall prediction using modified linear regression was 7%. The researcher Gnanasankaran and Ramaraj [ 14 ], did not show the impact of environmental features on rainfall. The research took the monthly and annual rainfall for the prediction of rainfall and measures the performance using RMSE which was 0.1069 and MAE which was 0.0833 using multiple linear regression.

Hence, this study assessed the impact of environmental features on the daily rainfall intensity using the Pearson correlation and selected the relevant environmental variables. The relevant features are used as an input for the daily rainfall amount prediction machine learning models and the performance of the models are measured using MAE and RMSE.

Rainfall Prediction is the application area of data science and machine learning to predict the state of the atmosphere. It is important to predict the rainfall intensity for effective use of water resources and crop production to reduce mortality due to flood and any disease caused by rain. This paper analyzed various machine learning algorithms for rainfall prediction. Three machine learning algorithms such as MLR, FR, and XGBoost were presented and tested using the data collected from the meteorological station at Bahir Dar City, Ethiopia.

The relevant environmental features for rainfall prediction were selected using the Pearson correlation coefficient. The selected features were used as the input variables for the machine learning model used in this paper. A comparison of results among the three algorithms (MLR, RF, and XGBoost) was made and the results showed that the XGBoost was a better-suited machine learning algorithm for daily rainfall amount prediction using selected environmental features. The accuracy of the rainfall amount prediction may increase if the sensor data is incorporated for the study. But the sensor data was not considered in this study.

The Rainfall prediction accuracy can be improved using sensor and meteorological datasets with additional different environmental features. Hence, in future work, big data analysis can be used for rainfall prediction if the sensor and meteorological datasets are used for the daily rainfall amount prediction study.

Availability of data and materials

The raw data collected from the North West of Ethiopia Meteorology Agency is available for researchers if it is requested and the materials that the authors used are available at the authors' hands.

Abbreviations

Extreme Gradient Boosting

Multivariate Linear Regression

Random Forest

Root Mean Squared Error

Mean Absolute Error

Support Vector Machine

Decision Tree

Ehsan MA. Seasonal predictability of Ethiopian Kiremt rainfall and forecast skill of ECMWF's SEAS5 model. Climate Dynamics. 2021; 1–17.

Kusiak A, Verma AP, Roz E. Modeling and prediction of rainfall using radar reflectivity data: a data-mining approach. IEEE Trans Geosci Remote Sens. 2013;51:2337–42.

Article   Google Scholar  

Chowdari KK, Girisha R, Gouda KC. A study of rainfall over India using data mining. In 2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT). IEEE: New York. 2015; pp. 44–47.

Namitha K, Jayapriya A, SanthoshKumar G. Rainfall prediction using artificial neural network on map-reduce framework. ACM. 2015. https://doi.org/10.1145/2791405.2791468 .

Tharun VP, Prakash R, Devi SR. Prediction of Rainfall Using Data Mining Techniques. In 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT). IEEE Xplore. 2018; pp. 1507–1512.

Zainudin S, Jasim DS, Bakar AA. Comparative analysis of data mining techniques for malaysian rainfall prediction. Int J Adv Sci Eng Inform Technol. 2016;6(6):1148–53.

Manandhar S, Dev S, Lee YH, Meng YS, Winkler S. A data-driven approach for accurate rainfall prediction. IEEE Trans Geosci Remote Sens. 2019;5(11):9323–31.

Arnav G, Kanchipuram Tamil Nadu. Rainfall prediction using machine learning. Int J Innovative Sci Res Technol. 2019. 56–58.

Aswin S, Geetha P, Vinayakumar R. Deep learning models for the prediction of rainfall. In 2018 International Conference on Communication and Signal Processing (ICCSP). IEEE: New York. 2018; pp. 0657–0661.

Zeelan BCMAK, Bhavana N, Bhavya P, Sowmya V. Rainfall prediction using machine learning & deep learning techniques. Proceedings of the International Conference on Electronics and Sustainable Communication Systems (ICESC 2020). Middlesex University: IEEE Xplore. 2020; pp. 92–97.

Vijayan R, Mareeswari V, Mohankumar P, Gunasekaran G, Srikar K, (JUNE,. Estimating rainfall prediction using machine learning techniques on a dataset. Int J Sci Technol Res. 2020;9(06):440–5.

Google Scholar  

Chaudhari MM, Choudhari DN. Study of various rainfall estimation & prediction techniques using data mining. Am J Eng Res. 2017;6(7):137–9.

Thirumalai C, Harsha KS, Deepak ML, Krishna KC. Heuristic prediction of rainfall using machine learning techniques. In 2017 International Conference on Trends in Electronics and Informatics (ICEI). IEEE: New York. 2017; pp. 1114–1117.

Gnanasankaran N, Ramaraj E. A multiple linear regression model to predict rainfall using indian meteorological data. Int J Adv Sci Technol. 2020;29(8):746–58.

Prabakaran S, Kumar PN, Tarun PSM. Rainfall prediction using modified linear regression. ARPN J Eng Appl Sci. 2017;12(12):3715–8.

Balan MS, Selvan JP, Bisht HR, Gadgil YA, Khaladkar IR, Lomte VM. Rainfall prediction using deep learning on highly non-linear data. Int J Res Eng Sci Manage. 2019;2(3):590–2.

Sarker IH. Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions. SN Comput Sci. 2021;2(6):1–20.

Article   MathSciNet   Google Scholar  

Sarker IH. Machine learning: algorithms, real-world applications and research directions. SN Comput Sci. 2021;2(3):1–21.

MathSciNet   Google Scholar  

Srinivas AST, Somula R, Govinda K, Saxena A, Reddy PA. Estimating rainfall using machine learning strategies based on weather radar data. Int J Commun Syst. 2020;33(13):1–11.

Download references

Acknowledgements

We gratefully acknowledge the North West of Ethiopia Meteorology Agency for providing meteorological data, valuable information, and kind help for the completion of this study.

There are no funding organizations or individuals.

Author information

Authors and affiliations.

Bahir Dar University, Bahir Dar Institute of Technology, Bahir Dar, Ethiopia

Chalachew Muluken Liyew & Haileyesus Amsaya Melese

You can also search for this author in PubMed   Google Scholar

Contributions

CML designed and coordinated this research, drafted the manuscript, and experiment. CML and HAM carried out the data collection and data analysis. Both the authors read and approved the final manuscript.

Corresponding author

Correspondence to Chalachew Muluken Liyew .

Ethics declarations

Ethics approval and consent to participate.

Not applicable.

Consent for publication

Competing interests.

The authors declare that they have no competing interests.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Liyew, C.M., Melese, H.A. Machine learning techniques to predict daily rainfall amount. J Big Data 8 , 153 (2021). https://doi.org/10.1186/s40537-021-00545-4

Download citation

Received : 11 August 2021

Accepted : 23 November 2021

Published : 07 December 2021

DOI : https://doi.org/10.1186/s40537-021-00545-4

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Machine learning
  • Rainfall prediction

rainfall prediction using machine learning thesis

Rainfall Prediction Using Machine Learning Models: Literature Survey

Cite this chapter.

rainfall prediction using machine learning thesis

  • Eslam A. Hussein 6 ,
  • Mehrdad Ghaziasgar 6 ,
  • Christopher Thron 7 ,
  • Mattia Vaccari 8 &
  • Yahlieel Jafta 6  

Part of the book series: Studies in Computational Intelligence ((SCI,volume 1006))

837 Accesses

6 Citations

Research on rainfall prediction contributes to different fields that have a huge impact on our daily life. With the advancement of computer technology, machine learning has been extensively used in the area of rainfall prediction. However, some papers suggest that applications of machine learning in different fields are deficient is some respects. This chapter performs a review on 66 research papers that use machine learning tools to predict rainfall. The papers are examined in terms of the source of the data, output objective, input features, pre-processing, model used, and the results. The review shows questionable aspects present in many studies. In particular, many studies lack a baseline predictor for comparison. Also, many references do not provide error bars for prediction errors, so that the significance of differences between prediction methods cannot be determined. In addition, some references utilize practices that permit data leakage, leading to overestimates of predictive accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
  • Available as EPUB and PDF
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
  • Durable hardcover edition

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Abbot, J., & Marohasy, J. (2016). Forecasting monthly rainfall in the western Australian wheat-belt up to 18-months in advance using artificial neural networks. In Australasian Joint Conference on Artificial Intelligence (pp. 71–87). Berlin: Springer.

Google Scholar  

Abbot, J., & Marohasy, J. (2017). Application of artificial neural networks to forecasting monthly rainfall one year in advance for locations within the Murray Darling basin, Australia. International Journal of Sustainable Development and Planning, 12 (8), 1282–1298.

Article   Google Scholar  

Aguasca-Colomo, R., Castellanos-Nieves, D., & Méndez, M. (2019). Comparative analysis of rainfall prediction models using machine learning in islands with complex orography: Tenerife island. Applied Sciences, 9 (22), 4931.

Amiri, M. A., Amerian, Y., & Mesgari, M. S. (2016). Spatial and temporal monthly precipitation forecasting using wavelet transform and neural networks, Qara-Qum catchment, Iran. Arabian Journal of Geosciences, 9 (5), 421.

Aswin, S., Geetha, P., & Vinayakumar, R. (2018). Deep learning models for the prediction of rainfall. In 2018 International Conference on Communication and Signal Processing (ICCSP) (pp. 0657–0661). Piscataway: IEEE.

Chapter   Google Scholar  

Ayzel, G., Heistermann, M., Sorokin, A., Nikitin, O., & Lukyanova, O. (2019). All convolutional neural networks for radar-based precipitation nowcasting. Procedia Computer Science, 150 , 186–192.

Balamurugan, M. S., & Manojkumar, R. (2021). Study of short term rain forecasting using machine learning based approach. Wireless Networks, 27 , 5429–5434.

Banadkooki, F. B., Ehteram, M., Ahmed, A. N., Fai, C. M., Afan, H. A., Ridwam, W. M., Sefelnasr, A., & El-Shafie, A. (2019). Precipitation forecasting using multilayer neural network and support vector machine optimization based on flow regime algorithm taking into account uncertainties of soft computing models. Sustainability, 11 (23), 6681.

Barnett, A. G., Baker, P., & Dobson, A. (2012). Analysing seasonal data. R Journal, 4 (1), 5–10.

Beheshti, Z., Firouzi, M., Shamsuddin, S. M., Zibarzani, M., & Yusop, Z. (2016). A new rainfall forecasting model using the CAPSO algorithm and an artificial neural network. Neural Computing and Applications, 27 (8), 2551–2565.

Bojang, P. O., Yang, T.-C., Pham, Q. B., & Yu, P.-S. (2020). Linking singular spectrum analysis and machine learning for monthly rainfall forecasting. Applied Sciences, 10 (9), 3224.

Boonyuen, K., Kaewprapha, P., & Srivihok, P. (2018). Daily rainfall forecast model from satellite image using convolution neural network. In 2018 IEEE International Conference on Information Technology (pp. 1–7).

Boonyuen, K., Kaewprapha, P., Weesakul, U., & Srivihok, P. (2019). Convolutional neural network inception-v3: A machine learning approach for leveling short-range rainfall forecast model from satellite image. In International Conference on Swarm Intelligence (pp. 105–115). Berlin: Springer.

Canchala, T., Alfonso-Morales, W., Carvajal-Escobar, Y., Cerón, W. L., & Caicedo-Bravo, E. (2020). Monthly rainfall anomalies forecasting for southwestern Colombia using artificial neural networks approaches. Water, 12 (9), 2628.

Castro, R., Souto, Y. M., Ogasawara, E., Porto, F., & Bezerra, E. (2020). STConvS2S: Spatiotemporal convolutional sequence to sequence network for weather forecasting. Neurocomputing, 426 , 285–298.

Chattopadhyay, A., Hassanzadeh, P., & Pasha, S. (2020). Predicting clustered weather patterns: A test case for applications of convolutional neural networks to spatio-temporal climate data. Sci. Rep. 10 (1), 1–13.

Chen, L., Cao, Y., Ma, L., & Zhang, J. (2020). A deep learning based methodology for precipitation nowcasting with radar. Earth and Space Science, 7 , e2019EA000812.

Chen, K., Liu, J., Guo, S., Chen, J., Liu, P., Qian, J., Chen, H., & Sun, B. (2016). Short-term precipitation occurrence prediction for strong convective weather using fy2-g satellite data: A case study of Shenzhen, South China. The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, 41 , 215.

Chhetri, M., Kumar, S., Pratim Roy, P., & Kim, B.-G. (2020). Deep BLSTM-GRU model for monthly rainfall prediction: A case study of Simtokha, Bhutan. Remote Sensing, 12 (19), 3174.

Cristian, M. (2018). Average monthly rainfall forecast in Romania by using k-nearest neighbors regression. Analele Universităţii Constantin Brâncuşi din Târgu Jiu: Seria Economie, 1 (4), 5–12.

Dacrema, M. F., Cremonesi, P., & Jannach, D. (2019). Are we really making much progress? A worrying analysis of recent neural recommendation approaches. In Proceedings of the 13th ACM Conference on Recommender Systems (pp. 101–109).

Damavandi, H. G., & Shah, R. (2019). A learning framework for an accurate prediction of rainfall rates. arXiv:1901.05885.

Dash, Y., Mishra, S. K., & Panigrahi, B. K. (2018). Rainfall prediction for the Kerala state of India using artificial intelligence approaches. Computers & Electrical Engineering, 70 , 66–73.

Delleur, J. W., & Kavvas, M. L. (1978). Stochastic models for monthly rainfall forecasting and synthetic generation. Journal of Applied Meteorology, 17 (10), 1528–1536.

Diez-Sierra, J., & del Jesus, M. (2020). Long-term rainfall prediction using atmospheric synoptic patterns in semi-arid climates with statistical and machine learning methods. Journal of Hydrology, 586 , 124789.

Du, Y., Berndtsson, R., An, D., Zhang, L., Yuan, F., Uvo, C. B., & Hao, Z. (2019). Multi-space seasonal precipitation prediction model applied to the source region of the Yangtze river, China. Water, 11 (12), 2440.

Du, J., Liu, Y., & Liu, Z. (2018). Study of precipitation forecast based on deep belief networks. Algorithms, 11 (9), 132.

Du, J., Liu, Y., Yu, Y., & Yan, W. (2017). A prediction of precipitation data based on support vector machine and particle swarm optimization (PSO-SVM) algorithms. Algorithms, 10 (2), 57.

Article   MathSciNet   MATH   Google Scholar  

Duong, T. A., Bui, M. D., & Rutschmann, P. (2018). A comparative study of three different models to predict monthly rainfall in Ca Mau, Vietnam. In Wasserbau-Symposium Graz 2018. Wasserwirtschaft–Innovation aus Tradition. Tagungsband. Beiträge zum 19. Gemeinschafts-Symposium der Wasserbau-Institute TU München, TU Graz und ETH Zürich (p. Paper–G5).

Gao, L., Wei, F., Yan, Z., Ma, J., & Xia, J. (2019). A study of objective prediction for summer precipitation patterns over eastern China based on a multinomial logistic regression model. Atmosphere, 10 (4), 213.

Haidar, A., & Verma, B. (2018). Monthly rainfall forecasting using one-dimensional deep convolutional neural network. IEEE Access, 6 , 69053–69063.

Htike, K. K., & Khalifa, O. O. (2010). Rainfall forecasting models using focused time-delay neural networks. In International Conference on Computer and Communication Engineering (ICCCE’10) (pp. 1–6). Piscataway: IEEE.

Huang, M., Lin, R., Huang, S., & Xing, T. (2017). A novel approach for precipitation forecast via improved k-nearest neighbor algorithm. Advanced Engineering Informatics, 33 , 89–95.

Hung, N. Q., Babel, M. S., Weesakul, S., & Tripathi, N. K. (2009). An artificial neural network model for rainfall forecasting in Bangkok, Thailand. Hydrology and Earth System Sciences, 13 (8), 1413–1425.

Hussein, E., Ghaziasgar, M., & Thron, C. (2020). Regional rainfall prediction using support vector machine classification of large-scale precipitation maps. In 2020 IEEE 23rd International Conference on Information Fusion (FUSION) (pp. 1–8). Piscataway: IEEE.

Hussein, E. A., Ghaziasgar, M., Thron, C., Vaccari, M., & Bagula, A. (2021). Basic statistical estimation outperforms machine learning in monthly prediction of seasonal climatic parameters. Atmosphere, 12 (5), 539.

Jing, J., Li, Q., & Peng, X. (2019). MLC-LSTM: Exploiting the spatiotemporal correlation between multi-level weather radar echoes for echo sequence extrapolation. Sensors, 19 (18), 3988.

Karimi, H. A. (2014). Big data: Techniques and technologies in geoinformatics . Boca Raton: CRC Press.

Book   Google Scholar  

Kashiwao, T., Nakayama, K., Ando, S., Ikeda, K., Lee, M., & Bahadori, A. (2017). A neural network-based local rainfall prediction system using meteorological data on the internet: A case study using data from the Japan meteorological agency. Applied Soft Computing, 56 , 317–330.

Kumar, D., Singh, A., Samui, P., & Jha, R. K. (2019). Forecasting monthly precipitation using sequential modelling. Hydrological Sciences Journal, 64 (6), 690–700.

Lakshmaiah, K., Murali Krishna, S., & Eswara Reddy, B. (2016). Application of referential ensemble learning techniques to predict the density of rainfall. In 2016 International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques (ICEECCOT) (pp. 233–237). Piscataway: IEEE.

Lee, J., Kim, C.-G., Lee, J. E., Kim, N. W., & Kim, H. (2018). Application of artificial neural networks to rainfall forecasting in the Geum river basin, Korea. Water, 10 (10), 1448.

Article   MathSciNet   Google Scholar  

Lin, J. (2019). The neural hype and comparisons against weak baselines. In ACM SIGIR forum (vol. 52, pp. 40–51). New York: ACM.

Lu, J., Hu, W., & Zhang, X. (2018). Precipitation data assimilation system based on a neural network and case-based reasoning system. Information, 9 (5), 106.

Ludewig, M., & Jannach, D. (2018). Evaluation of session-based recommendation algorithms. User Modeling and User-Adapted Interaction, 28 (4–5), 331–390.

Mallika, M., & Nirmala, M. (2016). Chennai annual rainfall prediction using k-nearest neighbour technique. International Journal of Pure and Applied Mathematics, 109 (8), 115–120.

Manandhar, S., Dev, S., Lee, Y. H., Meng, Y. S., & Winkler, S. (2019). A data-driven approach for accurate rainfall prediction. IEEE Transactions on Geoscience and Remote Sensing, 57 (11), 9323–9331.

Mehdizadeh, S., Behmanesh, J., & Khalili, K. (2018). New approaches for estimation of monthly rainfall based on GEP-ARCH and ANN-ARCH hybrid models. Water Resources Management, 32 (2), 527–545.

Mehr, A. D., Nourani, V., Khosrowshahi, V. K., & Ghorbani, M. A. (2019). A hybrid support vector regression–firefly model for monthly rainfall forecasting. International Journal of Environmental Science and Technology, 16 (1), 335–346.

Mishra, N., & Kushwaha, A. (2019). Rainfall prediction using gaussian process regression classifier. International Journal of Advanced Research in Computer Engineering & Technology (IJARCET), 8 (8), 392–397.

Mohamadi, S., Ehteram, M., & El-Shafie, A. (2020). Accuracy enhancement for monthly evaporation predicting model utilizing evolutionary machine learning methods. International Journal of Environmental Science and Technology, 17 , 1–24.

Mosavi, A., Ozturk, P., & Chau, K.-W. (2018). Flood prediction using machine learning models: Literature review. Water, 10 (11), 1536.

Nasseri, M., Asghari, K., & Abedini, M. J. (2008). Optimized scenario for rainfall forecasting using genetic algorithm coupled with artificial neural network. Expert Systems with Applications, 35 (3), 1415–1421.

Nielsen, A. (2020). Practical time series analysis: Prediction with statistics and machine learning . Sebastopol: O’Reilly.

Nourani, V., Uzelaltinbulat, S., Sadikoglu, F., & Behfar, N. (2019). Artificial intelligence based ensemble modeling for multi-station prediction of precipitation. Atmosphere, 10 (2):80.

Oswal, N. (2019). Predicting rainfall using machine learning techniques. arXiv:1910.13827.

Pan, B., Hsu, K., AghaKouchak, A., & Sorooshian, S. (2019). Improving precipitation estimation using convolutional neural network. Water Resources Research, 55 (3), 2301–2321.

Pantanowitz, A., & Marwala, T. (2009). Missing data imputation through the use of the random forest algorithm. In Advances in Computational Intelligence (pp. 53–62). Berlin: Springer.

Parmar, A., Mistree, K., & Sompura, M. (2017). Machine learning techniques for rainfall prediction: A review. In International Conference on Innovations in Information Embedded and Communication Systems .

Patel, M., Patel, A., Ghosh, R. (2018). Precipitation nowcasting: Leveraging bidirectional LSTM and 1d CNN. arXiv:1810.10485.

Peng, Y., Zhao, H., Zhang, H., Li, W., Qin, X., Liao, J., Liu, Z., Li, J. (2019). An extreme learning machine and gene expression programming-based hybrid model for daily precipitation prediction. International Journal of Computational Intelligence Systems, 12 (2), 1512–1525.

Pham, Q. B., Abba, S. I., Usman, A. G., Linh, N. T. T., Gupta, V., Malik, A., Costache, R., Vo, N. D., & Tri, D. Q. (2019). Potential of hybrid data-intelligence algorithms for multi-station modelling of rainfall. Water Resources Management, 33 (15), 5067–5087.

Ramsundram, N., Sathya, S., & Karthikeyan, S. (2016). Comparison of decision tree based rainfall prediction model with data driven model considering climatic variables. Irrigation and Drainage Systems Engineering, 5 (3).

Sardeshpande, K. D., & Thool, V. R. (2019). Rainfall prediction: A comparative study of neural network architectures. In Emerging Technologies in Data Mining and Information Security (pp. 19–28). Berlin: Springer.

Sato, R., Kashima, H., & Yamamoto, T. (2018). Short-term precipitation prediction with skip-connected PredNET. In International Conference on Artificial Neural Networks (pp. 373–382). Berlin: Springer.

Shah, A. D., Bartlett, J. W., Carpenter, J., Nicholas, O., & Hemingway, H. (2014). Comparison of random forest and parametric imputation models for imputing missing data using mice: A caliber study. American Journal of Epidemiology, 179 (6), 764–774.

Shenify, M., Danesh, A. S., Gocić, M., Taher, R. S., Wahab, Ainuddin, W. A., Gani, A., Shamshirband, S., & Petković, D. (2016). Precipitation estimation using support vector machine with discrete wavelet transform. Water Resources Management, 30 (2), 641–652.

Shi, E., Li, Q., Gu, D., & Zhao, Z. (2017). Convolutional neural networks applied on weather radar echo extrapolation. In DEStech Transactions on Computer Science and Engineering (case), 695–704. DEStech Publications.

Shi, X., Chen, Z., Wang, H., Yeung, D., Wong, W., & Woo, W. C. (2015). Convolutional LSTM network: A machine learning approach for precipitation nowcasting. ArXiv, abs/1506.04214.

Shi, X., Gao, Z., Lausen, L., Wang, H., Yeung, D.-Y., Wong, W., Woo, W.-C. (2017). Deep learning for precipitation nowcasting: A benchmark and a new model . In Advances in Neural Information Processing Systems (pp. 5617–5627).

Shi, X., & Yeung, D.-Y. (2018). Machine learning for spatiotemporal sequence forecasting: A survey. arXiv:1808.06865.

Singh, G., & Kumar, D. (2019). Hybrid prediction models for rainfall forecasting. In 2019 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence) (pp. 392–396). Piscataway: IEEE.

Singh, S., Sarkar, S., & Mitra, P. (2017). Leveraging convolutions in recurrent neural networks for doppler weather radar echo prediction. In International Symposium on Neural Networks (pp. 310–317). Berlin: Springer.

Sulaiman, J., & Wahab, S. H. (2018). Heavy rainfall forecasting model using artificial neural network for flood prone area. In IT Convergence and Security 2017 (pp. 68–76). Berlin: Springer.

Tang, F., & Ishwaran, H. (2017). Random forest missing data algorithms. Statistical Analysis and Data Mining: The ASA Data Science Journal, 10 (6), 363–377.

Tran, Q.-K., & Song, S.-K. (2019a). Computer vision in precipitation nowcasting: Applying image quality assessment metrics for training deep neural networks. Atmosphere, 10 (5), 244.

Tran, Q.-K., & Song, S.-K. (2019b). Multi-channel weather radar echo extrapolation with convolutional recurrent neural networks. Remote Sensing, 11 (19), 2303.

Valencia-Payan, C., & Corrales, J. C. (2018). A rainfall prediction tool for sustainable agriculture using random forest. In Mexican International Conference on Artificial Intelligence (pp. 315–326). Berlin: Springer.

Wang, Y., Long, M., Wang, J., Gao, Z., & Philip, S. Y. (2017). PredRNN: Recurrent neural networks for predictive learning using spatiotemporal LSTMs. In Advances in Neural Information Processing Systems (pp. 879–888).

Weesakul, U., Kaewprapha, P., Boonyuen, K., & Mark, O. (2018). Deep learning neural network: A machine learning approach for monthly rainfall forecast, case study in eastern region of Thailand. Engineering and Applied Science Research, 45 (3), 203–211.

Xu, L., Chen, N., Zhang, X., & Chen, Z. (2020). A data-driven multi-model ensemble for deterministic and probabilistic precipitation forecasting at seasonal scale. Climate Dynamics, 54 , 3355–3374.

Yu, P.-S., Yang, T.-C., Chen, S.-Y., Kuo, C.-M., & Tseng, H.-W. (2017). Comparison of random forests and support vector machine for real-time radar-derived rainfall forecasting. Journal of Hydrology, 552 , 92–104.

Zainudin, S., Jasim, D. S., & Bakar, A. A. (2016). Comparative analysis of data mining techniques for Malaysian rainfall prediction. International Journal on Advanced Science, Engineering and Information Technology, 6 (6), 1148–1153.

Zhan, C., Wu, F., Wu, Z., & Chi, K. T. (2019). Daily rainfall data construction and application to weather prediction. In 2019 IEEE International Symposium on Circuits and Systems (ISCAS) (pp. 1–5). Piscataway: IEEE.

Zhang, C.-J., Zeng, J., Wang, H.-Y., Ma, L.-M., & Chu, H. (2020). Correction model for rainfall forecasts using the LSTM with multiple meteorological factors. Meteorological Applications, 27 (1), e1852.

Zhang, P., Jia, Y., Gao, J., Song, W., & Leung, H. K. N. (2018). Short-term rainfall forecasting using multi-layer perceptron. IEEE Transactions on Big Data, 6 , 93–106.

Zhuang, W. Y., & Ding, W. (2016). Long-lead prediction of extreme precipitation cluster via a spatiotemporal convolutional neural network. In Proceedings of the 6th International Workshop on Climate Informatics: CI .

Download references

Acknowledgements

E.A.H. acknowledges financial support from the South African National Research Foundation (NRF CSUR Grant Number 121291 for the HIPPO project) and from the Telkom-Openserve-Aria Technologies Center of Excellence at the Department of Computer Science of the University of the Western Cape.

Appendix 1: List of Abbreviations

ML Machine learning

AD Author defined

ANNs Artificial neural networks

CNNs Convolution neural networks

LSTMs Long short-term memory

ConvLSTMs Convolutions layers with Long short-term memory

RF Random forest

RF SVMs Support vector machines

DT Decision tress

XGB Extreme gradient boosting

LogReg Logistic regression

MLR Multi linear regression

KNNs K-nearest neighbour

RMSE Root mean square error

MAE mean absolute error

CA Classification accuracy

pre precision

f1 f1-score

PACF Partial autocorrelation function

ACF Autocorrelation function

PCA principle component analysis

NOAA National Oceanic and Atmospheric Administration

Appendix 2: Summary Tables for References

This appendix contains four tables which summarize the findings for the reviewed papers for long-term data Tables 4.1 and 4.2 , and short-term data Tables 4.3 and 4.4 . Tables 4.1 and 4.3 contain information regarding the source, period, region, input, output; while Tables 4.2 and 4.4 include information about the pre-processing tools, data leakage, and the ML used.

Author information

Authors and affiliations.

Department of Computer Science, University of the Western Cape, Cape Town, South Africa

Eslam A. Hussein, Mehrdad Ghaziasgar & Yahlieel Jafta

Department of Science and Mathematics, Texas A&M University-Central, Killeen, TX, USA

Christopher Thron

Department of Physics and Astronomy, University of the Western Cape, Cape Town, South Africa

Mattia Vaccari

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Eslam A. Hussein .

Editor information

Editors and affiliations.

Computing and Mathematical Sciences, Liverpool John Moores University, Liverpool, UK

Mohamed Alloghani

The UAE Artificial Intelligence Office, Prime Minister’s Office at the Ministry of Cabinet Affairs and the Future, Dubai, UAE

Department of Science and Mathematics, Texas A&M University-Central Texas, Killeen, TX, USA

College of Computer Studies, International University of Africa, Khartoum, Sudan

Saad Subair

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Hussein, E.A., Ghaziasgar, M., Thron, C., Vaccari, M., Jafta, Y. (2022). Rainfall Prediction Using Machine Learning Models: Literature Survey. In: Alloghani, M., Thron, C., Subair, S. (eds) Artificial Intelligence for Data Science in Theory and Practice. Studies in Computational Intelligence, vol 1006. Springer, Cham. https://doi.org/10.1007/978-3-030-92245-0_4

Download citation

DOI : https://doi.org/10.1007/978-3-030-92245-0_4

Publisher Name : Springer, Cham

Print ISBN : 978-3-030-92244-3

Online ISBN : 978-3-030-92245-0

eBook Packages : Engineering Engineering (R0)

Share this chapter

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research

Academia.edu no longer supports Internet Explorer.

To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to  upgrade your browser .

Enter the email address you signed up with and we'll email you a reset link.

  • We're Hiring!
  • Help Center

paper cover thumbnail

Rainfall Prediction using Machine Learning

Profile image of IJRASET Publication

2022, International Journal for Research in Applied Science & Engineering Technology (IJRASET)

In India, Agriculture contributes major role to Indian economy. For agriculture, Rainfall is important but during these days' rainfall prediction has become a major challenging problem. Good prediction of rainfall provides knowledge and know in advance to take precautions and have better strategy about theirs crops. Global warming is also having severe effect on nature as well as mankind and it accelerates the change in climatic conditions. Because of its air is getting warmer and level of ocean is rising, leads to flood and cultivated field is changing into drought. Due to adverse climatic change leads to unseasonable and unreasonable amount of rainfall. To predict Rainfall is one of the best techniques to know about rainfall and climate. The main aim of this study revolves around providing correct climate description to the clients from various perspectives like agriculture, researchers, generation of power etc. to grasp the need of transformation in climate and its parameters like temperature, humidity, precipitation, wind speed that eventually directs to projection of rainfall. Rainfall also depends on geographic locations hence is an arduous task to predict. Machine Learning is the evolving subset of an AI, that helps in predicting the rainfall. In this research paper, we will be using UCI repository dataset with multiple attributes for predicting the rainfall. The main aim of this study is to develop the rainfall prediction system and predict the rainfall with better accuracy with the use of Machine Learning classification algorithms.

Related Papers

Roshitha kolapalli

Rainfall prediction is important as heavy rainfall can lead to many disasters. The prediction helps people to take preventive measures and moreover the prediction should be accurate. There are two types of prediction short term rainfall prediction and long term rainfall. Prediction mostly short term prediction can gives us the accurate result. The main challenge is to build a model for long term rainfall prediction. Heavy precipitation prediction could be a major drawback for earth science department because it is closely associated with the economy and lifetime of human. It’s a cause for natural disasters like flood and drought that square measure encountered by individuals across the world each year. Accuracy of rainfall statement has nice importance for countries like India whose economy is basically dependent on agriculture. The dynamic nature of atmosphere, applied mathematics techniques fail to provide sensible accuracy for precipitation statement. The prediction of precipitat...

rainfall prediction using machine learning thesis

International Journal of Recent Technology and Engineering

Roshitha Kolapalli

Rainfall is one of the major livelihood of this world. Each and every organism in this universe need some of water to order to survive in its own living conditions. As rainfall is the main source of water and its need to agriculture is inevitable, there arises a necessity to analyze the pattern of the rainfall. The main aim of our paper is to predict the rainfall considering various factors like temperature, pressure, cloud cover, wind speed, pollution and precipitation. There are various ideas and new methodologies proposed in order to predict rainfall. But our proposed concept is based on machine learning because of its wide range of development and preferability nowadays. Among the various technologies built in Machine Learning (ML), Feed Forward Neural Network (FFNN) which is the simplest form of Artificial Neural Network (ANN) is preferred because this model learns the complex relationships among the various input parameters and helps to model them easily. Rainfall in our propo...

Revista Gestão Inovação e Tecnologias

Rudrappa Gujanatti

International Journal for Research in Applied Science and Engineering Technology IJRASET

IJRASET Publication

Substantial rainfall prediction is a significant issue for meteorological department as it is firmly connected with the economy and life of human. It is a reason for cataclysmic events like flood and drought which are experienced by people across the globe consistently. Exactness of rainfall forecasting has great importance for countries like India whose economy is largely dependent on agriculture. The proposed framework utilizes Machine Learning strategy which encourages us to predict the rainfall in an efficient way by using naïve bayes technique.

Chalachew Muluken Liyew

It is crucial to predict the amount of daily rainfall to improve agricultural productivities to secure food, and water quality supply to keep the citizen healthy. To predict rainfall, various researches are conducted using data mining and machine learning techniques of different countries’ environmental datasets. The Pearson correlation technique is used to select relevant environmental variables which are used as an input for the machine learning model of this study. The main objective of this study is to identify the relevant atmospheric features that cause rainfall and predict the intensity of daily rainfall using machine learning techniques. The dataset is collected from the local meteorological office to measure the performance of three machine learning techniques as Multivariate Linear Regression, Random Forest and Extreme Gradient Boost. Root mean squared error and Mean absolute Error are used to measure the performance of the machine learning model for this study. The result...

anurag kumar

Weather and climate prediction are dominated by high dimensionality, interactions on many different spatial and temporal scales, and chaotic dynamics. Machine learning techniques can predict rainfall by extracting hidden patterns from historical weather data. In this technique apply the Multiple Linear regression (MLR) and Support vector regression (SVR) model for rainfall prediction. To design and implement the system, we have gathered 115 years of data from 1901 to 2017 from Kaggle. Our proposed model has been tested and validated with respect to Multiple Linear regression and Support Vector regression. Compared results reveal the satisfactory performance, the SVR had provided maximum accuracy

Asian Journal of Computer Science and Technology

Muheet Ahmed Butt

Prediction of rainfall is one of the most essential and demanding tasks for the weather forecasters since ages. Rainfall prediction plays an important role in the field of farming and industries. Precise rainfall prediction is vital for detecting the heavy rainfall and to provide the information of warnings regarding the natural calamities. Rainfall prediction involves recording the various parameters of weather like wind direction, wind speed, humidity, rainfall, temperature etc. From last few decades, it has been seen that data mining techniques have achieved good performance and accuracy in weather prediction than traditional statistical methods. This research work aims to compare the performance of few data mining algorithms for predicting rainfall using historical weather data of Srinagar, India, which is collected from http://www.wundergrounds.com website. From the collected weather data which comprises of 9 attributes, only 5 attributes which are most relevant to rainfall pre...

Rahul Bansode

IRJET Journal

As agriculture being the key point of survival, Rainfall is the important source for its cultivation. Rainfall prediction has always been a major problem as prediction of rainfall gives awareness to people and to know in advance about rain so as to take necessary precautions to protect their crops from rain. A particular dataset is taken from Kaggle community and this project predicts whether it will rain tomorrow or not by using the rainfall in dataset. CatBoost model is implemented in this project as it is an open sourced machine learning algorithm, and features great quality without the parameter tuning, categorical feature support, improved accuracy and fast prediction. CatBoost model is a gradient boosting toolkit and two critical algorithms classical and innovative are introduced to create a fight in prediction shift present in currently existing implementations of gradient boosting algorithms. CatBoost performed very well giving an AUC (Area under curve) score 0.8 and ROC (Receiver operating characteristic curve) score as 89. ROC is called as an evaluating curve whereas AUC presents a degree or measure of separability as the model is skilled enough to distinguish between classes. An Exploratory data analysis is done to examine data distribution, outliers and provides tools for visualizing and understanding the data through graphical representation. A dashboard is implemented to showcase the information that is represented in datasets i.e. any changes in the data will result in different types of graphs. A linear SVC (Support vector classifier) provides a best fit hyperplane that divides the data and feeds some features to the classifier to detect what the predicted class is and results in desired output.

In India, agriculture is the most important factor for survival of human being. For agriculture, the most important thing is water. i.e., rainfall. Nowadays rainfall prediction is a major problem. Predicting the amount of rainfall gives alertness to farmers by knowing early so that they can protect their crops and properties from rain. There are more techniques to predict the rainfall. The ML algorithms are best suited for prediction of rainfall. Here are some of the major ML algorithms used rapidly which are Auto regressive integrated moving average Model (ARIMA), Artificial neural network (ANN), Support Vector Machine, Logistic regression, and Self organizing map. And there are two models commonly used to predict periodic rainfall such as Linear and Non-linear models. ARIMA Model is the first used model. Although using ANN (Artificial neural network) the prediction of rain can easily completed by using Cascade NN, Layer recurrent network, or Back propagation NN. Artificial NN is similar as Biological neural networks.

RELATED PAPERS

IAEME PUBLICATION

IAEME Publication

Chaos: An Interdisciplinary Journal of Nonlinear Science

Mahendra Verma

Raharja Dangin

Current Psychology

Lauren Wadsworth

Oncology Reports

Daniele Santini

Dirk Janasek

Transplantation

Applied System Innovation

amir momenzadeh

chikezie waturuocha

Einav Aizikovitsh-Udi

American Journal of Mathematical and Management Sciences

LSU毕业证成绩单 LSU留信网认证

arXiv (Cornell University)

György Terdik

International Journal of Innovation and Applied Studies

Revista Latinoamericana De Estudios Del Trabajo

Vanesa Vazquez Laba

tty29981 tty29981

مجلة العربي

khalid touzani

UOIT留学生文凭证书成绩单办理 定做安大略理工大学毕业证成绩单

European Neuropsychopharmacology

Bruno MASTAIN

실시간카지노 토토사이트

RELATED TOPICS

  •   We're Hiring!
  •   Help Center
  • Find new research papers in:
  • Health Sciences
  • Earth Sciences
  • Cognitive Science
  • Mathematics
  • Computer Science
  • Academia ©2024

Prediction Of Rainfall Using Machine Learning

Ieee account.

  • Change Username/Password
  • Update Address

Purchase Details

  • Payment Options
  • Order History
  • View Purchased Documents

Profile Information

  • Communications Preferences
  • Profession and Education
  • Technical Interests
  • US & Canada: +1 800 678 4333
  • Worldwide: +1 732 981 0060
  • Contact & Support
  • About IEEE Xplore
  • Accessibility
  • Terms of Use
  • Nondiscrimination Policy
  • Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.

IMAGES

  1. Rain Prediction

    rainfall prediction using machine learning thesis

  2. Rainfall Prediction using Machine Learning

    rainfall prediction using machine learning thesis

  3. Rain prediction model using Machine Learning

    rainfall prediction using machine learning thesis

  4. Rainfall Prediction with Machine Learning

    rainfall prediction using machine learning thesis

  5. Python|Machine Learning|Rainfall Prediction-MyProjectBazaar

    rainfall prediction using machine learning thesis

  6. Multi-step rainfall forecasting using deep learning approach [PeerJ]

    rainfall prediction using machine learning thesis

VIDEO

  1. Technical seminar ♤Machine Learning based rainfall prediction ♤

  2. Rainfall Prediction using Machine Learning

  3. Rainfall Prediction Using ML: AI project for class 12th

  4. Weather Prediction Using Machine Learning

  5. Training Title- Data Science and Machine Learning || Project Title- Rainfall Prediction

  6. Rainfall Prediction using LogisticRegression Flask Web App

COMMENTS

  1. PDF Rainfall Prediction Using Machine Learning

    Rainfall plays a vital role. Approaches that are commonly used to predict rainfall. One option is to examine the large volumes of data accumulated over time to obtain information on future rains. Creating mathematical equations by defining various parameters and substituting values to achieve the result is the other.

  2. PDF Rainfall Prediction Using Machine Learning Techniques Gar a Thesis

    Zanyar Rzgar Ahmed: RAINFALL PREDICTION USING MACHINE LEARNING TECHNIQUES Approval of Director of Graduate School of Applied Sciences Prof. Dr. Nadire Cavus We certify this thesis is satisfactory for the award of the degree of Master of Science in Computer Engineering Examining Committee in Charge:

  3. Assessing rainfall prediction models: Exploring the advantages of

    Rainfall Prediction Using Machine Learning Algorithms for the Various Ecological Zones of Ghana. IEEE Access, 10 (2021), 10.1109/ACCESS.2021 ... Swapan Talukdar, X. Cuong Nguyen, Ali Najah Ahmed, and Duong Tran Anh. "Groundwater level prediction using machine learning algorithms in a drought-prone area." Neural Computing and Applications 34, no ...

  4. [PDF] Rainfall Prediction Using Machine Learning Algorithms for the

    The performance of the classification algorithms was examined based on precision, recall, f1-score, accuracy and execution time with various training and testing data ratios, and Decision Tree is consistently portrayed as the fastest, whereas MLP used the most run time. Accurate rainfall prediction has become very complicated in recent times due to climate change and variability. The ...

  5. Rainfall Prediction Using Machine Learning

    Rainfall prediction is crucial across various sectors, and this research examines the effectiveness of machine learning (ML) algorithms in forecasting rainfall occurrences using meteorological data. The study rigorously explores a comprehensive methodology encompassing data preprocessing, model building with various ML algorithms, and thorough evaluation methods. The dataset consists of a ...

  6. Rainfall Prediction Using an Ensemble Machine Learning Model Based on K

    Rahman et al. handled the rainfall prediction by using a machine learning fusion technique. The results of the machine learning models were given to another layer where fuzzy logic-based rules were applied for the final prediction. ... Ph.D. Thesis, University of Zagreb, Faculty of Organization and Informatics, Department of Information Systems ...

  7. Machine Learning-Based Rainfall Prediction: Unveiling Insights and

    Abstract: Rainfall prediction plays a crucial role in raising awareness about the potential dangers associated with rain and enabling individuals to take proactive measures for their safety. This study aims to utilize machine learning algorithms to accurately predict rainfall, considering the significant impact of scarcity or extreme rainfall on both rural and urban life.

  8. Rainfall prediction: A comparative analysis of modern machine learning

    4. Weather datasets. In this section, the climatic features included in the weather datasets are first described. Subsequently, the description of the Correlation Matrix analysis and the feature selection process carried out as part of the pre-processing procedure to prepare the time-series data for use in the training of the rainfall forecast models is given.

  9. Rainfall Prediction Using Machine Learning Techniques a Thesis

    Semantic Scholar extracted view of "RAINFALL PREDICTION USING MACHINE LEARNING TECHNIQUES A THESIS SUBMITTED TO THE GRADUATE SCHOOL OF APPLIED SCIENCES OF NEAR EAST UNIVERSITY By ZANYAR" by R. Ahmed et al.

  10. Machine Learning Algorithms for Advanced Rainfall Prediction

    Traditional rainfall forecasting systems have accuracy and lead time constraints. The rise of machine learning (ML) techniques provides a viable route for addressing these issues. This review paper examines current advancements in the application of machine learning for forecasting rainfall.

  11. Rainfall prediction using machine learning techniques

    Rainfall prediction using machine learning techniques. India is a farming nation and its economy is to a great extent dependent on rainforest creation. Downpour estimates are vital and fundamental for all ranchers to examine crop yields. Unsurprising rainfall is the capacity to foresee the climate with the assistance of science and innovation.

  12. Predicting Rainfall using Machine Learning Techniques

    Rainfall prediction is one of the challenging and uncertain tasks which has a signi cant impact on human society. Timely and accu-rate predictions can help to proactively reduce human and nancial loss. This study presents a set of experiments which involve the use of preva-lent machine learning techniques to build models to predict whether it

  13. (PDF) Rainfall Prediction Using Machine Learning

    Rainfall Prediction Using Machine Learning. Arnav Garg. Department of Co mputer Science Engineering, SRM IST, Kanchipuram Tamil Nadu. Himanshu Pandey. Department of Co mputer Science Engineering ...

  14. Experimental Analysis of Precipitation Forecasting Using Machine

    Oswal N (2019) Predicting rainfall using machine learning techniques. ArXiv. Google Scholar Parisi GI, Kemker R, Part JL, Kanan C, Wermter S (2019) Continual lifelong learning with neural networks: a review. ... Table 1.1 Analysis rainfall prediction using machine learning classification algorithms at the different split of training and testing ...

  15. PDF Rainfall Prediction Using Machine Learning Models: Literature Survey

    4 Rainfall Prediction Using Machine Learning Models: Literature Survey 77 Fig. 4.1 Pie chart showing proportions by publication year for papers in this review Google scholar was used to collect papers from 2016 to 2020, with the following key words: ("machine learning" OR "deep learning") AND ("precipitation predic-

  16. Rainfall Prediction Using Deep Learning and Machine Learning Techniques

    The conventional method of predicting rainfall has depended on historical information and meteorological observations. Yet, more precise and timely predictions of rainfall patterns are now possible thanks to the development of sophisticated computer models, remote sensing technologies, machine learning, and deep learning algorithms.

  17. Rainfall forecasting model using machine learning methods: Case study

    By adding this step into the ML, the proposed BDTR model gives the best accuracy in predicting the rainfall where the value of coefficient of determination in predicting daily rainfall it ranges between (0.5525075-0.9739693), and for weekly rainfall prediction it ranges between (0.8400668-0.989461), and for 10 days rainfall prediction it ranges ...

  18. PDF Rainfall Prediction System Using Machine Learning Fusion for Smart Cities

    A novel and real-time rainfall prediction system is proposed by this research for smart cities by using machine learning fusion. The proposed framework would extract the real-time feature-based weather data from highly sensitive and technologically advanced weather sensors for real-time rainfall prediction.

  19. Machine learning techniques to predict daily rainfall amount

    The machine learning algorithm called linear regression is used for predicting the rainfall using important atmospheric features by describing the relationship between atmospheric variables that affect the rainfall [13, 15].The correlation study is conducted [], and identified solar radiation, perceptible water vapor, and diurnal features are important variables for daily rainfall prediction ...

  20. Rainfall Forecast Using Machine Learning with High ...

    Increasing the accuracy of rainfall forecasts is crucial as an effort to prevent hydrometeorological disasters. Weather changes that can occur suddenly and in a local scope make fast and precise weather forecasts increasingly difficult to inform. Additionally, the results of the numerical weather model used by the Indonesia Agency for Meteorology, Climatology, and Geophysics are only able to ...

  21. Rainfall Prediction Using Machine Learning Models: Literature Survey

    This chapter performs a review on 66 research papers that use machine learning tools to predict rainfall. The papers are examined in terms of the source of the data, output objective, input ...

  22. Rainfall Prediction Using Machine Learning Models: Literature Survey

    With the advancement of computer technology, machine learning has been extensively used in the area of rainfall prediction. However, some papers suggest that applications of machine learning in different fields are deficient is some respects. This chapter performs a review on 66 research papers that use machine learning tools to predict rainfall.

  23. Rainfall Prediction using Machine Learning

    Finally, rainfall data is designated into various categories as per Indian Meteorological Department guidelines. In this paper we have come up with an approach for the prediction of rainfall using Machine Learning classification algorithms. The pre-processed data is segregated into 70% training and 30% for testing.

  24. Predicting Rainfall using Machine Learning Techniques

    Predicting Rainfall using Machine Learning. T echniques. Nikhil Oswal. School of Electrical Engineering and Computer Science (EECS), University of Otta wa, Ottawa, Canada. [email protected] ...

  25. Prediction Of Rainfall Using Machine Learning

    Rainfall Prediction is a challenging task due to irregular patterns of rainfall and climate variations all around the world. Rainfall forecasts helps to prevent floods and even helps in agriculture for growing crops. Machine learning aids in rainfall prediction by using unknown patterns from historical weather data. In this paper, classification technique of machine learning is used for ...

  26. Machine Learning Algorithms for Advanced Rainfall Prediction

    The study compared the performance of Naive Bayes, logistic regression, and support vector machine algorithms for sentiment analysis on the tweets and showed that all three algorithms performed well in classifying the sentiments, with support vector machines providing the highest accuracy.

  27. PDF Microsoft Word

    This thesis titled "Machine Learning Model on Rainfall - A Predictive Approach for Bangladesh" submitted by Yousuf Zaman, Student ID: 012102012, has been accepted as Satisfactory in fulfillment of the requirement for the degree of Master of Science in Computer Science and Engineering on February, 2018.