Time Series Forecasting with XGBoost and LightGBM: Predicting Energy Consumption Problem Preprocessing Training the Models Evaluation Preprocessing Weather Data Conclusion & Future Steps

In the only terms, time series forecasting is the technique of predicting future values based on previous historical data. Certainly one of the most popular fields where time series forecasting is utilized currently is within the cryptocurrencies market, where one desires to predict how prices in popular cryptos, like Bitcoin or Ethereum, will fluctuate in the subsequent few days and even longer periods of time. One other real world case is the energy consumption prediction. Especially within the contemporary world where energy is one in all the first points of dialogue, being able to accurately predicting the demand of energy consumption is an important tool for any electric power company. In this text, we are going to take a fast but practical take a look at how this is finished by incorporating Ensemble models reminiscent of extreme gradient boosting or XGBoost and lightweight gradient boosting or LGB models.

We are going to deal with the energy consumption problem, where given a sufficiently large dataset of the each day energy consumption of various households in a city, we’re tasked to predict as accurately as possible the longer term energy demands. For the needs of this tutorial, I’ve chosen the London Energy Dataset which accommodates the energy consumption of 5,567 randomly chosen households in the town of London, UK for the time period of November 2011 to February 2014. In a while, in an try to improve our predictions we mix this set with the London Weather Dataset with the intention to add weather related data in the method.

The very very first thing we now have to do in every project is to get an excellent understanding of the info and preprocess them if needed. To view the info with pandas we will do: