CECALT (CEnter of meteorologiCAL Technology) is an integral project whose objective is to increase the sensitivity of hurricane prediction systems. It contains all the code and data needed to deploy an end-to-end machine learning project on a running CML instance.
Its primary goal is to build an ensemble learning model that combines the predictors of three base models:
- Random Forest Regressor
- XGBoost Regressor
- Neuronal Network Regressor
To forecast the wind speed based on geographical and meteorological conditions, like in the following example:
The ensembling model uses a voting strategy to make its predictions, where each base model has an equal weight in the final prediction.
This model is trained on a dataset of wind speed data and evaluated using MSE (Mean Squared Error), MAE (Mean Absolute Error), R2 score and RMSE (Root Mean Square Error) metrics.
The dataset used in this project comes from the National Hurricane Center and the Meteostat Project, with the aid of analysis and transformations.
The project is organized with the following folder structure:
.
├── app/ # Sources needed to launch the application
├── scripts/ # Scripts used for the creation and deploy of the end-to-end solution
├── src/ # All the prebuilt models and datasets necessary for the project
├── static/ # All images used in the project
├── .gitignore
├── .project-metadata.yaml
├── CECALT Hurricane Behavior Predictor_SOP.pdf
├── LICENSE.txt
├── README.md
├── cdsw-build.sh # This file is IMPORTANT for model deployment as additional libraries are installed here.
└── requirements.txt
The file CECALT Hurricane Behavior Predictor_SOP.pdf
contains a deeper walk-through of the project.
The ways of executing the project are the following ones:
-
As AMP - In a CML workspace, click "New Project", add a Project Name, select "AMPs" as the Initial Setup option, copy in the repo URL, click "Create Project", click "Configure Project"
-
Manual Setup - In a CML workspace, click "New Project", add a Project Name, select "Git" as the Initial Setup option, copy in the repo URL, click "Create Project". Then, follow the steps listed in this document.
In general, the project contains mechanisms to guarantee a safe execution but if anything happens, it would be desirable to check both the documentation and the technical configuration.
DO NOT remove cdsw-build.sh as it will cause troubles in the model deployment.