Contributors: Lauren Zung
A mini project completed during the Fall 2022 session of DSCI 573 (Feature and Model Selection).
For this regression problem, I chose to optimize LGBMRegressor
returned a score of 0.349. I have also computed a root mean squared error of 1.392, therefore the model predictions are approximately off by 1.392 reviews per month.
Score | |
---|---|
Train |
0.439 |
Train RMSE | 1.223 |
Test |
0.349 |
Test RMSE | 1.392 |
To replicate this analysis, you will first need to clone the repo. You can do so with the following command line (terminal) commands.
# clone the repo
git clone https://github.com/lzung/airbnb_reviews.git
# change working directory to the root of the repository
cd airbnb_reviews
First, create and activate the required virtual environment with conda at the command line as follows:
conda env create -f environment.yaml
conda activate airbnb_reviews
Then, run the following command at the command line (terminal) to reset the repository to a clean state, with no intermediate or results files:
make clean
Finally, run the following command to replicate the analysis:
make all