This data-driven project provides valuable insights and predictions regarding graduate salaries, benefiting both recent graduates and employers in a competitive job market.
Graduates often struggle to negotiate fair salaries for their first jobs, while employers find it challenging to determine compensation based on factors like education and experience. This project aims to develop predictive models and uncover essential insights to estimate engineering graduate salaries effectively.
-
Data Analysis:
- Conduct exploratory data analysis (EDA) to understand relationships between variables and graduate salaries.
- Identify patterns and trends within the dataset.
-
Feature Engineering:
- Engineer relevant features impacting salary predictions.
- Handle categorical variables, missing data, and create new features.
-
Machine Learning Models:
- Train and evaluate a variety of regression models to predict engineering graduate salaries.
-
Hyperparameter Tuning:
- Fine-tune models by optimizing hyperparameters for improved accuracy.
-
Model Evaluation:
- Assess model performance using metrics such as R2 score, Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE).
-
Interpretability:
- Provide insights into features with significant impacts on salary predictions, aiding graduates and employers in understanding salary determinants.
-
- Notebook 1.
- Data preprocessing.
- Deriving insights from data visualizations.
-
- Notebook 2.
- Machine learning model construction.
- Feature engineering.
- Trained and tested on major regression models.
- Various accuracy metrics.
- Hyperparameter tuning for enhanced accuracy.
-
- Describes dataset features.
-
Engineering_graduate_salary.csv
:- Contains engineers' records and salary data.
-
- Output of
Analysis.ipynb
. - Used for model building.
- Output of
Made by Hrishikesh Reddy Papasani
Connect on LinkedIn: LinkedIn Profile
- Dataset Source: Kaggle
Made by Hrishikesh Reddy Papasani
Connect on LinkedIn: LinkedIn Profile
Contact at [email protected]