Welcome to the official GitHub repository for the Data analysis and visualisation in Python (2021) series by Carberra Tutorials!
This repository is designed purely as a supplementary aid to the series, and should NOT be downloaded without having watched it first.
You can browse the tags to view the code as it was after a specific episode.
- Python 3.7.1 or greater
- A Jupyter-compatible development environment
- Some data to analyse! If you're stuck for ideas, you can use these datasets.
Library | Required | Used in series |
---|---|---|
pandas | >= 1.2.0 | 1.2.3 |
seaborn | >= 0.11.0 | 0.11.1 |
scikit-learn | >= 0.20.0 | 0.24.1 |
openpyxl | >= 3.0.0 | 3.0.7 |
ipykernel* | >= 5.0.0 | 5.5.3 |
analytix** | >= 1.1.1 | N/A |
*This is only needed if you are using IPython notebooks. These are used in the series.

**This library was used to get the data being analysed, and does not feature in the series.
To install the necessary libraries, run one of the following commands:
# Windows
py -3.9 -m pip install pandas seaborn scikit-learn openpyxl ipykernel analytix
# Linux/macOS
python3.9 -m pip install pandas seaborn scikit-learn openpyxl ipykernel analytix
# In a virtual environment
pip install pandas seaborn scikit-learn openpyxl ipykernel analytix
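Once the install finishes, you may want to confirm that the installed versions meet the minimums in the table above. The following is a minimal sketch rather than part of the series: it assumes Python 3.8 or greater (for `importlib.metadata`), and the names `MINIMUMS`, `as_tuple`, and `check` are made up for illustration. The version comparison is deliberately rough and only looks at the leading numeric parts.

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Minimum versions copied from the requirements table in this README.
MINIMUMS = {
    "pandas": "1.2.0",
    "seaborn": "0.11.0",
    "scikit-learn": "0.20.0",
    "openpyxl": "3.0.0",
    "ipykernel": "5.0.0",
    "analytix": "1.1.1",
}


def as_tuple(ver):
    """Turn '1.2.3' (or '1.2.3rc1') into (1, 2, 3) for a rough comparison."""
    parts = []
    for piece in ver.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component, e.g. 'dev0'
        parts.append(int(match.group()))
    # Pad so that '1.2' compares equal to '1.2.0'.
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def check(minimums=MINIMUMS):
    """Return {name: status}, where status is 'ok', 'too old' or 'missing'."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            report[name] = "missing"
            continue
        is_new_enough = as_tuple(installed) >= as_tuple(minimum)
        report[name] = "ok" if is_new_enough else "too old"
    return report


if __name__ == "__main__":
    for name, status in check().items():
        print(f"{name}: {status}")
```

If a library shows as missing or too old, re-run the matching install command above, adding `--upgrade` after `install`.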
This repository is made available via the BSD 3-Clause License.
If you need help using this repository, watch the series. If you still need help beyond that, join the Carberra Tutorials Discord server.