Skip to content

This GitHub repository serves as a comprehensive platform for managing and showcasing my data engineering projects and assessments throughout my final semester at Alt School Africa. Designed to foster collaboration, organization, and continuous improvement, this repository is the backbone of my academic journey in data engineering.

License

Notifications You must be signed in to change notification settings

victorcezeh/data-engineering-final-semester-portfolio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

45 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

final-semester

Data Engineering Final Semester Portfolio

Welcome to my Data Engineering final Semester Portfolio Repository! This repository serves as a centralized platform for managing and showcasing my data engineering projects throughout the semester.

Table of Contents

  1. About
  2. Projects
  3. Contributing
  4. Contact

About

In this repository, you'll find a collection of projects and assessments that demonstrate my proficiency in various aspects of data engineering. Each project is designed to challenge and enhance my skills, covering topics such as data pipelines, ETL processes, Cloud, big data technologies, and more.

Projects

  • Project 1: postgres_docker_init - This project sets up and tests PostgreSQL infrastructure using Docker and Docker Compose. It involves creating a Dockerized PostgreSQL server, loading data from a CSV file, and writing Python scripts to interact with the database. Includes detailed README documentation.
  • Project 2: py_gcs_bq - This Python project enables seamless interaction with Google Cloud Storage (GCS) and BigQuery. It supports loading CSV files from a local machine into BigQuery and fetching API data to store in GCS, which is then loaded into BigQuery. The code is designed to be idempotent, reusable, and well-documented, with secrets managed via a .env file and constants through config.py. Includes detailed README documentation.

Contributing

Collaboration is key to success in data engineering. If you have any suggestions, enhancements, or additional projects to contribute, feel free to fork this repository, make your changes, and submit a pull request. Your contributions are highly appreciated!

Contact

If you have any questions, feedback, or concerns, don't hesitate to reach out:

About

This GitHub repository serves as a comprehensive platform for managing and showcasing my data engineering projects and assessments throughout my final semester at Alt School Africa. Designed to foster collaboration, organization, and continuous improvement, this repository is the backbone of my academic journey in data engineering.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages