bernasiakk/README.md

Hi there 👋

👋 I’m Szymon Bernasiak, an aspiring Data Engineer with a passion for transforming raw data into actionable insights through efficient ETL pipelines and cloud technologies.

🚀 ETL Projects:
I have hands-on experience building scalable ETL solutions with Azure Data Factory, Databricks, and Synapse Analytics. My projects cover cloud migrations, data quality management, and automated reporting pipelines. You can explore them in the repositories below!

🔧 Tech Stack:

  • Cloud Platforms: Azure, Microsoft Fabric
  • Languages & Tools: PySpark, SQL, Power BI
  • Data Management: Lakehouse architecture, Blob Storage, Data Lake Storage Gen2

🏡 In addition to my professional interests, I enjoy working on home automation projects to make everyday life smarter and more connected.

💬 Feel free to reach out if you want to collaborate, discuss data engineering best practices, or simply talk about innovative data solutions.

My 3 latest projects

Earthquake API pipeline with Fabric

This project automates collecting and analyzing the latest earthquake events using the Earthquake API and Microsoft Fabric. The goal is to gather fresh information daily, clean the data, and display the results in a Power BI dashboard.
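The "clean the data" step could look roughly like the sketch below. It is only an illustration, not the code from the repository: it assumes the API returns GeoJSON-style features with `properties.mag`, `properties.place`, `properties.time` (epoch milliseconds) and `geometry.coordinates` (longitude, latitude, depth), and the function names `flatten_feature` and `clean_events` are hypothetical.

```python
from datetime import datetime, timezone


def flatten_feature(feature):
    """Turn one GeoJSON-style feature into a flat row, or None if it is unusable.

    The field names (mag, place, time, coordinates) are assumptions about the
    API response shape, not something confirmed by the project itself.
    """
    props = feature.get("properties") or {}
    coords = (feature.get("geometry") or {}).get("coordinates") or []
    # Skip events that lack a magnitude, a timestamp, or a position.
    if props.get("mag") is None or props.get("time") is None or len(coords) < 2:
        return None
    return {
        "id": feature.get("id"),
        "magnitude": float(props["mag"]),
        "place": (props.get("place") or "").strip(),
        # Assumed to be epoch milliseconds; stored as an ISO 8601 UTC string.
        "event_time": datetime.fromtimestamp(
            props["time"] / 1000, tz=timezone.utc
        ).isoformat(),
        "longitude": coords[0],
        "latitude": coords[1],
    }


def clean_events(features):
    """Flatten features, drop unusable ones, and de-duplicate by event id."""
    rows = {}
    for feature in features:
        row = flatten_feature(feature)
        if row is not None:
            rows[row["id"]] = row  # a later duplicate replaces an earlier one
    return sorted(rows.values(), key=lambda r: r["event_time"])
```

In a Fabric notebook the same logic would more likely be written with PySpark DataFrame operations; plain Python is used here so the sketch runs anywhere.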


Turning book-ratings.csv into a relational database

This project automates the process of:

  1. creating a relational database from a book-ratings.csv file,
  2. and appending new records to that database daily.
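The two steps above can be sketched with Python's built-in `sqlite3` module. This is a minimal illustration under stated assumptions, not the project's actual implementation: the column names (`user_id`, `isbn`, `title`, `rating`), the three-table schema, and the function `load_ratings` are all hypothetical, and SQLite stands in for whatever database the project really uses.

```python
import csv
import sqlite3

# Hypothetical normalized schema: one table per entity plus a ratings table.
SCHEMA = """
CREATE TABLE IF NOT EXISTS users (user_id INTEGER PRIMARY KEY);
CREATE TABLE IF NOT EXISTS books (isbn TEXT PRIMARY KEY, title TEXT NOT NULL);
CREATE TABLE IF NOT EXISTS ratings (
    user_id INTEGER NOT NULL REFERENCES users(user_id),
    isbn    TEXT    NOT NULL REFERENCES books(isbn),
    rating  INTEGER NOT NULL,
    PRIMARY KEY (user_id, isbn)
);
"""


def load_ratings(conn, csv_file):
    """Load CSV rows into the schema and return the number of new ratings.

    INSERT OR IGNORE makes the load idempotent, so the same function can
    serve both the initial build and the daily append.
    """
    conn.executescript(SCHEMA)
    before = conn.execute("SELECT COUNT(*) FROM ratings").fetchone()[0]
    with conn:  # one transaction per file: commit on success, roll back on error
        for row in csv.DictReader(csv_file):
            conn.execute(
                "INSERT OR IGNORE INTO users (user_id) VALUES (?)",
                (int(row["user_id"]),),
            )
            conn.execute(
                "INSERT OR IGNORE INTO books (isbn, title) VALUES (?, ?)",
                (row["isbn"], row["title"]),
            )
            conn.execute(
                "INSERT OR IGNORE INTO ratings (user_id, isbn, rating) "
                "VALUES (?, ?, ?)",
                (int(row["user_id"]), row["isbn"], int(row["rating"])),
            )
    after = conn.execute("SELECT COUNT(*) FROM ratings").fetchone()[0]
    return after - before
```

Calling `load_ratings(conn, open("book-ratings.csv", newline=""))` once builds the tables; calling it again with the next day's file adds only the rows that are not already present.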

Building daily news pipeline in Fabric

This project automates the process of collecting and analyzing the latest news using the Bing Web Search API and Microsoft Fabric. The goal is to gather fresh news daily, perform sentiment analysis on news articles, and display the results in a Power BI dashboard. The pipeline is built using Microsoft Fabric components, including Data Factory, Lakehouse, Jupyter Notebooks, and Power BI, ensuring smooth orchestration, processing, and visualization of news data. Alerts are configured to notify when new data is available, allowing for timely reviews.


Popular repositories

  1. OnPrem_DB_Migration_To_Azure

    Jupyter Notebook 1

  2. yelposphere

    Forked from itsadityagupta/yelposphere

    Yelp Data Processing Pipeline on GCP

    HCL

  3. Morning-Report

    Python

  4. Pipeline-for-Sales-Data-in-GCP

    Python

  5. Apartment-Sales-in-Poland--GCP-

    Python

  6. Energy-Data-in-Poland

    Python