I'm Szymon Bernasiak, an aspiring Data Engineer with a passion for transforming raw data into actionable insights through efficient ETL pipelines and cloud technologies.
ETL Projects:
I have hands-on experience building scalable ETL solutions, utilizing tools such as Azure Data Factory, Databricks, and Synapse Analytics. My projects involve cloud migrations, data quality management, and automating reporting pipelines. You can explore them in the repositories below!
Tech Stack:
- Cloud Platforms: Azure, Microsoft Fabric
- Languages & Tools: PySpark, SQL, Power BI
- Data Management: Lakehouse architecture, Blob Storage, Data Lake Storage Gen2
In addition to my professional interests, I enjoy working on home automation projects to make everyday life smarter and more connected.
Feel free to reach out if you want to collaborate, discuss data engineering best practices, or simply talk about innovative data solutions.
This project automates collecting and analyzing the latest earthquake events using the Earthquake API and Microsoft Fabric. The goal is to gather fresh data daily, clean it, and display the results in a Power BI dashboard.
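The cleaning step might look roughly like the sketch below. It is not the project's actual code: the field names `id`, `mag`, `place`, and `time_ms` are hypothetical stand-ins for whatever the API returns, and plain Python is used here instead of a Fabric notebook so the example stays self-contained.

```python
from datetime import datetime, timezone


def clean_events(raw_events):
    """Drop incomplete or duplicate events and normalise the timestamp.

    Assumes each event is a dict with the hypothetical keys
    "id", "mag", "place", and "time_ms" (epoch milliseconds).
    """
    seen_ids = set()
    cleaned = []
    for event in raw_events:
        event_id = event.get("id")
        magnitude = event.get("mag")
        time_ms = event.get("time_ms")
        # Skip records that are missing a key field.
        if event_id is None or magnitude is None or time_ms is None:
            continue
        # Skip events that were already seen in this batch.
        if event_id in seen_ids:
            continue
        seen_ids.add(event_id)
        cleaned.append({
            "id": event_id,
            "magnitude": float(magnitude),
            "place": (event.get("place") or "unknown").strip(),
            # Convert epoch milliseconds to an ISO 8601 UTC string.
            "occurred_at": datetime.fromtimestamp(
                time_ms / 1000, tz=timezone.utc
            ).isoformat(),
        })
    return cleaned
```

Keeping the cleaning logic in one pure function makes it easy to test on a handful of sample records before scheduling it to run daily.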
This project automates:
- creating a relational database from a `book-ratings.csv` file,
- appending new records to that database daily.
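A minimal sketch of the two steps above, under stated assumptions: SQLite stands in for the database engine (the README does not name one), and the columns `user_id`, `isbn`, and `rating` are hypothetical, since the real layout of the CSV file is not described here.

```python
import csv
import sqlite3


def append_ratings(conn, csv_path):
    """Create the ratings table if needed and append rows not yet stored.

    Returns the number of newly inserted rows. Column names are
    assumptions made for this illustration.
    """
    conn.execute(
        """CREATE TABLE IF NOT EXISTS ratings (
               user_id TEXT NOT NULL,
               isbn    TEXT NOT NULL,
               rating  INTEGER NOT NULL,
               PRIMARY KEY (user_id, isbn)
           )"""
    )
    before = conn.execute("SELECT COUNT(*) FROM ratings").fetchone()[0]
    with open(csv_path, newline="", encoding="utf-8") as handle:
        rows = [
            (row["user_id"], row["isbn"], int(row["rating"]))
            for row in csv.DictReader(handle)
        ]
    # INSERT OR IGNORE skips rows whose primary key already exists,
    # so running the job again only appends new records.
    conn.executemany("INSERT OR IGNORE INTO ratings VALUES (?, ?, ?)", rows)
    conn.commit()
    after = conn.execute("SELECT COUNT(*) FROM ratings").fetchone()[0]
    return after - before
```

Because the insert ignores rows that are already present, the same function can serve both the initial load and the daily append.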
This project automates the process of collecting and analyzing the latest news using the Bing Web Search API and Microsoft Fabric. The goal is to gather fresh news daily, perform sentiment analysis on news articles, and display the results in a Power BI dashboard. The pipeline is built using Microsoft Fabric components, including Data Factory, Lakehouse, Jupyter Notebooks, and Power BI, ensuring smooth orchestration, processing, and visualization of news data. Alerts are configured to notify when new data is available, allowing for timely reviews.
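To illustrate what the sentiment step produces, here is a deliberately simple stand-in: a word-list scorer that labels a headline as positive, negative, or neutral. This is not the method used in the pipeline, which is not described here; the word lists and the function name `label_sentiment` are made up for the example.

```python
# Tiny illustrative word lists; a real pipeline would use a trained model.
POSITIVE = {"gain", "growth", "win", "record", "success"}
NEGATIVE = {"loss", "crash", "fail", "crisis", "decline"}


def label_sentiment(headline):
    """Return "positive", "negative", or "neutral" for a headline."""
    # Lowercase each word and strip surrounding punctuation.
    words = [w.strip(".,!?:;\"'").lower() for w in headline.split()]
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"
```

A label column like this is the kind of field a Power BI dashboard can group and count by day.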