Highlights
Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Implementing best practices for PySpark ETL jobs and applications.
Implementing best practices for PySpark ETL jobs and applications.
Pyspark RDD, DataFrame and Dataset Examples in Python language
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.
An Awesome List of Open-Source Data Engineering Projects
arengel / rustlings
Forked from rust-lang/rustlings🦀 Small exercises to get you used to reading and writing Rust code!
pgAdmin is the most popular and feature rich Open Source administration and development platform for PostgreSQL, the most advanced Open Source database in the world.