Sparkfish

Spark-free Python utilities for Microsoft Fabric focused on Data Engineering using Polars and delta-rs

Python 18 4 Updated Dec 29, 2024

😎 A curated list of awesome DataOps tools

Python 173 25 Updated Oct 14, 2024

⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

Python 2,016 223 Updated Feb 16, 2025

A light-weight, flexible, and expressive statistical data testing library

Python 3,626 321 Updated Feb 18, 2025

Semantic Functions for Semantic Link

Python 12 6 Updated Jan 23, 2025

Maestro: Netflix’s Workflow Orchestrator

Java 3,393 205 Updated Feb 17, 2025

Qubole Sparklens tool for performance tuning Apache Spark

Scala 570 140 Updated Jun 26, 2024

Samples on how to use Azure SQL database with Azure OpenAI

TSQL 79 34 Updated Nov 8, 2024

A Python framework for defining and querying BI models in your data warehouse

Python 164 6 Updated Jan 15, 2025

Display paginated content in the browser and generate print books using web technology

HTML 888 98 Updated Oct 4, 2024

Integration tests for dbt

Makefile 12 2 Updated Aug 26, 2023

Bloat-free, no BS cloud storage SDK.

C# 172 17 Updated Jan 31, 2025

Exposes the Windows Process creation Win32 functions in PowerShell

PowerShell 51 5 Updated Jan 4, 2025

Invoke Command As System/Interactive/GMSA/User on Local/Remote machine & returns PSObjects.

PowerShell 463 70 Updated May 5, 2023

No-code in the front, Python in the back. An open-source framework for creating data apps.

Python 1,375 82 Updated Feb 12, 2025

Yet another googlesearch - A Python library for executing intelligent, realistic-looking, and tunable Google searches.

Python 269 46 Updated Apr 7, 2024

🔥 Blazing fast bulk data transfers between any cloud 🔥

Python 1,119 63 Updated May 11, 2024

Free and open source schema versioning and database migration made natively with .NET/6. NEW THIS MAY 2022! v1.3.15 released!

C# 418 65 Updated Jul 25, 2024

All image quality metrics you need in one package.

Python 597 71 Updated Oct 4, 2023

Augraphy: Creating Realistic Document Image Datasets with Data Augmentation

Jupyter Notebook 6 4 Updated Aug 21, 2023

Python 3 2 Updated Dec 20, 2024

An SVG rendering library.

Rust 2,993 239 Updated Feb 5, 2025

Fully managed Apache Parquet implementation

C# 687 155 Updated Feb 6, 2025

2D/3D renderer - makes it simple to draw stuff across platforms (including web)

Rust 1,389 113 Updated Jan 30, 2025

Make writing easier!

Python 81 5 Updated Aug 15, 2021

A high-performance SVG renderer and toolkit, powered by Rust based resvg and napi-rs.

TypeScript 1,655 64 Updated Feb 12, 2025

ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to o…

Jupyter Notebook 56 6 Updated Feb 18, 2025

Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets

Python 109 3 Updated Dec 4, 2024

Introducing the most comprehensive and up-to-date open source dataset on US car models on Github. With over 15,000 entries covering car models manufactured between 1992 and 2023, this repository of…

449 166 Updated May 5, 2024

The state-of-the-art image restoration model without nonlinear activation functions.

Python 2,369 302 Updated Jul 3, 2024