jgorostegui/best-of-mlops-tools

best-of-mlops-tools

πŸ†Β  A ranked list of awesome projects. Updated weekly.

This curated list contains 24 awesome open-source projects with a total of 280K stars grouped into 9 categories. All projects are ranked by a project-quality score, which is calculated from various metrics automatically collected from GitHub and different package managers. If you would like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!


🧙‍♂️  Discover other best-of lists or create your own.


Explanation

  • 🥇🥈🥉  Combined project-quality score
  • ⭐️  Star count from GitHub
  • 🐣  New project (less than 6 months old)
  • 💤  Inactive project (6 months no activity)
  • 💀  Dead project (12 months no activity)
  • 📈📉  Project is trending up or down
  • ➕  Project was recently added
  • ❗️  Warning (e.g. missing/risky license)
  • 👨‍💻  Contributors count from GitHub
  • 🔀  Fork count from GitHub
  • 📋  Issue count from GitHub
  • ⏱️  Last update timestamp on package manager
  • 📥  Download count from package manager
  • 📦  Number of dependent projects
  • Python projects
  • C/C++ projects
  • Uses/supports TensorFlow
  • Uses/supports PyTorch
  • Uses/supports ONNX (Open Neural Network Exchange)

Data Preprocessing Tools

Tools for data cleaning, transformation, and enrichment.


Data visualization tools

Tools for creating high-level visualizations.

Gradio (🥇36 · ⭐ 25K) - Build and share delightful machine learning apps, all in Python... Apache-2
  • GitHub (👨‍💻 240 · 🔀 1.8K · 📦 21K · 📋 3.5K - 12% open · ⏱️ 11.01.2024):

     git clone https://github.com/gradio-app/gradio
    
Dash (🥈32 · ⭐ 20K · 📉) - Data Apps & Dashboards for Python. No JavaScript Required. MIT
  • GitHub (👨‍💻 140 · 🔀 1.9K · 📥 53 · 📦 57K · 📋 1.6K - 47% open · ⏱️ 09.01.2024):

     git clone https://github.com/plotly/dash
    
Streamlit (🥉29 · ⭐ 30K) - Streamlit: A faster way to build and share data apps. Apache-2
  • GitHub (👨‍💻 220 · 🔀 2.6K · 📋 3.8K - 17% open · ⏱️ 11.01.2024):

     git clone https://github.com/streamlit/streamlit
    
Show 1 hidden projects...
  • voila (🥉28 · ⭐ 5K) - Voilà turns Jupyter notebooks into standalone web applications. ❗Unlicensed

Data labelling tools

Tools for data labelling.

Show 1 hidden projects...
  • Labelimg (🥇28 · ⭐ 21K · 💀) - LabelImg is now part of the Label Studio community. The popular image.. MIT

Data pipelining tools for machine learning

Frameworks that allow the creation, management, and optimization of complex data pipelines in a machine learning environment.

Airflow (🥇43 · ⭐ 33K) - Apache Airflow - A platform to programmatically author, schedule, and.. Apache-2
  • GitHub (👨‍💻 3.1K · 🔀 13K · 📥 580K · 📦 9.1K · 📋 8.4K - 8% open · ⏱️ 11.01.2024):

     git clone https://github.com/apache/airflow
    
Argo (🥈39 · ⭐ 14K) - Workflow Engine for Kubernetes. Apache-2
  • GitHub (👨‍💻 830 · 🔀 2.9K · 📥 3.4M · 📦 230 · 📋 5.5K - 16% open · ⏱️ 11.01.2024):

     git clone https://github.com/argoproj/argo-workflows
    
dagster (🥈35 · ⭐ 9.4K) - An orchestration platform for the development, production, and.. Apache-2
  • GitHub (👨‍💻 380 · 🔀 1.1K · 📦 1.8K · 📋 6.4K - 29% open · ⏱️ 11.01.2024):

     git clone https://github.com/dagster-io/dagster
    
Kubeflow (Kubeflow Pipelines) (🥉33 · ⭐ 13K) - Machine Learning Toolkit for Kubernetes. Apache-2
  • GitHub (👨‍💻 300 · 🔀 2.1K · 📥 79K · 📦 41 · 📋 3.8K - 5% open · ⏱️ 19.12.2023):

     git clone https://github.com/kubeflow/kubeflow
    
Luigi (🥉32 · ⭐ 17K · 📈) - Luigi is a Python module that helps you build complex pipelines of.. Apache-2
  • GitHub (👨‍💻 610 · 🔀 2.3K · 📦 2.3K · 📋 980 - 9% open · ⏱️ 08.01.2024):

     git clone https://github.com/spotify/luigi
    
Pachyderm (🥉31 · ⭐ 6K) - Data-Centric Pipelines and Data Versioning. Apache-2
  • GitHub (👨‍💻 190 · 🔀 560 · 📥 26K · 📋 3.1K - 22% open · ⏱️ 11.01.2024):

     git clone https://github.com/pachyderm/pachyderm
    
Metaflow (🥉27 · ⭐ 7.3K · 📉) - Build and manage real-life data science projects with ease!. Apache-2
  • GitHub (👨‍💻 77 · 🔀 650 · 📦 580 · 📋 590 - 41% open · ⏱️ 11.01.2024):

     git clone https://github.com/Netflix/metaflow
    

Data versioning tools

Tools for data (and model) versioning.

dvc (Data Version Control) (🥇38 · ⭐ 13K) - ML Experiments Management with Git. Apache-2
  • GitHub (👨‍💻 290 · 🔀 1.1K · 📥 45K · 📦 12K · 📋 4.6K - 11% open · ⏱️ 11.01.2024):

     git clone https://github.com/iterative/dvc
    

MLOps end-to-end platforms

MLOps platforms that cover the entire machine learning lifecycle.

Kubeflow (🥇33 · ⭐ 13K) - Machine Learning Toolkit for Kubernetes. Apache-2
  • GitHub (👨‍💻 300 · 🔀 2.1K · 📥 79K · 📦 41 · 📋 3.8K - 5% open · ⏱️ 19.12.2023):

     git clone https://github.com/kubeflow/kubeflow
    

MLOps experimentation tracking tools

Tools for managing machine learning experiments and workflows.

MLFlow (Tracking) (🥇37 · ⭐ 16K) - Open source platform for the machine learning lifecycle. Apache-2
  • GitHub (👨‍💻 680 · 🔀 3.6K · 📦 29K · 📋 3.4K - 31% open · ⏱️ 11.01.2024):

     git clone https://github.com/mlflow/mlflow
    
ClearML (🥉30 · ⭐ 5K) - ClearML - Auto-Magical CI/CD to streamline your ML workflow... Apache-2
  • GitHub (👨‍💻 88 · 🔀 610 · 📥 1.9K · 📦 890 · 📋 930 - 44% open · ⏱️ 10.01.2024):

     git clone https://github.com/allegroai/clearml
    
sacred (🥉26 · ⭐ 4.1K) - Sacred is a tool to help you configure, organize, log and reproduce.. MIT
  • GitHub (👨‍💻 100 · 🔀 360 · 📦 2.8K · 📋 560 - 17% open · ⏱️ 13.11.2023):

     git clone https://github.com/IDSIA/sacred
    

Model Serving

Tools for model serving.

KServe (🥇34 · ⭐ 2.7K) - Standardized Serverless ML Inference Platform on.. Apache-2
  • GitHub (👨‍💻 220 · 🔀 830 · 📥 870K · 📦 300 · 📋 1.6K - 19% open · ⏱️ 10.01.2024):

     git clone https://github.com/kserve/kserve
    
TensorFlow Serving (🥈32 · ⭐ 6K) - A flexible, high-performance serving system for.. Apache-2
  • GitHub (👨‍💻 220 · 🔀 2.1K · 📦 22 · 📋 1.4K - 3% open · ⏱️ 11.01.2024):

     git clone https://github.com/tensorflow/serving
    
BentoML (🥈31 · ⭐ 6.1K) - Build Production-Grade AI Applications. Apache-2
  • GitHub (👨‍💻 190 · 🔀 680 · 📥 2.1K · 📦 1.5K · 📋 970 - 18% open · ⏱️ 11.01.2024):

     git clone https://github.com/bentoml/BentoML
    
TorchServe (🥈31 · ⭐ 3.8K) - Serve, optimize and scale PyTorch models in production. Apache-2
  • GitHub (👨‍💻 180 · 🔀 780 · 📥 5.5K · 📦 580 · 📋 1.5K - 20% open · ⏱️ 09.01.2024):

     git clone https://github.com/pytorch/serve
    
Triton Inference Server (🥉29 · ⭐ 6.7K) - The Triton Inference Server provides an optimized.. BSD-3
  • GitHub (👨‍💻 110 · 🔀 1.3K · 📥 210K · 📋 3.3K - 10% open · ⏱️ 11.01.2024):

     git clone https://github.com/triton-inference-server/server
    
Seldon (🥉28 · ⭐ 4.1K · 📈) - An MLOps framework to package, deploy, monitor and.. Apache-2
  • GitHub (👨‍💻 200 · 🔀 790 · 📥 810 · 📦 530 · 📋 2.3K - 4% open · ⏱️ 09.10.2023):

     git clone https://github.com/SeldonIO/seldon-core
    

Model Training Tools

Tools for training machine learning models.

TFX (🥇25 · ⭐ 2K) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
  • GitHub (👨‍💻 180 · 🔀 670 · 📋 880 - 3% open · ⏱️ 11.01.2024):

     git clone https://github.com/tensorflow/tfx
    

Related Resources

Contribution

Contributions are encouraged and always welcome! If you would like to add or update projects, choose one of the following ways:

  • Open an issue by selecting one of the provided categories on the issue page and filling in the requested information.
  • Modify the projects.yaml with your additions or changes, and submit a pull request. This can also be done directly via the GitHub UI.
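
As a rough illustration of the projects.yaml route, the sketch below appends one entry from the command line. It assumes the field names commonly used by best-of lists (`name`, `github_id`, `category`) and that the project list is the last section of the file; the project name, repository id, and the `model-serving` category id are hypothetical placeholders, so check the existing entries and the contribution guidelines before submitting.

```shell
# Append a new project entry to projects.yaml (run from the repository root).
# All values are placeholders: "Example Tool", "example-org/example-tool",
# and the "model-serving" category id are hypothetical.
cat >> projects.yaml <<'EOF'
- name: Example Tool
  github_id: example-org/example-tool
  category: model-serving
EOF

# Review the appended entry before committing and opening a pull request.
tail -n 3 projects.yaml
```

If the project list is not at the end of the file, paste the three-line entry into the project list by hand instead of appending it.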

If you would like to contribute to or share suggestions regarding the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you would like to create your own best-of list, we recommend following this guide.

For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.

License

CC0