- EPFL
- Lausanne
- @anvilarth
Stars
A Unified Tokenizer for Visual Generation and Understanding
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
Text and image to video generation: Kandinsky 4.0 (2024)
Implementation of Implicit Reparameterization Trick
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
Kandinsky x Deforum — generating short animations
Official implementation of "Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes"
Cutting-edge Python library designed for generative image augmentation!
Official inference repo for FLUX.1 models
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Awesome diffusion Video-to-Video (V2V). A collection of papers on diffusion model-based video editing, a.k.a. video-to-video (V2V) translation, plus video editing benchmark code.
Efficient DL/ML Models Seminars
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
The aim of this repository is to test and implement Flow-Matching-based models
Package for faster parallel removal of objects on Unix systems (parallel rm -rf)
Framework for processing and filtering datasets
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…
Collection of diffusion model papers categorized by their subareas
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
List of useful data augmentation resources. Here you will find some uncommon techniques, libraries, links to GitHub repos, papers, and more.
A neural network training framework within a task-based parallel programming paradigm
Efficient PScan implementation in PyTorch