Name		Name	Last commit message	Last commit date
parent directory ..
241-riffusion-text-to-music.ipynb		241-riffusion-text-to-music.ipynb
241-riffusion-text-to-music.png		241-riffusion-text-to-music.png
README.md		README.md

README.md

Text-to-Music generation using Riffusion and OpenVINO

Riffusion is a latent text-to-image diffusion model capable of generating spectrogram images given any text input. These spectrograms can be converted into audio clips. General diffusion models are machine learning systems that are trained to denoise random gaussian noise step by step, to get to a sample of interest, such as an image. Diffusion models have shown to achieve state-of-the-art results for generating image data. But one downside of diffusion models is that the reverse denoising process is slow. In addition, these models consume a lot of memory because they operate in pixel space, which becomes unreasonably expensive when generating high-resolution images. Therefore, it is challenging to train these models and also use them for inference. OpenVINO brings capabilities to run model inference on Intel hardware and opens the door to the fantastic world of diffusion models for everyone!

In this tutorial, we consider how to run an text-to-music generation pipeline using Riffusion and OpenVINO. We will use a pre-trained model from the Diffusers library. To simplify the user experience, the Hugging Face Optimum Intel library is used to convert the models to OpenVINO™ IR format.

The complete pipeline of this demo is shown below.

Notebook Contents

This notebook demonstrates how to convert and run riffusion using OpenVINO.

The tutorial consists of the following steps:

Install prerequisites
Download and convert the model from a public source using the OpenVINO integration with Hugging Face Optimum.
Create an text-to-music inference pipeline
Run inference pipeline

This notebook provides interactive interface, where user can insert own musical input prompt and model will generate spectrogram image and sound guided by provided input. The result of demo work illustrated on image below.

Installation Instructions

If you have not installed all required dependencies, follow the Installation Guide.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

241-riffusion-text-to-music

241-riffusion-text-to-music

README.md

Text-to-Music generation using Riffusion and OpenVINO

Notebook Contents

Installation Instructions

Files

241-riffusion-text-to-music

Directory actions

More options

Directory actions

More options

Latest commit

History

241-riffusion-text-to-music

Folders and files

parent directory

README.md

Text-to-Music generation using Riffusion and OpenVINO

Notebook Contents

Installation Instructions