Have you compared with open-sora's video wav encoder decoder? #2

Open
chenerg opened this issue Nov 7, 2024 · 1 comment

Comments

chenerg commented Nov 7, 2024

Open-Sora-Plan also provides a video encoder-decoder called WF-VAE, which decomposes video into sub-bands using wavelet transforms. Details can be found at https://github.com/PKU-YuanGroup/Open-Sora-Plan/blob/main/docs/Report-v1.3.0.md.
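
For readers unfamiliar with the idea, here is a minimal sketch of what "decomposing video into sub-bands" can look like, assuming a single-level Haar wavelet applied along time, height, and width. This is only an illustration: the function names `haar_split` and `haar_3d_subbands` are hypothetical, the choice of the Haar wavelet is an assumption, and none of this is taken from the WF-VAE code.

```python
import numpy as np

def haar_split(x, axis):
    """One-level orthonormal Haar split along `axis`; returns (low, high)."""
    even = np.take(x, np.arange(0, x.shape[axis], 2), axis=axis)
    odd = np.take(x, np.arange(1, x.shape[axis], 2), axis=axis)
    return (even + odd) / np.sqrt(2.0), (even - odd) / np.sqrt(2.0)

def haar_3d_subbands(video):
    """Split a (T, H, W) array into 8 sub-bands keyed 'LLL' ... 'HHH'.

    Assumes every dimension is even (illustrative simplification).
    Key letters are ordered as (time, height, width).
    """
    bands = {"": video}
    for axis in range(3):
        next_bands = {}
        for name, band in bands.items():
            low, high = haar_split(band, axis)
            next_bands[name + "L"] = low   # local averages along this axis
            next_bands[name + "H"] = high  # local differences along this axis
        bands = next_bands
    return bands

# Toy "video": 8 frames of 16x16 random values.
video = np.random.default_rng(0).random((8, 16, 16))
bands = haar_3d_subbands(video)
```

Each sub-band has half the resolution along every axis, and `bands["LLL"]` holds the low-frequency content in both space and time. Because the Haar split used here is orthonormal, the total energy (sum of squares) across the eight sub-bands equals that of the input.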

qqingzheng commented Nov 7, 2024

Only the distilled weights of WF-VAE have been released so far, not the directly trained weights. By the way, I'm really thrilled to see outstanding work like the Cosmos tokenizer being open-sourced!
