Implement whisper-timestamped #10

Open
villesau opened this issue Oct 1, 2024 · 6 comments
Comments

@villesau

villesau commented Oct 1, 2024

Hi, https://github.com/linto-ai/whisper-timestamped seems like an interesting approach for accurate timestamps, and apparently it would not have problems with numerics and so on. Would it be a big effort to implement a Replicate endpoint for that too?

@Huanshere

I fully tested and compared these two methods in my project VideoLingo, and I gotta say the timestamps from WhisperX are way more stable than those from whisper-timestamped. It addresses Whisper's inherent hallucination issue through forced alignment.

@villesau
Author

villesau commented Oct 10, 2024

Yep, I noticed the same in the end: whisper-timestamped was very far from accurate timestamps. https://github.com/jianfch/stable-ts seems better than that at least. I didn't test it against WhisperX yet, but it does not suffer from the numerics problem that WhisperX suffers from, and it is way better than whisper-timestamped.
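For readers unfamiliar with the "numerics problem": the thread does not spell it out, but as far as I understand it refers to the limitation noted in the WhisperX README, where words containing no characters from the alignment model's dictionary (for example "2014." or "£13.60") cannot be aligned and so get no timing. The sketch below only illustrates that idea in plain Python. The vocabulary `ALIGN_VOCAB` and the helper `split_alignable` are hypothetical names made up for this example, not part of WhisperX.

```python
import string

# Assumption: a character-level alignment vocabulary containing only
# lowercase letters and the apostrophe, with no digits or currency symbols.
ALIGN_VOCAB = set(string.ascii_lowercase + "'")


def split_alignable(transcript, vocab=ALIGN_VOCAB):
    """Split words into those a character-level aligner could time and those it could not.

    A word counts as alignable if at least one of its characters is in the
    vocabulary; words made up purely of digits or symbols are skipped.
    """
    alignable, skipped = [], []
    for word in transcript.split():
        if any(ch in vocab for ch in word.lower()):
            alignable.append(word)
        else:
            skipped.append(word)
    return alignable, skipped


if __name__ == "__main__":
    ok, lost = split_alignable("In 2014 it cost £13.60 total")
    print("alignable:", ok)
    print("no timing:", lost)
```

Under this assumption, "2014" and "£13.60" end up in the skipped list, which is the kind of gap in word timestamps being discussed here.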

@Huanshere

Thanks for sharing, stable-ts looks so gooood and it deserves 100k stars! It shows how important it is to name your project in an SEO-friendly way ahaha. I'll test it out right away.

@villesau
Author

Yep, it definitely wasn't the first option I found either :) I found it very randomly, actually.

@Huanshere

Tested, it's just so perfect, I can't ask for more... What surprised me is that it doesn't need a wav2vec model specific to a single language to perform the forced alignment, which makes it super fast and super light. I will definitely replace WhisperX with stable-ts in my project ahaha. Unfortunately, stable-ts on Replicate is not up to date, so I may need to package one myself. Thanks again for sharing this 👍
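For anyone wanting to try the same thing locally, here is a minimal sketch of transcribing a file with stable-ts and writing word-timed subtitles. It assumes `stable-ts` is installed (`pip install stable-ts`) and relies on the `stable_whisper.load_model`, `model.transcribe`, and `result.to_srt_vtt` calls shown in the stable-ts README. The helper names `srt_path_for` and `transcribe_to_srt`, the `"base"` model size, and the file name are illustrative assumptions, not something from this thread.

```python
from pathlib import Path


def srt_path_for(audio_path: str) -> str:
    """Derive the subtitle path by swapping the audio extension for .srt."""
    return str(Path(audio_path).with_suffix(".srt"))


def transcribe_to_srt(audio_path: str, model_name: str = "base") -> str:
    """Transcribe audio with stable-ts and save an SRT file next to it.

    The import is done lazily so this module can be loaded even when
    stable-ts is not installed.
    """
    import stable_whisper  # provided by the stable-ts package

    model = stable_whisper.load_model(model_name)  # loads a Whisper model
    result = model.transcribe(audio_path)          # transcription with adjusted timestamps
    out_path = srt_path_for(audio_path)
    result.to_srt_vtt(out_path)                    # format is chosen from the file extension
    return out_path


if __name__ == "__main__":
    # "talk.mp3" is a placeholder file name.
    print(transcribe_to_srt("talk.mp3"))
```

No per-language alignment model is downloaded in this flow, which matches the observation above about not needing a language-specific wav2vec model.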

@flazouh

flazouh commented Dec 26, 2024

Is it possible to run stable-ts on Replicate?


3 participants