Add notebook for audio/video transcription with timestamps #19

Open
GalenReich opened this issue Jul 16, 2024 · 3 comments
Labels
enhancement New feature or request

Comments

@GalenReich
Contributor

A useful notebook (or an addition to the existing Whisper transcription notebook) would be one that lets users run Whisper over audio for transcription and then produce word- or sentence-level timestamps (not part of Whisper's functionality).

A package like https://github.com/linto-ai/whisper-timestamped might make this easy.
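For illustration, a minimal sketch of what word-level output from that package could look like in a notebook. It assumes `whisper-timestamped` is installed and that its result dictionary nests a `words` list (with `text`, `start`, `end`) inside each segment, as its README describes; the helper names `flatten_words` and `transcribe_words` are hypothetical, not part of the package.

```python
from typing import Any


def flatten_words(result: dict[str, Any]) -> list[tuple[float, float, str]]:
    """Collect (start, end, text) for every word in a whisper-timestamped style result."""
    words = []
    for segment in result.get("segments", []):
        # Segments without word-level data are skipped silently.
        for word in segment.get("words", []):
            words.append((word["start"], word["end"], word["text"]))
    return words


def transcribe_words(audio_path: str, model_name: str = "tiny") -> list[tuple[float, float, str]]:
    """Run whisper-timestamped on one file and return its word-level timestamps."""
    # Imported lazily so flatten_words can be used without the package installed.
    import whisper_timestamped as whisper

    audio = whisper.load_audio(audio_path)
    model = whisper.load_model(model_name)
    result = whisper.transcribe(model, audio)
    return flatten_words(result)
```

Calling `transcribe_words("interview.wav")` (a placeholder file name) would return a list of `(start, end, text)` tuples in seconds, which is easy to load into a DataFrame or export as CSV.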

@GalenReich GalenReich added the enhancement New feature or request label Jul 16, 2024
@belisards

Maybe also check https://github.com/Vaibhavs10/insanely-fast-whisper and its `--timestamp {chunk,word}` parameter.

Here is a notebook using it: https://github.com/Vaibhavs10/insanely-fast-whisper/blob/main/notebooks/infer_faster_whisper_large_v2.ipynb
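As a rough Python equivalent of the chunk/word choice, here is a sketch using the Hugging Face Transformers speech-recognition pipeline, which insanely-fast-whisper appears to build on. The model ID, the 30-second chunk length, and the helper names `chunks_to_rows` and `transcribe_chunks` are assumptions chosen for illustration, not taken from that repository.

```python
def chunks_to_rows(output: dict) -> list[tuple]:
    """Turn a pipeline result with a "chunks" list into (start, end, text) rows."""
    rows = []
    for chunk in output.get("chunks", []):
        # "timestamp" is a (start, end) pair in seconds; end may be None
        # when the model does not emit a closing timestamp for the last chunk.
        start, end = chunk["timestamp"]
        rows.append((start, end, chunk["text"].strip()))
    return rows


def transcribe_chunks(audio_path: str,
                      model_id: str = "openai/whisper-large-v3",
                      word_level: bool = False) -> list[tuple]:
    """Transcribe one file and return chunk-level or word-level timestamps."""
    # Imported lazily so chunks_to_rows can be used without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id, chunk_length_s=30)
    # return_timestamps=True gives chunk-level timestamps, "word" gives word-level ones.
    output = asr(audio_path, return_timestamps="word" if word_level else True)
    return chunks_to_rows(output)
```

The `word_level` flag here plays the same role as choosing between `chunk` and `word` on the command line.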

@GalenReich
Contributor Author

That's very cool @belisards - thank you! Word-level timestamps probably aren't important for most research, so those sentence-level ones look really good!

@belisards

Yep, I agree, but note that even in its vanilla version Whisper generates segment-level timestamps. Here it is in a notebook I created for a workshop last year: https://github.com/belisards/nlp_intro/blob/main/whisper.ipynb
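A minimal sketch of reading those timestamps with the `openai-whisper` package, assuming it is installed along with ffmpeg. The `segments` list in the result carries `start`, `end`, and `text` for each segment; the helper names and the output line format below are hypothetical choices for illustration.

```python
def format_timestamp(seconds: float) -> str:
    """Format a duration in seconds as HH:MM:SS.mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def segments_to_lines(segments: list[dict]) -> list[str]:
    """Render Whisper segments as one '[start --> end] text' line each."""
    return [
        f"[{format_timestamp(s['start'])} --> {format_timestamp(s['end'])}] {s['text'].strip()}"
        for s in segments
    ]


def transcribe_segments(audio_path: str, model_name: str = "base") -> list[str]:
    """Transcribe one file with vanilla Whisper and return timestamped lines."""
    # Imported lazily so the formatting helpers work without whisper installed.
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return segments_to_lines(result["segments"])
```

Recent releases of `openai-whisper` also accept `word_timestamps=True` in `transcribe`, which adds a `words` list to each segment, so a separate package may only be needed for older versions or for different alignment behaviour.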

Another feature that might be useful for open-source research is speaker diarization. There is a great video covering many Whisper variants and features like this: https://www.youtube.com/watch?v=Thc0vtnWYOo
