Load surah recording with ayah timestamps. #2814

Open
taominator opened this issue Aug 22, 2024 · 0 comments

Assalamu alaikum.

Would it be possible to do what the title says? I've made a script that can be run on Google Colab to get the start and end timestamps of each word, given a recording and the text file of a surah. It uses AI to generate a transcript of the recording with word-level timestamps. The transcript occasionally contains mistakes, such as merging several words into one or splitting one word into several, so the script then algorithmically aligns it with the text of the surah to find the best match. I used the Whisper large-v2 model, and the timestamps are pretty accurate.
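
To illustrate the alignment idea, here is a minimal sketch. It is not the actual script. It assumes the transcript is already available as a list of `(word, start, end)` tuples, and it uses `difflib.SequenceMatcher` from the Python standard library as a stand-in for the alignment step. The function names `align_words` and `ayah_spans`, the transliterated sample words, and the even-split rule for merged or split words are all hypothetical choices made for this example.

```python
from difflib import SequenceMatcher


def align_words(transcript, reference):
    """Map timestamps from transcribed words onto the reference words.

    transcript: list of (word, start, end) tuples from the speech model.
    reference:  list of words from the surah text.
    Returns one (word, start, end) tuple per reference word; start and end
    are None when no transcribed audio could be matched to that word.
    """
    heard = [word for word, _, _ in transcript]
    aligned = [(word, None, None) for word in reference]
    matcher = SequenceMatcher(a=heard, b=reference, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Words match one to one: copy the timestamps directly.
            for i, j in zip(range(i1, i2), range(j1, j2)):
                aligned[j] = (reference[j], transcript[i][1], transcript[i][2])
        elif tag == "replace":
            # Merged or split words: spread the covered time span evenly
            # over the reference words in this block (an assumption).
            start, end = transcript[i1][1], transcript[i2 - 1][2]
            step = (end - start) / (j2 - j1)
            for k, j in enumerate(range(j1, j2)):
                aligned[j] = (reference[j], start + k * step, start + (k + 1) * step)
        # "insert": reference words that were never heard stay None.
        # "delete": extra transcribed words are ignored.
    return aligned


def ayah_spans(aligned, ayah_lengths):
    """Collapse word timestamps into one (start, end) pair per ayah.

    ayah_lengths: number of reference words in each ayah, in order.
    """
    spans, pos = [], 0
    for length in ayah_lengths:
        timed = [w for w in aligned[pos:pos + length] if w[1] is not None]
        spans.append((timed[0][1], timed[-1][2]) if timed else (None, None))
        pos += length
    return spans


# Hypothetical sample: the model merged the last two words into one.
transcript = [("bismi", 0.0, 0.5), ("allahi", 0.5, 1.0), ("alrahmanialrahim", 1.0, 3.0)]
reference = ["bismi", "allahi", "alrahmani", "alrahim"]
aligned = align_words(transcript, reference)
spans = ayah_spans(aligned, [4])
```

Exact word matching is the simplest possible comparison. A real transcript would probably need normalization first (for example, stripping diacritics) or a fuzzy comparison between words.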

This would be really helpful if you want to imitate a certain recitation, especially for long surahs, since jumping to a specific ayah or repeating a section is extremely tedious on YouTube. It's practically impossible for the app to have recordings of every single reciter, and many reciters have multiple styles as well. This feature would give users the option of loading custom recordings from their device. I'm willing to implement it if you could explain the relevant code and point me to the right section of the codebase.
