Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Delayed Captions #271

Open
OutrageousStorm opened this issue Dec 14, 2024 · 4 comments
Open

[BUG] Delayed Captions #271

OutrageousStorm opened this issue Dec 14, 2024 · 4 comments
Assignees

Comments

@OutrageousStorm
Copy link

Describe the bug
Captions are not synced to VO clips.

To Reproduce
Steps to reproduce the behavior:

  1. Create video in current version. The more paragraphs you add, the easier to tell.

Expected behavior
VO clips should be in sync with the captions.

Desktop (please complete the following information):

  • OS: Windows
  • Browser: Firefox
  • Python Version: 3.11.

Additional context
Here's a video showing what happens.

output.mp4
@mcjill
Copy link

mcjill commented Dec 15, 2024

I hear you, @OutrageousStorm, but I've used ZapCap's API, and it's actually pretty solid no lag here. Maybe check your setup? Most caption tools rely on it for a reason.

@FujiwaraChoki
Copy link
Owner

This has been an issue since V1. If anybody is willing to open a PR, I'd appreciate that.

@mrflolo
Copy link

mrflolo commented Jan 13, 2025

Integrating OpenAI whisper as a local model that does the transcribing could fix the issue. Take a look at the stable whisper python package. It has direct functionality to split the transcribed segments into smaller segments, that have acurat timestamps.

import stable_whisper
model = stable_whisper.load_model('base')
result = model.transcribe(audio_path, verbose=None).split_by_length(10)
result.to_srt_vtt(srt_path, word_level=False )

this would do the job and also output the transcript as a .srt

@hellofresh01
Copy link

Can you guys help me in setup? Cross-Origin Request Blocked: The Same Origin Policy disallows reading the remote resource at http://localhost:8080/api/generate. (Reason: CORS header ‘Access-Control-Allow-Origin’ missing). Status code: 200.

Cross-Origin Request Blocked: The Same Origin Policy disallows reading the remote resource at http://localhost:8080/api/generate. (Reason: CORS request did not succeed). Status code: (null) I am getting this issue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants