Add aligner #756

vadimdddd · 2021-11-09T16:37:53Z

Aligner is a program for aligning words in time relative to other words in audio file. Gentle project used m3.cc and k3.cc as language and acoustic models for alignment, these approaches were reworked into aligner, which made it possible to use different language models and accelerated the alignment process. Also in setup.py was added ability to run the aligner not only from the folder with it was added.

How to work:

You have to download any language model
You have to prepare .wav and .txt files
When starting the program, you have to specify the required arguments:
a) path to the wavfile; b) path to the textfile; c) path to the language model.

Example(how to run):
python3 vosk_align.py example/glorious.wav example/glorious.txt example/model

python/vosk/aligner/language_model.py

python/aligner/example/lucier.txt

python/vosk/aligner/multipass.py

python/vosk/aligner/transcriber.py

python/vosk/aligner/transcription.py

nshmyrev · 2021-11-09T16:52:40Z

I'm also waiting for the tests for the aligner so we can automatically verify the code

…ged value name LM to recognizer

…duration as parametr(input/output), set realign case audiofile start position as tell(), splited lines for more readable script

…SS, NFIA, NFIT attributes from str to int)

…ithm of obtaining chunk's start/end idxs, deleted condition non-existing start/end edges of chunk, added adjustment values shift_start/end tuning left/right edges of chunk

…stakes inside, either test_align.py script with 5 tests which using pytest, vosk_align.pt was modified for testing outside

…mber tokens in txt and wav files, forced_aligner.py: wavfile was added as arg for multipass, multipass.py: now getting wavfile as arg, either were added case if first or last token in txt file does not found in transcript or audio and case if start_pos value less than 0 because shift_start can shift it to negative number

python/vosk/aligner/full_transcriber.py

python/vosk/aligner/multipass.py

python/aligner/test_align.py

python/vosk/aligner/recognizer.py

…ords was added as variable for left/right words around NFIA or NFIT words; property names was added for non success cases instead of numbers

… recognizer to process_text, either for forced_aligner.py and multipass.py; in cats.txt, dagon.txt, glorious.txt, polar.txt was added mistakes for testing, fixed bug in polar.wav, deleted unused wendy example, added log files cats, dagon, glorious, polar for tests; fixed mistakes in test_align.py, added asserts; vosk_align.py: added logging for msgs and opportunity to call vosk_align.py from test_align.py

CodeFusionFX

I'm rooting for you and your work! Please keep up the great work!

dynodino · 2022-04-14T18:56:54Z

python/vosk/aligner/transcription.py

+        options = {
+                'sort_keys':    True,
+                'indent':       4,
+                'separators':   (',', ': '),


Also 'ensure_ascii': False,

There is also a lot of trailing whitespace in the code.

(Nice PR)

dynodino · 2022-04-14T18:57:23Z

python/vosk/aligner/diff_align.py

+
+    for op, a, b in word_diff(hypothesis, reference):
+
+            try:


double indented (8 instead of 4 spaces)

dynodino · 2022-04-15T06:25:44Z

python/vosk/aligner/forced_aligner.py

+            amount, length = unalign(words)
+            logging.info("%d unaligned words (of %d)", amount, length)
+
+        if amount != 0:


amount is unassigned if logging is None

Also, amount != 0 is duplicated

Also, progress_cb could be None.

Hello, thanks for you help, I will fix it :)

Hello, thanks for you help, I will fix it :)

Hello did you have success fixing it?

Hello, no for a while, but I hope to start it after finish my current project

I returned to aligner project, need to rework code a bit

Wonderful 👍 So excited, can't wait to try. @vadimdddd Do you an email or way I contact you to collaborate? Would love to share some thoughts and ideas.

@CodeFusionFX, sure thing - [email protected]

… spaces; transcription.py: added parameter in options; forced_aligner.py: deleted duplicated amount condition

…ed_aligner.py; vosk_align.py: get_result(args) was extracted from main() for testing; test_align.py: passing of args for testing has been changed to get_result(args)

ryanfb · 2022-06-30T15:37:04Z

In testing this locally, there needs to be an empty __init__.py file at python/vosk/aligner/__init__.py, otherwise I would get ModuleNotFoundError: No module named 'vosk.aligner' when trying to run vosk-aligner or vosk_align.py.

vadimdddd · 2022-06-30T17:36:05Z

@ryanfb thx for the info. I will fix it.

Laurian · 2023-01-23T06:03:42Z

I'm really interested in this PR, is there anything I can do to help?

nshmyrev · 2023-01-25T22:29:24Z

@Laurian just ping me if I forget please, I'll try to merge it

CodeFusionFX · 2023-02-06T05:54:52Z

@Laurian just ping me if I forget please, I'll try to merge it

I am also very interested in this as well. What can I do to help?

Laurian · 2023-03-30T18:33:48Z

@nshmyrev ping 🙏

CodeFusionFX · 2023-04-10T04:18:10Z

@nshmyrev
Ping hoping this get merged. Anxiously excited for this merge since last year what can the community do to help?

finnnnnnnnnnnnnnnnn · 2023-09-07T19:07:38Z

@nshmyrev
Really hoping this can get merged.

CodeFusionFX · 2024-08-16T06:41:46Z

Is this Pull dead?

add aligner

580067b

nshmyrev requested changes Nov 9, 2021

View reviewed changes

vadimdddd added 13 commits November 10, 2021 15:16

deleted unused method in transcription.py

2689046

deleted unused class in forced_aligner, reworked method unalign, chan…

f5b6aa8

…ged value name LM to recognizer

fix name mistake nof to not

199a0ae

rename script and class language_mode to recognizer/recognizer

a63af1d

rename value language_model to recognizer, deleted value duration

ee14305

changed transcriber.py fixed frames duration (add mul by 2), deleted …

7dc3906

…duration as parametr(input/output), set realign case audiofile start position as tell(), splited lines for more readable script

replaced gentle lucier wav/txt examples to new glorious wav/txt examples

c8d4ac2

fixed mistakes in textfile

b0d5464

diff_align(comment mistakes fix), transcription(changed type of SUCCE…

0319500

…SS, NFIA, NFIT attributes from str to int)

fixed comment mistakes, changed w.case type str to int, changed algor…

8e4ccc6

…ithm of obtaining chunk's start/end idxs, deleted condition non-existing start/end edges of chunk, added adjustment values shift_start/end tuning left/right edges of chunk

deleted unused values

76414d4

tests was added, 4 examples cats, dagon, polar, wendy with chaotic mi…

cd733ce

…stakes inside, either test_align.py script with 5 tests which using pytest, vosk_align.pt was modified for testing outside

nshmyrev changed the title ~~add aligner(merging gentle and vosk projects)~~ Add aligner Dec 7, 2021

nshmyrev requested changes Dec 7, 2021

View reviewed changes

vadimdddd added 5 commits December 8, 2021 18:03

deleted unused script full_transcriber.py; in multipass.py: reserve_w…

e6ef514

…ords was added as variable for left/right words around NFIA or NFIT words; property names was added for non success cases instead of numbers

replaced recognize to recognizermethod name

0d988ca

delete unused line

9ebb0dd

fixed exception mistake

2c9458e

nshmyrev force-pushed the master branch from a490f35 to b090341 Compare January 21, 2022 12:22

vadimdddd added 3 commits January 26, 2022 22:54

fixed commit "delete unused line", changed line with call main function

0444439

deleted unused module

b4962f7

add_align.py was added into bin script with setup.py machinery

1d5f66d

CodeFusionFX reviewed Apr 9, 2022

View reviewed changes

dynodino reviewed Apr 14, 2022

View reviewed changes

dynodino reviewed Apr 15, 2022

View reviewed changes

vadimdddd and others added 5 commits April 18, 2022 17:50

changed back setup.py verison of vosk

8bfcf3f

vosk_align.py: changed args for main(); diff_align.py: changed 8 to 4…

14a8f75

… spaces; transcription.py: added parameter in options; forced_aligner.py: deleted duplicated amount condition

replaced example, scripts, test folders; fixed algorithm bugs in forc…

af13309

…ed_aligner.py; vosk_align.py: get_result(args) was extracted from main() for testing; test_align.py: passing of args for testing has been changed to get_result(args)

deleted model

f8cbdb9

Merge branch 'master' into add_aligner

3babab0

antiboredom mentioned this pull request Jun 22, 2022

Automatically refine word-level alignments from sentence-level alignments antiboredom/videogrep#106

Open

vadimdddd and others added 12 commits June 30, 2022 19:47

added __init__.py, changed vosk_align.py script structure

5e5309b

merge branches

14bbd3d

Merge branch 'alphacep:master' into add_aligner

efdfd85

delete init

bec083b

add init

6b688c6

Merge branch 'alphacep:master' into master

66d0808

Merge branch 'master' into add_aligner

a1ec8b7

Merge branch 'alphacep:master' into add_aligner

bdde7af

add init

fc848c1

fix setup.py

4ef5dcb

fix init

92e68eb

changed output file name

081f0ea

sridhar1ga mentioned this pull request Sep 10, 2023

Implement long aligner #337

Open

finnnnnnnnnnnnnnnnn mentioned this pull request Sep 14, 2023

Would it be possible to use VOSK instead of wav2vec in order to force alignment? m-bain/whisperX#463

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add aligner #756

Add aligner #756

vadimdddd commented Nov 9, 2021 •

edited

Loading

nshmyrev commented Nov 9, 2021

CodeFusionFX left a comment

dynodino Apr 14, 2022

vadimdddd May 25, 2022

dynodino Apr 14, 2022

vadimdddd May 25, 2022

dynodino Apr 15, 2022 •

edited

Loading

vadimdddd Apr 15, 2022

CodeFusionFX May 9, 2022

vadimdddd May 9, 2022

vadimdddd May 25, 2022

CodeFusionFX Jun 3, 2022

vadimdddd Jun 3, 2022 •

edited

Loading

ryanfb commented Jun 30, 2022

vadimdddd commented Jun 30, 2022

Laurian commented Jan 23, 2023

nshmyrev commented Jan 25, 2023

CodeFusionFX commented Feb 6, 2023

Laurian commented Mar 30, 2023

CodeFusionFX commented Apr 10, 2023

finnnnnnnnnnnnnnnnn commented Sep 7, 2023

CodeFusionFX commented Aug 16, 2024

Add aligner #756

Are you sure you want to change the base?

Add aligner #756

Conversation

vadimdddd commented Nov 9, 2021 • edited Loading

nshmyrev commented Nov 9, 2021

CodeFusionFX left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dynodino Apr 15, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vadimdddd Jun 3, 2022 • edited Loading

Choose a reason for hiding this comment

ryanfb commented Jun 30, 2022

vadimdddd commented Jun 30, 2022

Laurian commented Jan 23, 2023

nshmyrev commented Jan 25, 2023

CodeFusionFX commented Feb 6, 2023

Laurian commented Mar 30, 2023

CodeFusionFX commented Apr 10, 2023

finnnnnnnnnnnnnnnnn commented Sep 7, 2023

CodeFusionFX commented Aug 16, 2024

vadimdddd commented Nov 9, 2021 •

edited

Loading

dynodino Apr 15, 2022 •

edited

Loading

vadimdddd Jun 3, 2022 •

edited

Loading