Error on first trial: Passing sample rate to mfcc_hires.conf and doing the decoding #1

SvenST89 · 2022-05-12T15:23:35Z

Hey there!
First of all: Nice tutorial. Easy to follow and well-explained.
Yet, I encountered some errors on the first trials.
I have one question and two suggestions/comments for improvement.

1. What does line 61 in main.py actually do? I could not work out its purpose, so I adjusted it to my needs (see point 2).

2. I adjusted the code section that passes the sample rate to mfcc_hires.conf. I added a strip() call to line 60 because the code threw an error on the first run: my file had trailing spaces. My suggestion looks as follows:

# Reformat the line to use the sample rate of the .wav file
line = line.strip().split("=")  # strip() removes trailing spaces and the newline
print("list of line elements in mfcc_hires.conf file: ", line)
line[1] = str(sample_rate)  # overwrites the sample rate in the list 'line' at index position '1'
line = "=".join(line)  # join() needs strings, hence str() above

3. I created a Kaldi-like 'text' file as the decoding step did not work without this file.
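
For anyone hitting the same error, here is a minimal sketch of the idea in point 2. It assumes the option in mfcc_hires.conf is named --sample-frequency (the name Kaldi's feature tools use) and that each option line has the form key=value. set_sample_frequency is a hypothetical helper for illustration, not code from main.py.

```python
def set_sample_frequency(conf_lines, sample_rate, option="--sample-frequency"):
    """Return a copy of conf_lines with the value of `option` set to sample_rate.

    Assumption: option lines look like "--name=value". The helper name and the
    default option name are illustrative, not taken from the tutorial code.
    """
    updated = []
    for raw in conf_lines:
        # strip() removes trailing spaces and the newline that caused the error
        line = raw.strip()
        # partition() splits on the first "=" only and never raises
        key, sep, _old_value = line.partition("=")
        if sep and key.strip() == option:
            # Anything after "=" on this line (old value, inline comment) is replaced
            line = f"{option}={sample_rate}"
        # Re-add the newline so the lines can be written back to the file
        updated.append(line + "\n")
    return updated


if __name__ == "__main__":
    example = ["--use-energy=false  \n", "--sample-frequency=16000   \n"]
    print("".join(set_sample_frequency(example, 8000)), end="")
```

Working on a list of lines keeps the helper independent of file handling: read the file with readlines(), pass the list through the helper, and write the result back with writelines().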


completelyboofyblitzed · 2022-07-13T19:41:32Z

Hey @SvenST89! Thank you for sharing. I ran into the missing text file problem too. What kind of text file does it need?

SvenST89 · 2022-07-15T13:42:24Z

Hi @kak-to-tak, this text file contains the transcription of each utterance in the audio file. If speaker information is available in your project setup, each line of this 'text' file could have the following structure: <speaker_id>_<utterance_ID> <transcription of each sentence/segment, if you have segmented the audio file>. Check the Kaldi dummy tutorial here to get an idea of it. Usually, you have to prepare such a training file manually and transcribe the audio yourself. Why? You need to train the algorithm: if you do not feed the 'brain' with transcriptions, it will not learn how to transcribe.

Also check this Kaldi tutorial to get a glimpse of how Kaldi works.
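
To make the line structure concrete, here is a small sketch that builds such 'text' lines. The speaker IDs, utterance IDs, and transcriptions are made-up sample data, and build_text_lines is a hypothetical helper, not part of Kaldi or the tutorial. The lines are sorted by utterance key because Kaldi's data preparation scripts generally expect sorted files.

```python
def build_text_lines(utterances):
    """Build the lines of a Kaldi-style 'text' file.

    utterances: iterable of (speaker_id, utterance_id, transcription) tuples.
    Returns lines of the form "<speaker_id>_<utterance_id> <transcription>",
    sorted by the utterance key.
    """
    entries = []
    for speaker_id, utterance_id, transcription in utterances:
        key = f"{speaker_id}_{utterance_id}"
        # Collapse repeated whitespace so each line is "key word word ..."
        words = " ".join(transcription.split())
        entries.append((key, words))
    entries.sort(key=lambda entry: entry[0])
    return [f"{key} {words}" for key, words in entries]


if __name__ == "__main__":
    # Made-up sample data for illustration only
    sample = [
        ("spk2", "utt1", "hello   world"),
        ("spk1", "utt2", "good morning"),
        ("spk1", "utt1", "hi there"),
    ]
    for line in build_text_lines(sample):
        print(line)
```

Prefixing the utterance ID with the speaker ID, as in the structure above, keeps utterances of the same speaker next to each other after sorting.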

completelyboofyblitzed · 2022-07-20T08:29:03Z

Got it, thank you!
