<<<CP1234>>> SPEECH PROCESSING AND SYNTHESIS

{{{credits}}}

L	T	P	C
3	0	0	3

Course Objectives

To explore the fundamentals of digital speech processing.
To understand the basic concepts and algorithms of speech processing.
To familiarize the students with the various speech signal representation, coding and recognition techniques.
To study the concepts and evaluation method of speech synthesis.

{{{unit}}}

Unit I

Fundamentals of Digital Speech Processing

9

Introduction: Discrete-Time signals and systems – Transform representation of Signals and systems – Fundamentals of digital filters – Sampling; Process of Speech Production – Acoustic theory of speech production – Digital models for speech signals.

{{{unit}}}

Unit II

Speech Signal Analysis in Time Domain

9

Time-dependent processing of speech – Methods for extracting the parameters: Energy – Average Magnitude – Zero-crossing rate; Slience discrimation using ZCR and energy – Short-time autocorreleation function – Pitch period estimation using autocorrelation function.

{{{unit}}}

Unit III

Speech Signal Analysis in Frequency Domain

9

Short time fourier analysis – Fourier transform and linear interpretations – Sampling rates – Spectrographic Displays – Formant extraction – Pitch extraction – Linear predictive coding: Autocorrelation method – Covariance method; Solution of LPC equations – Durbin’s Recursive solution – Application of LPC parameters – Pitch detection.

{{{unit}}}

Unit IV

Speech Recognition

9

Introduction – Preprocessing – Parametric representation – Speech segmentation – Dynamic time warping – Vector quantization – Hidden Markov Model – Language Models – Developing an isolated digit recognition system.

{{{unit}}}

Unit V

Speech Synthesis

9

Attributes of speech synthesis – Formant speech synthesis – Concatenative speech synthesis – Prosodic modification of speech – Source filter models for prosody modification – Evaluation of TTS system.

\hfill Total: 45

Course Outcomes

After the completion of this course, students will be able to:

Illustrate how the speech production is modelled (K2)
Extract features from the speech signal in both time and frequency domain (K3)
Developing a speech recognition system using statistical approach (K3)
Compare the various methods of speech synthesis (K2)

References

L. R. Rabiner and R. W. Schaffer, “Digital Processing of Speech signals”, Prentice Hall, 1978.
Xuedong Huang, Alex Acero, Hsiao-Wuen Hon, “Spoken Language Processing – A guide to Theory, Algorithm and System Development”, Prentice Hall PTR, 2001.
Lawrence Rabiner and Biing-Hwang Juang, “Fundamentals of Speech Recognition”, Prentice Hall Signal Processing Series, 1993.
Thomas F.Quatieri, “Discrete-Time Speech Signal Processing”, Pearson Education, 2002.
Ben Gold and Nelson Morgan, “Speech and Audio Signal Processing”, John Wiley and Sons Inc., Singapore, 2004.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

234-Speech-Processing-and-Synthesis.org

234-Speech-Processing-and-Synthesis.org

<<<CP1234>>> SPEECH PROCESSING AND SYNTHESIS

Course Objectives

Course Outcomes

References

Files

234-Speech-Processing-and-Synthesis.org

Latest commit

History

234-Speech-Processing-and-Synthesis.org

File metadata and controls

<<<CP1234>>> SPEECH PROCESSING AND SYNTHESIS

Course Objectives

Course Outcomes

References