Skip to content

Latest commit

 

History

History
135 lines (122 loc) · 5.99 KB

PE303-Speech-Processing-and-Synthesis.org

File metadata and controls

135 lines (122 loc) · 5.99 KB

<<<PE303>>> SPEECH PROCESSING AND SYNTHESIS

CO PO MAPPING

PO1PO2PO3PO4PO5PO6PO7PO8PO9PO10PO11PO12PSO1PSO2PSO3
K3K4K5K5K6-------K5K3K6
CO1K2221010011101121
CO2K3322010011101231
CO3K3322010011101231
CO4K3322010011101231
CO5K2221010011101121
Score131080500555058135
Course Mapping322010011101231

{{{credits}}}

LTPC
3003

COURSE OBJECTIVES

  • To explore the fundamentals of digital speech processing
  • To understand the basic concepts and algorithms of speech processing
  • To be familiar with the various speech signal representation, coding and recognition techniques
  • To study the concepts and evaluation methods of speech synthesis.

{{{unit}}}

UNIT IFUNDAMENTALS OF DIGITAL SPEECH PROCESSING9

Introduction: Discrete-time signals and systems – Transform representation of signals and systems – Fundamentals of digital filters – Sampling; Process of speech production – Acoustic theory of speech production – Digital models for speech signals.

{{{unit}}}

UNIT IISPEECH SIGNAL ANALYSIS IN TIME DOMAIN9

Time-dependent processing of speech – Methods for extracting the Parameters: Energy – Average magnitude – Zero-crossing rate; Silence discrimation using ZCR and energy – Short-time autocorreleation function – Pitch period estimation using autocorrelation function.

{{{unit}}}

UNIT IIISPEECH SIGNAL ANALYSIS IN FREQUENCY DOMAIN9

Short time fourier analysis – Fourier transform and linear interpretations – Sampling rates – Spectrographic displays – Formant extraction – Pitch extraction – Linear predictive coding: Autocorrelation method – Covariance method; Solution of LPC equations – Durbin’s Recursive solution – Application of LPC parameters – Pitch detection.

{{{unit}}}

UNIT IVSPEECH RECOGNITION9

Introduction – Preprocessing – Parametric representation – Speech segmentation – Dynamic time warping – Vector quantization – Hidden Markov Model – Language Models – Developing an isolated digit recognition system.

{{{unit}}}

UNIT VSPEECH SYNTHESIS9

Attributes of speech synthesis – Formant speech synthesis – Concatenative speech synthesis – Prosodic modification of speech – Source filter models for prosody modification – Evaluation of TTS system.

\hfill Total Periods: 45

COURSE OUTCOMES

After the completion of this course, students will be able to:

  • Illustrate how the speech production is modelled (K2)
  • Extract features from the speech signal in time domain (K3)
  • Analyze the speech signal in frequency domain (K3)
  • Develop a speech recognition system using statistical approach (K3)
  • Compare the various methods of speech synthesis (K2).

TEXT BOOKS

  1. L R Rabiner, R W Schafer, “Digital Processing of Speech Signals”, Pearson Education, Delhi, India, 2004.
  2. Xuedong Huang, Alex Acero, Hsiao-Wuen Hon, “Spoken Language Processing – A guide to Theory, Algorithm and System Development”, Prentice Hall PTR, 2001.

REFERENCES

  1. L R Rabiner, B H Jhuang, B Yegnanarayana, “Fundamentals of Speech Recognition”, Pearson Education, 2009.
  2. Thomas F Quatieri, “Discrete-Time Speech Signal Processing”, Pearson Education, 2002.
  3. Ben Gold, Nelson Morgan, “Speech and Audio Signal Processing”, John Wiley and Sons Inc, 2004.
  4. J R Deller Jr, J H L Hansen, J G Proakis, “Discrete-Time Processing of Speech Signals”, Wiley-IEEE Press, NY, USA, 1999.
  5. Daniel Jurafsky, James H Martin, “Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition”, 2nd edition, Pearson education, 2013.