Jump to content

IT60116: Advanced Topics In Speech Processing

From Metakgp Wiki
IT60116
Course name ADVANCED TOPICS IN SPEECH PROCESSING
Offered by Information Technology
Credits 3
L-T-P 3-0-0
Previous Year Grade Distribution


1






EX A B C D P F
Semester {{{semester}}}


Syllabus

Syllabus mentioned in ERP

Prerequisite: NoneSpeech production and perception mechanisms, Speech Signal Processing Methods; Knowledge sources in speech: Time domain and frequency domain, Spectrograms, Knowledge sources at segmental, sub-segmental and suprasegmental (prosodic) levels, excitation source, vocal tract system and higher level knowledge sources and linguistic and semantic knowledge. Modeling techniques for developing speech systems: Vector quantization, Hidden Markov models, Gaussian mixture models, Support vector machines and Neural networks; Speech Coding: Coding of speech signals, Waveform coding, Speech-specific coders; Speech Recognition: Issues in speech recognition, Isolated word recognition, Connected word recognition, Continuous speech recognition, Large vocabulary continuous speech recognition; Speech Synthesis: Issues in speech synthesis, Models for speech synthesis, Different speech synthesis systems, Prosodic aspects in speech synthesis, Development of speech synthesis system. Evaluation methodologies for speech synthesis systems; Speaker Recognition: Issues in speaker recognition, Speaker verification vs identification, Textdependent vs text-independent speaker recognition, Development of speaker recognition systems; Speech Enhancement: Enhancement of noisy speech, Enhancement of reverberant speech, Enhancement of multi-speaker speech.


Concepts taught in class

Student Opinion

How to Crack the Paper

Classroom resources

Additional Resources