Chair of
Multimedia Communications and Signal Processing
Prof. Dr.-Ing. André Kaup

Speech and Audio Signal Processing

Lecturer: Prof. Dr.-Ing. Walter Kellermann
Tutor: M.Sc. MHD Modar Halimeh
Type: Lecture
Lecture language: English
Time Lecture: Mon 14:15-15:45 H15*
Tue 8:15-9:45 H5*
Time Supplements: Thu 8:15-9:45 06.021*
Wed 18:15-19:45 06.021*
Credit Points: 5 ECTS*
Hours (Lecture): 3
Hours (Exercise): 1
Prerequisites: Signals and Systems, Digital Signal Processing

All data marked with a * are imported directly from UnivIS.


Contents
The course concentrates on algorithms for speech and audio signal processing with applications in telecommunications and multimedia, especially:
  • Human Speech Production and Hearing: Source-filter model, filterbank model of the cochlea, masking effects

  • Signal Representation for Speech and Audio: Estimation and representation of short-term and long-term statistics in the time domain, frequency domain, and cepstral domain; typical examples and visualisations

  • Source Coding for Speech and Audio: Criteria; scalar and vector quantization and coding; linear prediction; pitch prediction; waveform, parametric, and hybrid coding; standards (ITU, GSM, ISO-MPEG)

  • Fundamental Concepts of Automatic Speech Recognition: Feature extraction, Dynamic Time Warping, Hidden Markov Models

  • Fundamental Concepts of Speech Synthesis: Text-to-speech systems, model-based and data-driven synthesis, PSOLA synthesis

  • Signal Enhancement for Acquisition and Reproduction: Noise reduction, echo cancellation, and dereverberation using single-channel and multichannel algorithms

Literature
Recommendations for each topic are given during the lectures.