Phonetic signals and their coding

B232 - Summer 23/24

XP31FSK

Phonetic signals and their coding - XP31FSK

Credits	4
Semesters	Summer
Completion	Exam
Language of teaching	Czech
Extent of teaching	2P+2S

Annotation

The subject introduces the processing of speech signals. Within the subject students should manage from basic to advanced and modern algorithms of speech analysis, synthesis, coding or enhancement. Further reasonable part is focused on speech recognition, where students will get to know modern and advanced technique in task as small and large vocabulary speech recognition or speaker recognition. Special attention is devoted to usage of classification techniques based on GMM, DTW, HMM, ANN/DNN, WFST, JFA, i-vectrors, etc.

Study targets

No data.

Course outlines

1. Speech production and perception model, phonetic description of speech
2. Spectral characteristics of speech (DFT, LPC, filter banks)
3. Cepstral reprezentation of speech and possible applications
4. Voice activity detection and speech enahncement.
5. Speech synthesis
6. Speech coding
7. Basic and advanced feature extraction techniques (PCA, LDA)
8. Classification approaches for particular ASR tasks (GMM, HMM, VQ)
9. Modern methods of speaker verification and identification (UBM-GMM, JFA, i-vectors)
10. DTW- and HMM-based speech recognition
11. Continuous speech recognition, language modelling, WFST
12. Adaptation techniques in speech recognition
13. Modern ASR systems based on ANN/DNN, methods of deep learning
14. Reserve.

Exercises outlines

Seminars are organized as common consultations of registered students. The main focus is paid on individual work of students during the semester and a their work on chosen individual topics. The solutions of individual projects are discussed at common consultations.

Literature

[1] Deller, J.R. - Hansen, J.H.L. - Proakis, J.G.: Discrete-time processing of speech signals, New York: IEEE Press 2000, 908 s., ISBN
0-7803-5386-2
[2] http://www.ee.ic.ac.uk/hp/staff/dmb/courses/speech/speech.htm

Requirements

Digital signal processing are supposed as preliminary knowledge.