Experience
Speech Enhancement/Separation
Take me to pookie
GAN-based speech enhancement for CI users [ less ]
An important question in developing DNN-based speech enhancement systems for CI users is, what optimization criterion should be used to train the networks? Mean square error (MSE) or log-MSE measures are normally used, but research in CI domain has shown that these measures do not correlate well with the speech intelligibility of CI users. Conditional generative adversarial network (CGAN) provides a solution by replacing the optimization loss functions with a discriminator network. In this project, I am exploring different methods of leveraging CGAN-based architectures to improve the speech intelligibility of CI users.
CNN-based speech enhancement in CI auditory space [ read more ]
Speech separation using probabilistic PIT [ read more ]
Emotion Recognition
Recognizing continuous emotion labels using MDS network [ read more ]
Recognizing continuous emotion labels using large receptive field networks [ read more ]
Progressive neural networks for transfer learning in computational paralinguistics [ read more ]
Multimodal emotion recognition from speech and text [ read more ]
Mood Recognition
Statistical Parametric Speech Synthesis
Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis —————————————
Statistical Parametric Speech Synthesis
Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis ————————————— Statistical Parametric Speech Synthesis —————————————