Section 2.4 Current PhD and Graduate Students

  1. Li Haoyang, Neural Text to Speech (reg Aug 2023, Alibaba-PhD)
  2. Tuan Truong Duc, Robust Speaker verification under noisy and short duration scenario (reg Jan 2023, PhD)
  3. Fabian Ritter Gutierrez, End2End ASR for Language Learning (reg Aug 2022, AStar scholar) (co-supervised with AStar: Nancy Chen)
  4. Nikita Kuzmin, Disentanglement for Speaker verification and privacy (reg Aug 2022, AStar scholar) (co-supervised with AStar: Lee Kong Aik)
  5. Kwok Chin Yuen, Acoustic modelling of targeted domain speech (Children's speech acoustic modelling) (reg Aug 2021, MEng, converted to PhD program, Aug 2022)
  6. Hu Yuchen 胡宇晨, robust End-to-End ASR (reg Aug 2021, MEng, converted to PhD program, Aug 2022). QE Slides (2023)
  7. Yip Jia Qi, Neural Networks for Speaker Extraction and its interdisciplinary applications (reg Aug 2021, Alibaba PhD)
  8. Ng Dian Wen, Domain adaptation for End-to-End ASR (reg Jan 2021, Alibaba PhD)
  9. Chen Chen, End-to-End ASR (reg Jan 2021, PhD)
  10. Zou HeQing, Multimodal Machine Learning (reg Jan 2021, PhD) (Co-supervisor) (Sup:Deepu Rajan)
  11. Rae Koh Jia Xin, Singapore English (reg Aug 2019, PhD)

Subsection 2.4.1 Current MEng and Masters Program Students

  1. Qin Xiaokai (2023 Aug~), MSAI student, Deepfake audio generation using voice conversion
  2. Azmat Adnan (2023 Aug~), MSAI student, DNN Approaches for noisy speech diarization
  3. Zhuo Ning (2023 Aug~), Masters Cybersecurity student, Deep Fake Corpus development and detection

Subsection 2.4.2 Collaborating Graduate Students

  1. Yang Yuhang (2023 June~, PhD student, Hunan University, China), LLM ASR
  2. Zhang Xiangyu (2023 June~, PhD student, UNSW, Australia), Depression classification
  3. Chen Weiguang (2022 June~, PhD student, Hunan University), Diarization using multi-channel approaches

Subsection 2.4.3 Collaborating Graduate Students (China)

Every year, we host graduate students from China. We have hosted students through the China Scholarship Council (CSC) program and from Xinjiang University, Tianjin University and Northwestern University. The visits have been very rewarding, and many publications have come out of them. We hope to see more such outstanding students, so do apply!

  1. Le Yuquan (2023 Oct~ Oct 2024, CSC visitor), PhD student (Hunan University, China), LLM for Legal
  2. Luo Juan (2023 Oct~ Oct 2024, CSC visitor), PhD student (Hunan University, China), Audio event detection and classification
  3. Bo Han (2023 Oct~ Oct 2024, CSC visitor), PhD student (Zhejiang Uni, China), Deep Fake TTS audio generation
  4. Zhao Yang (2021 Oct ~2023 Oct, CSC visitor, PhD student Xian JiaoTong Uni, China), Semi/Self supervised representation for speech recognition
  5. Peng Yizhou (2020 June ~2022 June, Masters student Xinjiang, China), ASR development (Kaldi and End2End)
  6. Yang Yuhang (2021 June~2023 June, Masters student Xinjiang, China), WeNet ASR, End2End ASR
  7. Guo Yachao (2021 June~2023 June, Masters student Xinjiang, China), End2End Hotword LM Adaptation

Subsection 2.4.4 Past PhD Students

  1. Andrew Koh Jin Jie, Sequence to Sequence Machine Learning (reg Aug 2019, PhD, submitted Thesis)
  2. Zhao Yingzhu, End-to-End speech recognition (reg Jan 2019, PhD, graduated May 2023). Oral Defence rehearsal, PhD Slides, PhD report, PhD LaTeX folder
  3. Hou Nana, Robust LVCSR for air traffic control speech (reg Jan 2017, PhD, submitted Jan 2022)
  4. Xu Chenglin, PhD Slides PhD Thesis (2020) Single Channel Multi-talker Speech Separation with Deep Learning
  5. Paul Chan, Synthesis of the human singing voice (2020)
  6. Khassan Yerbolat, PhD Slides, Online Presentation (April 2020) and final PhD thesis (2020), Language Model Domain Adaptation for Automatic Speech Recognition Systems.
  7. Pham Van Tung, PhD Thesis(2019) Robust Spoken Term Detection using partial search and re-scoring hypothesized detections techniques. Now in NTU.
  8. Tian Xiaohai, PhD Thesis(2019) Voice Conversion with Parallel/Non-Parallel Data and Synthetic Speech Detection. Now in NUS.
  9. Chong Tze Yuang, PhD Thesis, Slides, Thesis organization (2018), Exploiting Long Context Using Joint Distance and Occurrence Information for Language Modeling.
  10. Nguyen Duc Hoang Ha, PhD Thesis(2017) Slides Feature based robust techniques for speech recognition.
  11. Nguyen Trung Hieu, PhD Thesis(2015). Speaker Diarization in Meeting room domain. Now at Alibaba.
  12. Do Van Hai, PhD Thesis(2015). Acoustic modelling of speech under limited training data condition. Now in Vietnam Telecoms.
  13. Wu Zhizheng, PhD Thesis(2015). Spectral Mapping for Voice Conversion.
  14. Jonathan Dennis, PhD Thesis(2014). Slides, Sound Event Recognition in Unstructured Environments using Spectrogram Image Processing.
  15. Wang Lei, (2013). Audio Pattern Discovery and retrieval.
  16. Tong Rong, PhD Thesis(2012). Towards high performance phonotactic features for spoken language recognition. Now at Alibaba.
  17. Omid Dehzanghi, (2012). Discriminative Learning for speech recognition, U of Michigan
  18. Xiao Xiong, PhD Thesis (2009). Robust speech features and acoustic models for speech recognition. QE (2006), Speech Enhancement with Applications in speech recognition. Now at Microsoft, US, since Apr 2017.
  19. Wang Jinjun, PhD (2008), Content based sports video analysis and composition. Now in Xian Jiaotong.

Subsection 2.4.5 Past MEng Students

  1. Tanmay Surana, Deep Learning-based Text Augmentation for Named Entity Recognition (reg Aug 2021, MEng, completed Oct 2023)
  2. Prachaseree Chaiyasait, Adaptation of Language Models via Text Augmentation (reg Aug 2021, MEng, submitted Jul 2023, completed Oct 2023)
  3. Kyaw Zin Tun, Named entity recognition for chatbot applications (MEng, started Aug 2020, submitted thesis Aug 2022)
  4. Xue Fuzhao, Information extraction from text (MEng 2020)
  5. Lim Zhi Hao, (MEng 2020), Anti-Spoofing Techniques for Robust Speaker Verification, Thesis (2020)
  6. Ho Thi Nga, (MEng 2019), Sentence unit detection for automatic speech transcripts using lexical information
  7. Leow Sujun, (MEng 2018), Image Processing Technique for Speech Signal Processing
  8. Nguyen Quy Hy, (MEng 2017), Voice conversion using DNN
  9. Steven Du, (MEng 2015), Robust Front End for Speaker Verification
  10. Terrence Ng Wen Zheng, Thesis,(MEng 2014), Sound Event recognition in home environment
  11. Chen Wenda, (MEng 2014), Computer Assisted Language Learning
  12. Ben Pham Chau Khoa, Thesis, (MEng 2012), Robust VAD
  13. Eugene Koh, (MEng 2009), Speaker Diarization

Subsection 2.4.6 Past MSAI Students and Other collaboration students

  1. Jiang Yufei (2022 Aug~ 2023 Aug), MSAI (NTU), Adopting Neural Translation Model in Data Generation for Inverse Text Normalization
  2. Liu Jiaxing (2021 Oct ~2023 Oct, CSC visitor, PhD student Tianjin University, China), Multi-modal emotion recognition
  3. Cheng Qi (2021 Oct ~2022 Oct, CSC visitor, PhD student Harbin Engineering Uni, China), Graph Neural Network for Lattice rescoring
  4. Samuel Samsudin Ng, (MSAI 2020-S1), Speech emotion recognition with AlexNet and Fully convolutional network, Sam's MSAI Thesis, GitHub repository, kaggle iEmoCap
  5. Cheung Chin Ka, (MSAI 2020-S1), Acoustic Scene Classification with cutting edge hyperparameter tuning tool, Andy's MSAI Thesis
  6. Liu Bozhong, (MSAI 2020-S2), Wakeup keyword detection for far-field microphone array using end to end framework