Skip to main content

Section 2.5 Internship in Speech Lab@SCSE, NTU

We have a tradition of hosting senior undergraduate students (final year) and graduate students (Masters and PhD) in our lab.

Typically we only host students who can work with us for 6 months or more in 2 sessions. Session 1: Jan-June, Session 2: Jul->Dec each year. During covid (it will be remote), and if NTU allows student to come to Singapore, we will prefer you to be physically in our lab. Currently we only host students who are doing this as part of their course sanctioned by their school and its graded *so its not a personal arrangement*.

To apply, you should be writing to us at least 1/2 year before the start of the attachment. Kindly send me an email with a detailed cv. As we receive huge number of applicants, only shortlisted students will receive a response.

We have two tracks, one for research and the other for development. For research, we expect students to have work with pyTorch, tensorflow, etc. For development, we expect experiene with google summer of code, full stack experience. During this internship, you will be actively working with our speech-team researchers on tasks such as implementing and realising state-of-the-art ML/DL techniques and deploying them for various tasks listed below:

Our current Research projects are:

DNN and End-to-end approaches for speech audio processing and classification

  1. Classification from Audio: Speaker profiling (age, height, weight, accent, emotion) classification, Audio Event classification (DCASE), Speaker diarization and overlapping speech detection
  2. Speech Enhancement and Audio processing: Enhancing noisy speech to clean DNN approaches, Deep Fake Speech: Modifying audio eavefiles to target environment/speaker/ noise using GAN

Our current Development projects are:

  1. Terraform and cloud deployment of speech engine with scaling, auto-update, security and dashboard.
  2. MAGOR - search and indexing speech, video and audio
  3. Async speech recording and recognition Interface - recording and async update of speech recognised with update on name entity
  4. Transcriptor - GUI for correction of erroneous recognised text

Upon completion of the project, we expect a formal project report + code repository, and if possible submit the work for publication.

Pls find attached a writeup by Shashank (2021 intern) onboarding help file, have a look. Onboarding writeup by Shashank

Current and Past Interns:

The list is incomplete, we began to the tradition of taking pictures with the students from India about 2017, there were many others. The pictures here mainly are undergraduate students, and details included inside.

  1. 2023 June~Dec(fully remote)
  2. Some pictures of our group meetings with Interns from 2023(fully remote)
  3. Some pictures of our group meetings with Interns from 2022(fully remote)
  4. Some pictures of our group meetings with Interns from 2021(fully remote)
  5. Interns from 2020 (July-Dec) (fully remote)
  6. Interns from 2020 (Jan-May) (our last group before we could not have students in Singapore)
  7. Interns from 2019 (Nov, Grp1)
  8. Interns from 2019 (Nov, Grp2)
  9. Interns from 2019 (Jul)
  10. Interns from 2019 (Apr)
  11. Interns from 2018 (Oct)
  12. Interns from 2018 (Jul)
  13. Interns from 2018 (May)
  14. Interns from 2018 (Jan)
  15. Interns from 2017 (Jul)
  16. Gangeshwar Krishnamurthy,2017 (Jan-Apr)
  17. Gao Shengheng,2016 (Mar-Jun)