Internship in Speech Lab@SCSE, NTU

Section 2.5 Internship in Speech Lab@SCSE, NTU

We have a tradition of hosting senior undergraduate students (final year) and graduate students (Masters and PhD) in our lab.

Typically we only host students who can work with us for 6 months or more in 2 sessions. Session 1: Jan-June, Session 2: Jul->Dec each year. During covid (it will be remote), and if NTU allows student to come to Singapore, we will prefer you to be physically in our lab. Currently we only host students who are doing this as part of their course sanctioned by their school and its graded *so its not a personal arrangement*.

To apply, you should be writing to us at least 1/2 year before the start of the attachment. Kindly send me an email with a detailed cv. As we receive huge number of applicants, only shortlisted students will receive a response.

We have two tracks, one for research and the other for development. For research, we expect students to have work with pyTorch, tensorflow, etc. For development, we expect experiene with google summer of code, full stack experience. During this internship, you will be actively working with our speech-team researchers on tasks such as implementing and realising state-of-the-art ML/DL techniques and deploying them for various tasks listed below:

Our current Research projects are:

DNN and End-to-end approaches for speech audio processing and classification

Classification from Audio: Speaker profiling (age, height, weight, accent, emotion) classification, Audio Event classification (DCASE), Speaker diarization and overlapping speech detection
Speech Enhancement and Audio processing: Enhancing noisy speech to clean DNN approaches, Deep Fake Speech: Modifying audio eavefiles to target environment/speaker/ noise using GAN

Our current Development projects are:

Terraform and cloud deployment of speech engine with scaling, auto-update, security and dashboard.
MAGOR - search and indexing speech, video and audio
Async speech recording and recognition Interface - recording and async update of speech recognised with update on name entity
Transcriptor - GUI for correction of erroneous recognised text

Upon completion of the project, we expect a formal project report + code repository, and if possible submit the work for publication.

Pls find attached a writeup by Shashank (2021 intern) onboarding help file, have a look. Onboarding writeup by Shashank

Current and Past Interns:

The list is incomplete, we began to the tradition of taking pictures with the students from India about 2017, there were many others. The pictures here mainly are undergraduate students, and details included inside.