Ph.D., Georgia Institute of Technology
I focus on 🗣️ speech-language alignment and scaling laws. Before joining NVIDIA, I spent time at Amazon AGI, working with Andreas Stolcke on Ivan Bulyko's team, and on the Google Speech and Brain teams (now DeepMind), co-hosted by Bo Li and Yu Zhang on Tara N. Sainath's team.
🎓 My Ph.D. research, advised by Prof. Chin-Hui Lee, is on noise-robust speech post-training adaptation.
Exploring semantic and non-semantic alignment for LLMs.
Developing sample-efficient and cross-modal inference.
Building robust evaluation frameworks and intervention-resilient architectures.
A comprehensive tutorial on integrating LLMs with speech recognition systems, covering task-activating prompting and cross-modal alignment techniques.
An introduction to parameter-efficient adaptation methods for speech models, including prompt-tuning and in-context learning approaches.