ASRU 23

Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting, ASRU 2023

  • Work done as individual contributor and tech lead in Amazon AGI

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

  • Work done as intern manager with the leading author Yu Yu in Amazon AGI

ICLR 24

Large Language Models are Efficient Learners of Noise-Robust Speech Recognition, Spotlight Presentation

Code

It’s Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition

Code

ICASSP 24

Can whisper perform speech-based in-context learning?

  • Oral, Best paper candidate

Hot-fixing wake word recognition for end-to-end ASR via neural model reprogramming

Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Towards ASR robust spoken language understanding through in-context learning with word confusion networks