ICLR 24 (spotlight), ASRU 23, and ICASSP 24 (oral) papers are accepted

February 2, 2024

ASRU 23

Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting, ASRU 2023

Work done as individual contributor and tech lead in Amazon AGI

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Work done as intern manager with the leading author Yu Yu in Amazon AGI

ICLR 24

Large Language Models are Efficient Learners of Noise-Robust Speech Recognition, Spotlight Presentation

It’s Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition

ICASSP 24

Can whisper perform speech-based in-context learning?

Oral, Best paper candidate

Hot-fixing wake word recognition for end-to-end ASR via neural model reprogramming

Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Towards ASR robust spoken language understanding through in-context learning with word confusion networks