Research Associate @ Imperial College London. I work on improving audio-visual speech recognition models through LLMs.
- Imperial College London
- London
- https://umbertocappellazzo.github.io/
- @Umberto_Senpai
- in/umberto-cappellazzo-116093150
Pinned
- Llama-AVSR (Public): Official PyTorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigating Attention Sinks and Massive Activations in Audio-Visual …

