umbertocappellazzo

Umberto Cappellazzo umbertocappellazzo

Research Associate @ Imperial College London. I work on improving audio-visual speech recognition models through LLMs.

Achievements

Llama-AVSR Llama-AVSR Public

Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigating Attention Sinks and Massive Activations in Audio-Visual …

Python 52 4
Omni-AVSR Omni-AVSR Public

Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models".

Python 28 2
PETL_AST PETL_AST Public

This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture…

Python 38 4
CL_SLU CL_SLU Public

"An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding", accepted at INTERSPEECH 2023.

Python 8