[2025 TPAMI] Mettle: Meta-Token Learning for Memory-Efficient Audio-Visual Adaptation
audio-visual-segmentation audio-visual-video-parsing audio-visual-event-localization parameter-efficient-fine-tuning audio-visual-question-answering memory-efficient-fine-tuning
Updated Jan 3, 2026 - Python