Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration

Mark Endo , Xiaohan Wang , Serena Yeung-Levy

In International Conference on Computer Vision (ICCV) 2025

Our repository is split into two sections, one for model code and the other for evaluation.

Setup

Clone the repository

git clone https://github.com/markendo/FEATHER
cd FEATHER

Install packages

conda create -n feather python=3.10 -y
conda activate feather
cd prismatic-vlms
pip install -e .
cd ../vlm-evaluation
pip install -e .
cd ..

Prepare data

The script for preparing evaluation datasets is at vlm-evaluation/scripts/datasets/prepare.py. More information about dataset preparation is available in the original codebase. Lastly, copy your HuggingFace token to vlm-evaluation/.hf_token.

Inference and Evaluation

We provide code for our experiments on evaluating various criteria for token pruning such as FastV, our modified version removing RoPE from the criteria, and our final FEATHER approach.

export DATASET_ROOT_DIR=/path/to/dataset/directory/
cd vlm-evaluation
bash scripts/eval_fastv.sh
bash scripts/eval_fastv_norope.sh
bash scripts/eval_feather.sh

Below are the results for RefCOCO and OCID-Ref.

Criteria	OCID-Ref	RefCOCOg	RefCOCO+	RefCOCO
FastV	5.8	5.0	6.6	7.6
FastV w/o RoPE	23.2	15.0	13.8	15.4
FEATHER	32.5	38.7	38.7	43.4

The main implementation of FEATHER is provided in prismatic-vlms/prismatic/models/backbones/llm/llama2_models.py. Note that results can vary slightly based on attention implementation.

Acknowledgments

This repository is built on top of the prismatic-vlms and vlm-evaluation codebases.

Citation

@article{endo2025feather,
  author    = {Endo, Mark and Wang, Xiaohan and Yeung-Levy, Serena},
  title     = {Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration},
  journal   = {ICCV},
  year      = {2025},
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
assets		assets
prismatic-vlms		prismatic-vlms
vlm-evaluation		vlm-evaluation
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration

Mark Endo , Xiaohan Wang , Serena Yeung-Levy

In International Conference on Computer Vision (ICCV) 2025

Setup

Inference and Evaluation

Acknowledgments

Citation

About

Uh oh!

Releases

Packages

Languages

License

markendo/FEATHER

Folders and files

Latest commit

History

Repository files navigation

Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration

Mark Endo , Xiaohan Wang , Serena Yeung-Levy

In International Conference on Computer Vision (ICCV) 2025

Setup

Inference and Evaluation

Acknowledgments

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages