@Ringssss
No description provided.

@Ringssss requested a review from a team on January 15, 2026, 16:21
@Ringssss
Author

MiniCPM-V real-image throughput (bs=1/8/16/32/64)

Script: scripts/bench_minicpmv_mm_kv_compression_throughput.py

python scripts/bench_minicpmv_mm_kv_compression_throughput.py \
  --dev nvidia \
  --model-dir /data/huggingface/MiniCPM-V-2_6 \
  --image /home/zhujianian/cvpr/wuhang/bus.jpg \
  --question "图片是什么?" \
  --batch-sizes 1,8,16,32,64 \
  --max-tokens 768 \
  --decode-steps 64 \
  --warmup-steps 4 \
  --compress-bin ./compress_ckpt/minicpmv_mlp_local.bin \
  --compression-factor 5 \
  --min-seq-len 2

Sample results (decode TPS; prompt_len=519, image_kv_len=492):

  • bs=1: 93.83 → 94.35
  • bs=8: 319.54 → 322.85
  • bs=16: 349.37 → 353.19
  • bs=32: 368.67 → 372.20
  • bs=64: 376.67 → 380.27
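
To make the size of the change easier to read, the pairs above can be turned into relative differences. This is only a sketch: the numbers are copied from the list, each pair is read as left value → right value without assuming which configuration each side represents, and `relative_change_pct` is a hypothetical helper name, not something from the benchmark script.

```python
# Decode TPS pairs copied from the sample results above: (left value, right value).
results = {
    1: (93.83, 94.35),
    8: (319.54, 322.85),
    16: (349.37, 353.19),
    32: (368.67, 372.20),
    64: (376.67, 380.27),
}


def relative_change_pct(before: float, after: float) -> float:
    """Percentage change from the left-hand value to the right-hand value."""
    return (after - before) / before * 100.0


# Relative change per batch size.
changes = {bs: relative_change_pct(a, b) for bs, (a, b) in results.items()}

for bs, pct in changes.items():
    print(f"bs={bs:>2}: {pct:+.2f}%")
```

On these numbers the difference stays at roughly one percent or less for every batch size, so repeating the runs to estimate variance would help show whether the gap is significant.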

Collaborator

Are you sure you want to add a .bin file to the repository?

#define INFINICORE_INFER_H

#include "infinicore_infer/cache.h"
#include "infinicore_infer/kv_compression.h"
Collaborator

So this feature is still being added on top of 0.1.0?

11,
0
]
} No newline at end of file
Collaborator

Add a trailing newline at the end of the file.
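
One way to check this locally before pushing is sketched below. `ends_with_newline` is a hypothetical helper written for illustration, not part of this repository, and the temporary file stands in for the JSON file under review.

```python
import os
import tempfile


def ends_with_newline(path: str) -> bool:
    """Return True if the file's last byte is a newline character."""
    with open(path, "rb") as f:
        f.seek(0, os.SEEK_END)
        if f.tell() == 0:
            return False  # an empty file has no trailing newline
        f.seek(-1, os.SEEK_END)
        return f.read(1) == b"\n"


# Demo on a temporary file that ends the way the diff above does.
with tempfile.NamedTemporaryFile("wb", suffix=".json", delete=False) as tmp:
    tmp.write(b"[11, 0]")  # no trailing newline
    demo_path = tmp.name

missing_before = not ends_with_newline(demo_path)

# Fix: append a single newline when it is missing.
if missing_before:
    with open(demo_path, "ab") as f:
        f.write(b"\n")

fixed_after = ends_with_newline(demo_path)
os.remove(demo_path)
```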

3 participants