Great job! I was wondering how did you evaluate LLaVA-v1.6, Qwen2-VL, or Qwen2.5-VL? Did you still use [LLaVA](https://github.com/haotian-liu/LLaVA) evaluation settings?