Hi — this is very interesting work.
I had a quick question regarding the training data. In Appendix A and Table 10, it is mentioned that TVBench, STI-Bench, and MMR-VBench are used during training. However, these benchmarks are released strictly for validation and benchmarking purposes.
Could you please clarify how they are being used in training?
