Rank | Team Name | Score |
---|---|---|
1 | UWIPL_ETRI | 96.0789 |
2 | HCMUT.VNU | 91.9735 |
3 | Embia (smolRGPT) | 90.6772 |
4 | MIZSU | 73.0606 |
5 | HCMUS_HTH | 66.8861 |
6 | MealsRetrieval | 56.6352 |
7 | BKU22 | 50.3662 |
8 | Smart Lab | 31.9245 |
9 | AICV | 28.2993 |

Method | Parameters | Below/Above | Left/Right | Big/Small | Tall/Short | Wide/Thin | Behind/Front | Avg. |
---|---|---|---|---|---|---|---|---|
GPT-4 | 1.76T | 64.1 | 42.8 | 42.8 | 61.6 | 61.6 | 49.0 | 57.8 |
GPT-4V | 1.76T | 63.3 | 46.6 | 64.1 | 60.7 | 68.2 | 45.4 | 58.1 |
LLaVA-v1.6-34B | 34B | 44.1 | 45.7 | 36.7 | 53.5 | 37.5 | 45.4 | 43.9 |
GPT-4V+SoM | 1.76T | 75.0 | 55.2 | 42.4 | 54.4 | 49.0 | 47.2 | 54.3 |
LLaVA-v1.6-34B+SoM | 34B | 44.1 | 40.0 | 33.9 | 47.3 | 41.3 | 46.3 | 42.3 |
Kosmos-2 | 1.3B | 28.3 | 15.2 | 4.71 | 26.7 | 12.5 | 12.7 | 17.0 |
RegionVILA | 7B* | 30.8 | 47.6 | 35.8 | 44.6 | 35.5 | 49.0 | 40.4 |
SmolRGPT | 600M | 71.6 | 49.5 | 67.9 | 74.1 | 51.9 | 79.0 | 65.6 |
SpatialRGPT | 7B* | 99.1 | 99.0 | 79.2 | 89.2 | 83.6 | 87.2 | 89.8 |
SpatialRGPT-Depth | 7B* | 99.1 | 99.0 | 80.1 | 91.9 | 87.5 | 91.8 | 91.7 |

Method | Parameters | Direct Distance | Horizontal Distance | Vertical Distance | Width | Height | Direction |
---|---|---|---|---|---|---|---|
GPT-4 | 1.76T | 21.6 | 11.5 | 33.0 | 52.3 | 48.1 | 34.6 |
GPT-4V | 1.76T | 29.7 | 25.4 | 33.0 | 51.1 | 68.4 | 43.9 |
LLaVA-v1.6-34B | 34B | 24.3 | 24.5 | 30.4 | 30.8 | 42.8 | 33.6 |
GPT-4V+SoM | 1.76T | 25.7 | 22.1 | 33.9 | 45.8 | 62.4 | 54.2 |
LLaVA-v1.6-34B+SoM | 34B | 12.8 | 20.4 | 11.3 | 9.02 | 7.52 | 11.3 |
Kosmos-2 | 1.3B | 4.05 | 4.91 | 18.9 | 3.01 | 3.10 | 3.82 |
RegionVILA | 7B* | 22.3 | 24.6 | 17.9 | 36.8 | 49.6 | 35.5 |
SmolRGPT | 600M | 35.8 | 18.3 | 33.9 | 18.05 | 20.3 | 35.5 |
SpatialRGPT | 7B* | 35.1 | 59.0 | 53.8 | 51.9 | 54.9 | 95.3 |
SpatialRGPT-Depth | 7B* | 41.2 | 65.6 | 51.9 | 49.6 | 57.9 | 95.3 |

@article{traore2025smolrgptefficientspatialreasoning,
title={SmolRGPT: Efficient Spatial Reasoning for Warehouse Environments with 600M Parameters},
author={Abdarahmane Traore and Éric Hervet and Andy Couturier},
year={2025},
eprint={2509.15490},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2509.15490}
}