[ICLR 2026] 🦅 FALCON: an effective vision-language-action model that injects rich 3D spatial tokens into the action head, enabling robust spatial understanding and SOTA performance across diverse manipulation tasks.

vla iclr generalist-robot-policies vision-language-action-model spatial-understanding iclr2026

Updated May 26, 2026
Python

turingmotors / STRIDE-QA-Dataset

[AAAI 2026 Oral] STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes

autonomous-driving visual-question-answering vision-language-models spatial-understanding

Updated Jan 23, 2026
Python

MatchLab-Imperial / POMA-3D

POMA-3D: The Point Map Way to 3D Scene Understanding.

3d-representation-learning 3d-scene-understanding spatial-understanding

Updated Nov 9, 2025

Xiaohao-Xu / Ambiguity-in-Space

[ECCV 2026] One Scene, Two Depths: Probing Geometric Ambiguity in Monocular Foundation Models (Layered 3D Spatial Understanding)

representation depth-estimation interpretability eccv latent-space foundation-models visual-prompting spatial-intelligence multi-layer-depth spatial-understanding eccv2026 model-bias geometric-ambiguity depth-layer-preference spatial-prompting

Updated Jun 30, 2026
Python

bytedance / SIFThinker

SIFThinker: Spatially-Aware Image Focus for Visual Reasoning

research reinforcement-learning mllms spatial-understanding

Updated Dec 2, 2025
Python

kcsayem / handvqa

[CVPR 2026] HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models

benchmark vlm spatial-understanding

Updated Apr 30, 2026
Python

ashwin-ned / autoregressive-mosaics

Autoregressive Mosaics is a project that attempts to force an LLM trained only on text to paint a picture one discrete pixel at a time.

ai-art spatial-understanding autoregressive-llms

Updated May 24, 2026
HTML

spatial-understanding

Here are 16 public repositories matching this topic...

google-deepmind / tips

cambrian-mllm / cambrian-s

InternRobotics / G2VLM

NVlabs / SpatialClaw

Yangr116 / VST

InternLM / Spatial-SSRL

blurgyy / CoMPaSS

LiuHengyu321 / IR3D-Bench

NVlabs / 4D-RGPT

FALCON-VLA / FALCON

turingmotors / STRIDE-QA-Dataset

MatchLab-Imperial / POMA-3D

Xiaohao-Xu / Ambiguity-in-Space

bytedance / SIFThinker

kcsayem / handvqa

ashwin-ned / autoregressive-mosaics
