Selected Publications
✱Both authors contributed equally.
|
|
|
DA2: Depth Anything in Any Direction
Haodong Li,
Wangguangdong Zheng,
Jing He,
Yuhao Liu,
Xin Lin,
Xin Yang,
Ying-Cong Chen,
Chunchao Guo
ICLR 2026
arXiv
/
Paper
/
Project Page
/
Code
/
Demo
/
Data
/
Slides
Powered by large-scale training data curated from our panoramic data curation engine, and the SphereViT for addressing the spherical distortions in panoramas,
DA2 is able to predict dense, scale-invariant distance from a single 360° panorama in an end-to-end manner,
with remarkable geometric fidelity and strong zero-shot generalization.
|
|
|
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He✱,
Haodong Li✱,
Wei Yin,
Yixun Liang,
Leheng Li,
Kaiqiang Zhou,
Hongbo Zhang,
Bingbing Liu,
Ying-Cong Chen
ICLR 2025
arXiv
/
Paper
/
Project Page
/
Code
/
Demo (D)
/
Demo (N)
/
ComfyUI
Lotus is a diffusion-based visual foundation model with a simple yet effective adaptation protocol,
aiming to fully leverage the pre-trained diffusion's powerful visual priors for dense prediction.
With minimal training data, Lotus achieves SoTA performance in two key geometry perception tasks, i.e., zero-shot monocular depth and normal estimation.
|
Academic Service
| Reviewer: |
CVPR 2025 (1), ICLR 2026 (5), CVPR 2026 (2), ICML 2026 (6), SIGGRAPH 2026 (1), ECCV 2026 (2), NeurIPS 2026 (4). |
|
|