SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
Baoxiong Jia, Yixin Chen, Huangyue Yu, Yan Wang, Xuesong Niu, Tengyu Liu, Qing Li, Siyuan Huang
3rd Workshop on
Open-Vocabulary 3D Scene
Understanding
14:00 - 14:15 | Welcome & Introduction |
14:15 - 14:45 | Keynote 1 Tim Meinhardt (NVIDIA) |
14:45 - 15:15 | Keynote 2 Or Litany (Technion) |
15:15 - 15:30 | Spotlight Mihai Dusmanu (Microsoft) |
15:30 - 16:30 | Poster Session & Coffee Break |
16:30 - 17:00 | Keynote 3 Alex Bewley (Google) |
17:00 - 17:30 | Keynote 4 Krishna Murthy (Meta) |
16:45 - 17:30 | Concluding Remarks |
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
Baoxiong Jia, Yixin Chen, Huangyue Yu, Yan Wang, Xuesong Niu, Tengyu Liu, Qing Li, Siyuan Huang
Unifying 3D Vision-Language Understanding via Promptable Queries
Ziyu Zhu, Zhuofan Zhang, Xiaojian Ma, Xuesong Niu, Yixin Chen, Baoxiong Jia, Siyuan Huang, Qing Li
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
Zhening Huang, Xiaoyang Wu, Xi CHEN, Hengshuang Zhao, Lei Zhu, Joan Lasenby
Task-oriented Sequential Grounding in 3D Scenes
Zhuofan Zhang, Ziyu Zhu, Pengxiang Li, Tengyu Liu, Xiaojian Ma, Yixin Chen, Baoxiong Jia, Siyuan Huang, Qing Li
Space3D-Bench: Spatial 3D Question Answering Benchmark
Emilia Szymanska, Mihai Dusmanu, Mahdi Rad, Marc Pollefeys
This website is licensed under a Creative
Commons Attribution-ShareAlike 4.0 International License.
It borrows the source code of this website.
We would like to thank Utkarsh Sinha and Keunhong Park.