2nd Workshop on Open-Vocabulary 3D Scene Understanding
13:30 - 13:45 | Welcome & Introduction | |
13:45 - 14:15 | Keynote 1 | Kristen Grauman (Uni. of Texas at Austin) |
14:15 - 14:45 | Keynote 2 | Chung Min Kim, Justin Kerr (UC Berkeley) |
14:45 - 15:00 | Winner Presentations | Track 1: VinAI-3DIS; Track 2: PICO-MR (2024) |
15:00 - 15:45 | Poster Session & Coffee Break | |
15:45 - 16:15 | Keynote 3 | Jiajun Wu (Stanford University) |
16:15 - 16:45 | Keynote 4 | Dave Gausebeck (Matterport) |
16:45 - 17:00 | Closing | |
An overview of last year's challenge results is available on a separate page. We have also published a technical report summarizing our ICCV 2023 workshop challenge.
Kristen Grauman
University of Texas at Austin
Jiajun Wu
Stanford University
Chung Min Kim
University of California, Berkeley
Justin Kerr
University of California, Berkeley
Dave Gausebeck
Matterport
AffordanceLLM: Grounding Affordance from Vision Language Models
Shengyi Qian, Weifeng Chen, Min Bai, Xiong Zhou, Zhuowen Tu, Li Erran Li
Zero-shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
Tri Ton, Ji Woo Hong, SooHwan Eom, Jun Yeop Shim, Junyeong Kim, Chang D. Yoo
Auto-Vocabulary Segmentation for LiDAR Points
Weijie Wei, Osman Ülger, Fatemeh Karimi Nejadasl, Theo Gevers, Martin R. Oswald
Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
Sebastian Koch, Narunas Vaskevicius, Mirco Colosi, Pedro Hermosilla Casajus, Timo Ropinski
ODIN: A Single Model for 2D and 3D Segmentation
Ayush Jain, Pushkal Katara, Nikolaos Gkanatsios, Adam Harley, Gabriel Sarch, Kriti Aggarwal, Vishrav Chaudhary, Katerina Fragkiadaki
QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding
Kumaraditya Gupta, Rohit Jayanti, Yash Mehan, Anirudh Govil, Sourav Garg, Madhava Krishna
Situational Awareness Matters in 3D Vision Language Reasoning
Yunze Man, Liang-Yan Gui, Yu-Xiong Wang
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Taisei Hanyu, Kashu Yamazaki, Benjamin R Runkle, Ngan Le