CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition
Deepti B. Hegde, Jeya Maria Jose Valanarasu, Vishal Patel
1st Workshop on Open-Vocabulary 3D Scene Understanding
13:20 - 13:30 | Welcome & Introduction | |
13:30 - 14:00 | Keynote: Jen Jen Chung | Getting Robots to Touch Things Appropriately |
14:00 - 14:30 | Keynote: Vishal Patel | CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition |
14:30 - 14:45 | Oral Sessions / Challenge Winners | |
14:45 - 15:15 | Keynote: Thomas Funkhouser | |
15:15 - 16:00 | Poster Session & Coffee Break | |
16:00 - 16:30 | Keynote: Angela Dai | |
16:30 - 17:00 | Keynote: Manolis Savva | 3D Simulation for Embodied AI: Emerging Challenges and Opportunities |
17:00 - 17:30 | Panel Discussion |
Professor Vishal Patel
Johns Hopkins University
Professor Angela Dai
Technical University of Munich
Professor Manolis Savva
Simon Fraser University
Professor Thomas Funkhouser
Google / Princeton University
Professor Jen Jen Chung
University of Queensland
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition
Deepti B. Hegde, Jeya Maria Jose Valanarasu, Vishal Patel
CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP
Junbo Zhang, Runpei Dong, Kaisheng Ma
The Change You Want to See (Now in 3D)
Ragav Sachdeva, Andrew Zisserman
Learning to Prompt CLIP for Monocular Depth Estimation: Exploring the Limits of Human Language
Dylan Auty, Krystian Mikolajczyk
SAM3D: Segment Anything in 3D Scenes
Yunhan Yang, Xiaoyang Wu, Tong He, Hengshuang Zhao, Xihui Liu
POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
Antonin Vobecky, Oriane Siméoni, David Hurych, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic
OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data
Shiyang Lu, Haonan Chang, Eric P. Jing, Yu Wu, Abdeslam Boularias, Kostas Bekris
Rank | Team | Method | mAP (↑) | AP_50 (↑) | AP_25 (↑) |
1 | PICO-MR
|
- | 6.08 | 14.08 | 17.67 |
2 | VinAI-3DIS |
GitHub | 4.13 | 12.14 | 39.41 |
3 | MSL | - | 2.67 | 5.06 | 13.98 |