Underline digital video library

Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration
Zhixuan Shen and 4 other authors

Semantic Segmentation on Raindrop Degraded Images Using Two-Stage Dual Teacher-Student Learning
Xin Yang and 4 other authors

FloNa: Floor Plan Guided Embodied Visual Navigation
Li Jiaxin and 5 other authors

ERF: A Benchmark Dataset for Robust Semantic Segmentation Under Extreme Rainfall Conditions
Xin Yang and 2 other authors

ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction
Yi Feng and 5 other authors

Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding
Xianqiang Gao and 6 other authors

MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration
Yishuai Cai and 6 other authors

GNN-Transformer Task Planning Enhanced with Semantic-Driven Data Augmentation
Soojin Jeong and 4 other authors

Multi-Modal Grounded Planning and Efficient Replanning for Learning Embodied Agents with a Few Examples
Taewoong Kim and 2 other authors

Neural Assembler: Learning to Generate Fine-Grained Robotic Assembly Instructions from Multi-View Images
Hongyu Yan and 1 other author

Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation
Fangyuan Wang and 5 other authors

Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation
Yiyuan Pan and 3 other authors