I am an incoming CS Ph.D. student at the University of Southern California, co-advised by Prof. Yue Wang and Prof. Daniel Seita. Previously, I was an AI Resident at the FPT AI Center. I graduated with a bachelor's degree in Computer Science at Ho Chi Minh City University of Science. These days, my research interest lies in the intersection of Robotics, Multimodal Learning, and Generative Modeling.
We introduce a novel diffusion model incorporating the new concept of negative prompt guidance learning to tackle the task of 6-DoF grasp detection in cluttered point clouds.
We introduce HabiCrowd, a new dataset and benchmark for crowd-aware visual navigation that surpasses other benchmarks in terms of human diversity and computational utilization.
We address the task of language-driven affordance-pose detection in 3D point clouds. Our method simultaneously detect open-vocabulary affordances and generate affordance-specific 6-DoF poses.
Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation Tuan Vo,
Minh Nhat Vu,
Baoru Huang,
Tien Toan Nguyen,
Ngan Le,
Thieu Vo,
Anh Nguyen IEEE International Conference on Robotics and Automation (ICRA), 2024 arXiv |
Code
We introduce a new open-vocabulary affordance detection method using knowledge distillation and text-point correlation.
We introduce Language-Driven Scene Synthesis task, which involves the leverage of human-input text prompts to generate physically plausible and semantically reasonable objects.