AIML Special Presentation: Active Perception and Reasoning in Open Worlds
- Date: Tue, 9 Dec 2025, 10:30 am - 11:30 am
- Location: AIML
- Dr Shijie Li Scientist at the Agency for Science, Technology and Research (A*STAR) in Singapore
Abstract: Building intelligent systems that can perceive, reason, and act in open worlds remains a grand challenge. In this talk, I will share our journey toward active perception鈥攆rom unifying 2D vision-language understanding to structured 3D reasoning and high-level foresight.
We begin with 2D perception, rethinking visual tokenization and cognitive reasoning in multimodal models to move beyond recognition toward interpretable understanding. Progressing into 3D, we explore how agents can perceive and reason about spatial structures in the physical world, grounding language in geometry and learning through self-driven curiosity. Finally, we advance toward high-level imagination and foresight, empowering models to infer unseen structures, anticipate future events, and reason causally about dynamic environments.
Together, these efforts bridge perception, reasoning, and imagination鈥攑aving the way toward intelligent agents capable of understanding and interacting with complex, ever-changing real-world environments.