Generative Computer Vision for the Physical World

Talk

Ruoshi Liu

Time:

03.12.2025 13:30 to 14:30

Location:

IRB 4105 or https://umd.zoom.us/j/94340703410?pwd=rrXaGSXSpabcMTtDNmeCNf2Ih2fQYE.1

URL:

https://talks.cs.umd.edu/talks/4145

Generative models are revolutionizing our world, with the ability to generate photorealistic visual content that are indistinguishable from reality. Despite their overwhelming presence in the cyber world, they haven’t been very useful in the physical world that we live in. In this talk, I will present how the rich priors learned by large-scale generative models—ranging from shape and geometry to motion and dynamics—can be harnessed for real-world perception and interaction tasks. I will showcase how these models can facilitate tasks like 3D reconstruction and robotic manipulation by incorporating the structure of the physical world. Moreover, I will discuss methods to further refine and adapt these systems through self-learning, enabling machines to continually improve as they explore new scenarios and environments. Together, these breakthroughs build the foundation for my vision of creating self-supervised machines that can perceive and interact with the physical world.

Generative Computer Vision for the Physical World

Talk

Talk

Talk

Talk

Talk

Event

Event

Event

Event

Event