Structured Latent Variable Models for Articulated Object Interaction
Published:
This paper explores how a robot can learn a low-dimensional representation of doors from videos of them opening or closing, enabling the inference of door-related parameters and interaction outcomes. Instead of relying solely on labeled datasets, the study employs a semi-supervised approach using the Neural Statistician, a structured latent variable model that separates shared context-level variables (common across all images of the same door) from instance-level variables (specific to each image). The model effectively generates realistic door image embeddings, which outperform context-free baselines in tasks such as predicting door parameters and optimizing actions in a visual bandit door-opening scenario, demonstrating its utility for more efficient and accurate robotic interaction.
Recommended citation: Structured Latent Variable Models for Articulated Object Interaction. Emily Liu, Michael Noseworthy, Nicholas Roy. 2023.
Download Paper
