arXiv Analytics

Sign in

arXiv:1703.08840 [cs.LG]AbstractReferencesReviewsResources

Inferring The Latent Structure of Human Decision-Making from Raw Visual Inputs

Yunzhu Li, Jiaming Song, Stefano Ermon

Published 2017-03-26Version 1

The goal of imitation learning is to match example expert behavior, without access to a reinforcement signal. Expert demonstrations provided by humans, however, often show significant variability due to latent factors that are not explicitly modeled. We introduce an extension to the Generative Adversarial Imitation Learning method that can infer the latent structure of human decision-making in an unsupervised way. Our method can not only imitate complex behaviors, but also learn interpretable and meaningful representations. We demonstrate that the approach is applicable to high-dimensional environments including raw visual inputs. In the highway driving domain, we show that a model learned from demonstrations is able to both produce different styles of human-like driving behaviors and accurately anticipate human actions. Our method surpasses various baselines in terms of performance and functionality.

Related articles: Most relevant | Search more
arXiv:2005.02435 [cs.LG] (Published 2020-05-05)
Effect of The Latent Structure on Clustering with GANs
arXiv:2012.02785 [cs.LG] (Published 2020-12-04)
Unsupervised embedding of trajectories captures the latent structure of mobility
arXiv:2403.13219 [cs.LG] (Published 2024-03-20)
Diffusion Model for Data-Driven Black-Box Optimization