arXiv Analytics

Sign in

arXiv:2006.16225 [cs.LG]AbstractReferencesReviewsResources

Object Files and Schemata: Factorizing Declarative and Procedural Knowledge in Dynamical Systems

Anirudh Goyal, Alex Lamb, Phanideep Gampa, Philippe Beaudoin, Sergey Levine, Charles Blundell, Yoshua Bengio, Michael Mozer

Published 2020-06-29Version 1

Modeling a structured, dynamic environment like a video game requires keeping track of the objects and their states (\emph{declarative} knowledge) as well as predicting how objects behave (\emph{procedural} knowledge). Black-box models with a monolithic hidden state often lack \emph{systematicity}: they fail to apply procedural knowledge consistently and uniformly. For example, in a video game, correct prediction of one enemy's trajectory does not ensure correct prediction of another's. We address this issue via an architecture that factorizes declarative and procedural knowledge and that imposes modularity within each form of knowledge. The architecture consists of active modules called \emph{object files} that maintain the state of a single object and invoke passive external knowledge sources called \emph{schemata} that prescribe state updates. To use a video game as an illustration, two enemies of the same type will share schemata but will each have their own object file to encode their distinct state (e.g., health, position). We propose to use attention to control the determination of which object files to update, the selection of schemata, and the propagation of information between object files. The resulting architecture is a drop-in replacement conforming to the same input-output interface as normal recurrent networks (e.g., LSTM, GRU) yet achieves substantially better generalization on environments that have factorized declarative and procedural knowledge, including a challenging intuitive physics benchmark.

Comments: Under Review, NeurIPS 2020
Categories: cs.LG, stat.ML
Related articles: Most relevant | Search more
arXiv:1906.08720 [cs.LG] (Published 2019-06-20)
Boosting for Dynamical Systems
arXiv:2312.09234 [cs.LG] (Published 2023-12-14)
Let's do the time-warp-attend: Learning topological invariants of dynamical systems
arXiv:1912.10382 [cs.LG] (Published 2019-12-22)
Deep Learning via Dynamical Systems: An Approximation Perspective