Science Cast

Sensorimotor World Models: Perception for Action via Inverse Dynamics

librarian · June 19, 2026 2:14am


This paper is a preprint and has not been certified by peer review.


arXiv · June 18, 2026 12:00am

Authors

Petr Ivashkov, Randall Balestriero, Bernhard Schölkopf

Abstract

Perception for action suggests that representations of the world should be shaped not by visual fidelity alone, but by their relevance for actions. At the same time, latent JEPA-style world models advocate learning compact predictive states from high-dimensional observations to facilitate the prediction of future states, but end-to-end training of these models is nontrivial because representations may collapse if our only goal is to construct a latent state that is easy to predict. We introduce a sensorimotor world model (SMWM): a latent world model trained end-to-end with inverse dynamics regularization. This single regularizer addresses both issues: it prevents representation collapse and induces action-aligned representations. By forcing latent states to preserve information about the action underlying a transition, it biases the model toward the controllable degrees of freedom of the environment while discarding uncontrollable distractors. This yields stable latent world models trained from offline, reward-free trajectories, without frozen encoders, exponential moving averages, or complex latent regularizers. Empirically, SMWM learns compact, interpretable latent spaces and enables competitive planning performance across simple 2D and 3D control tasks.
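To make the collapse argument in the abstract concrete, here is a minimal sketch of a latent prediction loss combined with an inverse dynamics term. It is not the authors' implementation. The toy environment (a controllable position plus an uncontrollable noise distractor), the scalar linear encoder, and every name in it (`encode`, `smwm_loss`, `make_transitions`, the weight `lam`) are assumptions made for illustration only.

```python
import random

def encode(w, obs):
    # Hypothetical linear encoder: scalar latent from (position, distractor).
    return w[0] * obs[0] + w[1] * obs[1]

def smwm_loss(w, b, c, transitions, lam=1.0):
    """Return (prediction loss, inverse dynamics loss, total) as means.

    w: encoder weights, b: predictor gain on the action,
    c: inverse dynamics gain on the latent difference.
    """
    pred = inv = 0.0
    for obs, a, next_obs in transitions:
        z, z_next = encode(w, obs), encode(w, next_obs)
        # Forward model: predict the next latent from the current latent and action.
        pred += (z + b * a - z_next) ** 2
        # Inverse dynamics: recover the action from the latent transition.
        inv += (c * (z_next - z) - a) ** 2
    n = len(transitions)
    return pred / n, inv / n, (pred + lam * inv) / n

def make_transitions(n, seed=0):
    # Toy data (assumed): position moves by the action, distractor is fresh noise.
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x, a = rng.uniform(-1, 1), rng.uniform(-1, 1)
        d, d_next = rng.gauss(0, 1), rng.gauss(0, 1)
        data.append(((x, d), a, (x + a, d_next)))
    return data

data = make_transitions(256)
collapsed = smwm_loss((0.0, 0.0), 0.0, 1.0, data)   # encoder maps everything to 0
aligned = smwm_loss((1.0, 0.0), 1.0, 1.0, data)     # encoder keeps only the position
distracted = smwm_loss((0.0, 1.0), 0.0, 1.0, data)  # encoder keeps only the noise
```

In this toy setting the collapsed encoder gets a perfect prediction loss but cannot recover the action, so the inverse dynamics term penalizes it. The encoder that keeps only the controllable position drives both terms to zero, while the encoder that keeps only the distractor is penalized by both.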
