5 minute read

This post is based on the final chapter of the textbook “Active Inference” by Thomas Parr and Giovanni Pezzulo.


The Active Inference textbook by Thomas Parr, Giovanni Pezzulo, and Karl Friston

1. Introduction

Active Inference offers a new perspective that integrates previously separate concepts in psychology and neuroscience, such as perception, action selection, attention, and emotion regulation. Furthermore, it connects closely with existing frameworks like cybernetics, the ideomotor theory of action, reinforcement learning, and optimal control. Active Inference can also extend to biological, social, and technological domains.

The Low Road

A cat quickly reacting to a snake’s attack, anticipating the threat. Source: forgif

All sentient beings exist within a loop of action and perception with their environment. One way to understand Active Inference is by considering the brain as a prediction machine equipped with a generative model of the world. This idea, originating from Hermann von Helmholtz, has evolved into the Bayesian Brain Hypothesis. According to this view, exact inference is impossible, leading to approximations where perception and action minimize variational free energy. In other words, action and perception are not independent concepts but two sides of the same process aimed at achieving a goal. Planning for future events can also be seen as a process of minimizing expected free energy.

The High Road

Another way to understand Active Inference starts with the core principle that biological organisms maintain their bodies and prevent their demise. The Markov blanket, a concept commonly used in statistics, provides a formal framework that distinguishes between an organism’s internal and external states. An organism with a Markov blanket can model the external environment from a Bayesian perspective. Autonomy is ensured because the organism’s model of the external environment sets ontological preconditions without bias. Optimal behavior can be described as a process of maximizing Bayesian model evidence through perception and action. This process is equivalent to minimizing variational free energy and aligns with Hamilton’s principle of least action and principles from statistical mechanics.

Formalization

From a formal perspective, free energy is minimized through variational inference. The processes represented by a model can be modeled regardless of whether variables and time are discrete or continuous.

Process Theory

Active Inference is not just a normative principle but also provides a process theory that analyzes how these principles are implemented in the brain. It explains neural message passing, including the neuroanatomical circuitry and neuromodulation, from the anatomical level to the systemic level.

Design Recipe

All organisms minimize variational free energy but do so differently because they have different generative models. The repertoire of possible generative models is rich and depends on the context and ecological niche, resulting in different biological implementations. Evolution has discovered complex forms of brains and bodies suited to diverse ecological niches, from bacteria to primates. When designing models, this process can be reverse-engineered, allowing for the design of models with discrete or continuous variables and varying depths depending on the brains and bodies of interest.

Example

Active Inference can be used to model perceptual inference, goal-directed behavior, model learning, action selection, and more.

Experimentation

Active Inference can be used for model-based data analysis on data collected from experiments, allowing for parameter back-calculation based on individual generative models. This computational phenotyping helps design and evaluate objective models of subjective models of others.


2. Connecting the Dots


Dennett’s Critique

Daniel Dennett around 2004. Source: Eamonn McCabe/Popperfoto, via Getty Images

Decades ago, philosopher Daniel Dennett complained that cognitive scientists spent too much time modeling independent subsystems that are difficult to distinguish clearly. He suggested that they should instead focus on modeling the entire cognitive creature by properly setting the environmental niche they are dealing with.

Integrative Perspective

Active Inference provides an integrative perspective. Perception and action are self-evidencing with respect to a generative model, representing different aspects of the process of minimizing variational free energy. Long-term memory develops as the generative model learns its parameters, while working memory is the process of belief updating about past and future external states. During this time, attention optimizes belief about the precision of sensory input. Planning and intentionality can be conceptualized as the ability to select among possible future scenarios, requiring a deep temporal generative model. This deep temporal model also helps understand complex phenomena like retrospection, where beliefs about the past are updated based on current beliefs, and prospection, where beliefs about the future are derived from current beliefs. Finally, interoceptive regulation and emotion can also be conceptualized through generative models of internal physiological mechanisms predicting allostatic outcomes of future events.

Vision

In response to Dennett’s critique, Active Inference offers an integrative view of many cognitive and behavioral concepts that have been studied independently. It allows us to analyze the various cognitive functions derived from the first principles that life must solve, rather than merely combining perception, action, memory, and attention. This approach gives mathematical formality and specificity, moving beyond the ambiguity of William James’ categories that have been used in psychology and neuroscience.

Identifying Analogies

The exploration-exploitation dilemma always present in decision-making. Source: Lakera AI

Adopting this normative perspective makes it easier to identify formal analogies between cognitive phenomena studied in different fields. Notably, it helps understand the connections between the exploration-exploitation dilemma and the search-deliberation dilemma in constrained resources.

Demystifying

Heart-shaped lightning bolt. Source: Reddit

In conclusion, Active Inference, as a process theory, offers a key means of understanding the computations performed by neurons. Mind-brain-behavior can be understood simply as a process of minimizing variational free energy. In other words, just as thunder and lightning are auditory and visual aspects of atmospheric discharge, mind-brain-behavior are different aspects of the process of minimizing variational free energy. Just as modern science understands thunder and lightning not by reducing them to each other but by reducing both to atmospheric discharge, the mysteries of the relationship between brain, mind, and behavior are best understood by reducing them to the process of minimizing free energy.

Summary

This book began with the question of understanding the brain and behavior from first principles, proposing Active Inference as the solution. It explored what Active Inference means for familiar psychological concepts like perception, action selection, and emotion, while reexamining related concepts and proposing future research directions. Through practical exercises in theoretical neurobiology, it encouraged practicing Active Inference, emphasizing its applicability to everyday life. Ultimately, the book highlighted how Active Inference can contribute to both our daily lives and research, encouraging readers to continue pursuing this approach.

Comments