mirror of https://github.com/BerenMillidge/FEP_Active_Inference_Papers.git synced 2026-06-17 02:00:27 +00:00

A repository for major/influential FEP and active inference papers.


FEP and Active Inference Paper Repository

This repository provides a list of papers on the Free-Energy-Principle and Active Inference that I believe are interesting and influential. If you believe I have missed any papers, please contact me at beren@millidge.name or make a pull request with the information about the paper. I will be happy to include it.

FEP Outline

This list is of papers focused specifically on the abstract mathematical formulation of the Free-Energy-Principle (FEP). The FEP is a theory which tries to determine the behaviours a non-equilibrium thermodynamical system must exhibit if it is to maintain itself as a separate entity over time. It argues that any such system must minimize a quantity called the free energy and that, over the course of this minimisation, behaviour much like action and perception must emerge.
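As a point of reference, the quantity being minimised can be written down in its usual variational form. This is a sketch in notation chosen here for illustration (it is not taken from any one paper in the list): $o$ denotes observations, $x$ hidden states, $p(o, x)$ the system's generative model, and $q(x)$ an approximate posterior over hidden states.

```latex
\begin{aligned}
F[q, o] &= \mathbb{E}_{q(x)}\big[\ln q(x) - \ln p(o, x)\big] \\
        &= D_{\mathrm{KL}}\big[\,q(x)\,\|\,p(x \mid o)\,\big] - \ln p(o) \\
        &\geq -\ln p(o)
\end{aligned}
```

Because the KL divergence is non-negative, the free energy is an upper bound on surprise, $-\ln p(o)$. Minimising it with respect to $q$ tightens the bound (perception), while changing $o$ through action can lower the surprise itself.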

The key prerequisites for the FEP are that a 'system' has a special kind of statistical separation from the world called a Markov Blanket, which it must maintain if it is to remain a system, and that the system possesses a non-equilibrium steady state which it self-organises to and tries to maintain over time against the dissipative forces of entropy.

Much of the work in the FEP has been applying its general tenets to understand biological far-from-equilibrium systems, especially the brain.

If you are just starting out, I recommend reading all the papers in the 'Surveys' section in order. These are all great tutorials or overviews which should give you a solid grounding in the intuitions of the theory, and the latter two tutorials then start building up much of the mathematical core of the theory (especially around predictive coding).

Surveys

What does the free energy principle tell us about the brain? , (2019) by Gershman, Samuel J [bib]

This provides a great high level introduction to the basic ideas and intuitions of the FEP, with a small amount of crucial mathematical background.

The free-energy principle: a unified brain theory? , (2010) by Friston, Karl [bib]

This provides a great overview for the initial intuitions behind the FEP and its application to the brain.

A tutorial on the free-energy framework for modelling perception and learning , (2017) by Bogacz, Rafal [bib]

This is a great review which introduces the basics of predictive coding and the FEP, including the maths, and contains MATLAB sample code. If you want to start seriously diving into the maths, I would start here.

The free energy principle for action and perception: A mathematical review , (2017) by Buckley, Christopher L, Kim, Chang Sub, McGregor, Simon and Seth, Anil K [bib]

This is a fantastic review which presents a complete walkthrough of the mathematical basis of the Free Energy Principle and Variational Inference, and derives predictive coding and (continuous time and state) active inference. I would recommend reading this after Bogacz's tutorial (although be prepared: it is a long and serious read).

Classics

A free energy principle for a particular physics , (2019) by Friston, Karl [bib]

This is Karl's magisterial monograph, and contains the most comprehensive description of the FEP to date

A free energy principle for the brain , (2006) by Friston, Karl, Kilner, James and Harrison, Lee [bib]

Perhaps the earliest paper describing the FEP. Provides a great description of the fundamental intuitions behind the theory (the need of living systems to reduce their internal entropy to keep conditions within homeostatic bounds).

A theory of cortical responses , (2005) by Friston, Karl [bib]

An early but complete description of predictive coding as an application of the FEP and variational inference under Gaussian and Laplace assumptions. Also surprisingly readable. This is core reading on predictive coding and the FEP

Learning and inference in the brain , (2003) by Friston, Karl [bib]
Reinforcement learning or active inference? , (2009) by Friston, Karl J, Daunizeau, Jean and Kiebel, Stefan J [bib]

The earliest paper (I think) on active inference. Introduces the motivation behind the continuous state and time formulation of active inference. Shows how predictive coding can be used to learn actions as well as observations (by treating them the same)

Action understanding and active inference , (2011) by Friston, Karl, Mattout, Jérémie and Kilner, James [bib]

Goes deep into the neuroscientific intuitions behind why you might want to think about action as a predicted observation and not a latent variable for biological brains. Presents Karl's view that action happens primarily at the periphery through simple 'reflex arcs' while all the real work is done by the generative models generating predictions.

A free energy principle for biological systems , (2012) by Friston, Karl [bib]
Of woodlice and men , (2018) by Fortier, Martin and Friedman, Daniel A [bib]

A great interview with Karl. Goes into a lot of his personal motivations underlying his work on the FEP. I would recommend this perhaps as an initial place to start out if you know nothing of the FEP to grasp the underlying motivations of what it is trying to explain.

The history of the future of the Bayesian brain , (2012) by Friston, Karl [bib]
Free energy, value, and attractors , (2012) by Friston, Karl and Ao, Ping [bib]

Mathematical paper by Karl and Ping Ao which begins fleshing out formally the notion of desires as attractors

Attention, uncertainty, and free-energy , (2010) by Feldman, Harriet and Friston, Karl [bib]

Makes a conjectured link between precision in predictive coding and attention in the brain.

Hierarchical models in the brain , (2008) by Friston, Karl [bib]

Presents the 'full-construct' predictive coding model with both hierarchies and generalised coordinates.

DEM: a variational treatment of dynamic systems , (2008) by Friston, Karl J, Trujillo-Barreto, N and Daunizeau, Jean [bib]

Extends predictive coding to generalised coordinates, and derives the necessary inference algorithms for working with them -- i.e. DEM, dynamic expectation maximisation.

Generalised filtering , (2010) by Friston, Karl, Stephan, Klaas, Li, Baojuan and Daunizeau, Jean [bib]
Surfing uncertainty: Prediction, action, and the embodied mind , (2015) by Clark, Andy [bib]
Variational filtering , (2008) by Friston, Karl J [bib]

Foundational treatment of variational inference for dynamical systems, as represented in generalised coordinates. Also relates variational filtering to other non-variational schemes like particle filtering and Kalman filtering.

Philosophical Analyses

A tale of two densities: Active inference is enactive inference , (2020) by Ramstead, Maxwell JD, Kirchhoff, Michael D and Friston, Karl J [bib]
Answering Schrödinger's question: A free-energy formulation , (2018) by Ramstead, Maxwell James Désormeau, Badcock, Paul Benjamin and Friston, Karl John [bib]
Thinking through other minds: A variational approach to cognition and culture , (2020) by Veissière, Samuel PL, Constant, Axel, Ramstead, Maxwell JD, Friston, Karl J and Kirmayer, Laurence J [bib]
TTOM in action: Refining the variational approach to cognition and culture , (2020) by Veissière, Samuel PL, Constant, Axel, Ramstead, Maxwell JD, Friston, Karl J and Kirmayer, Laurence J [bib]
What does the free energy principle tell us about the brain? , (2019) by Gershman, Samuel J [bib]

This provides a great high level introduction to the basic ideas and intuitions of the FEP, with a small amount of crucial mathematical background.

The anticipating brain is not a scientist: the free-energy principle from an ecological-enactive perspective , (2018) by Bruineberg, Jelle, Kiverstein, Julian and Rietveld, Erik [bib]
Predictive processing and the representation wars , (2018) by Williams, Daniel [bib]
Whatever next? Predictive brains, situated agents, and the future of cognitive science , (2013) by Clark, Andy [bib]
Predictions in the eye of the beholder: an active inference account of Watt governors , (2020) by Baltieri, Manuel, Buckley, Christopher L and Bruineberg, Jelle [bib]
From allostatic agents to counterfactual cognisers: active inference, biological regulation, and the origins of cognition , (2020) by Corcoran, Andrew W, Pezzulo, Giovanni and Hohwy, Jakob [bib]
Interoceptive inference, emotion, and the embodied self , (2013) by Seth, Anil K [bib]
Active interoceptive inference and the emotional brain , (2016) by Seth, Anil K and Friston, Karl J [bib]
The cybernetic Bayesian brain , (2014) by Seth, Anil K [bib]
Presence, objecthood, and the phenomenology of predictive perception , (2015) by Seth, Anil K [bib]

Self-Organisation and Markov Blankets

Life as we know it , (2013) by Friston, Karl [bib]
Knowing one's place: a free-energy approach to pattern regulation , (2015) by Friston, Karl, Levin, Michael, Sengupta, Biswa and Pezzulo, Giovanni [bib]
Morphogenesis as Bayesian inference: A variational approach to pattern formation and control in complex biological systems , (2019) by Kuchling, Franz, Friston, Karl, Georgiev, Georgi and Levin, Michael [bib]
Neural and phenotypic representation under the free-energy principle , (2020) by Ramstead, Maxwell JD, Hesp, Casper, Tschantz, Alexander, Smith, Ryan, Constant, Axel and Friston, Karl [bib]
Parcels and particles: Markov blankets in the brain , (2020) by Friston, Karl J, Fagerholm, Erik D, Zarghami, Tahereh S, Parr, Thomas, Hipólito, Inês, Magrou, Loïc and Razi, Adeel [bib]
Markov blankets in the brain , (2020) by Hipolito, Ines, Ramstead, Maxwell, Convertino, Laura, Bhat, Anjali, Friston, Karl and Parr, Thomas [bib]
Modules or Mean-Fields? , (2020) by Parr, Thomas, Sajid, Noor and Friston, Karl J [bib]
Biological self-organisation and Markov blankets , (2017) by Palacios, Ensor Rafael, Razi, Adeel, Parr, Thomas, Kirchhoff, Michael and Friston, Karl [bib]

Information Geometry

Markov blankets, information geometry and stochastic thermodynamics , (2020) by Parr, Thomas, Da Costa, Lancelot and Friston, Karl [bib]

Active Inference Outline

Active Inference is a process theory of neurobiological function inspired by and closely related to the FEP. However, Active Inference stands independent of the FEP: it can be true even if the FEP is not, and it can potentially be falsified without impacting the FEP. The core idea behind Active Inference is that the brain performs both action and perception by variational inference on a unified objective function.

In effect, the key idea behind active inference is that our brains possess powerful probabilistic generative models and inference engines, and that to select actions, we repurpose this powerful capacity we use for perception to also infer potential actions. Hence Active Inference.

This high-level description leaves open the exact type of models and inference used for action inference in the brain. The active inference literature contains three clear strands of work, which correspond to different assumptions about the form of the generative models the brain is proposed to use. Discrete active inference focuses on models of discrete state-spaces parametrised by categorical distributions and transition matrices. Continuous active inference focuses on the continuous-time case with (generally) linear dynamics. Deep active inference uses deep neural networks to 'scale up' active inference by amortising probabilistic distributions with learned maps. The discrete-state-space work has close similarities with bandit problems and neuroscience tasks, and forms a tractable test-bed for understanding different kinds of behaviour; most of the work on active inference models of brain function (or dysfunction) lies within this paradigm. Continuous active inference, which is being used for robot control, has close links to classical control theory, while deep active inference has close links with reinforcement learning and machine learning.
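To make the discrete-state-space setting concrete, here is a minimal sketch of a model built from categorical distributions and transition matrices. The names `A`, `B` and `D` follow a common convention in the discrete active inference literature, but the two-state model, its numbers, and the helper functions are all assumptions made up for illustration.

```python
import numpy as np

# Hypothetical two-state, two-observation model (all numbers are illustrative).
# A[o, s] = p(observation o | hidden state s): the likelihood matrix.
A = np.array([[0.9, 0.2],
              [0.1, 0.8]])

# B[a][s_next, s] = p(s_next | s, action a): one transition matrix per action.
B = np.stack([
    np.eye(2),                  # action 0: stay in the current state
    np.array([[0.0, 1.0],
              [1.0, 0.0]]),     # action 1: switch to the other state
])

# D[s] = prior probability of each initial hidden state.
D = np.array([0.5, 0.5])


def infer_state(prior, obs):
    """Exact Bayesian posterior over hidden states after one observation index."""
    unnormalised = A[obs] * prior
    return unnormalised / unnormalised.sum()


def predict_state(belief, action):
    """Push a belief over states through the transition matrix of an action."""
    return B[action] @ belief


posterior = infer_state(D, obs=0)                # belief after seeing observation 0
predicted = predict_state(posterior, action=1)   # belief after then switching
```

In a model this small the posterior can be computed exactly; the variational and message-passing schemes discussed in the papers below matter once the state space is factorised or extends over time.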

Inferring actions, which requires detailed models of future outcomes given those actions, is a subtly more complex task than inferring the immediate causes of sensory data, as in perceptual inference. It therefore requires different objective functionals (the expected free energy) and potentially more advanced message-passing inference algorithms. This work is summarised in the 'Message Passing and Free Energies' section.
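As a rough illustration of how an expected free energy can score actions, the sketch below evaluates a one-step lookahead using the commonly cited 'risk plus ambiguity' decomposition. The model, the preference vector `C`, and the function name are assumptions for illustration only; the papers in this section use richer, multi-step formulations.

```python
import numpy as np

# Hypothetical model: A[o, s] = p(o | s), B[a][s_next, s] = p(s_next | s, a).
A = np.array([[0.9, 0.2],
              [0.1, 0.8]])
B = np.stack([np.eye(2), np.array([[0.0, 1.0], [1.0, 0.0]])])

# C[o] = preferred (prior) distribution over observations; illustrative values.
C = np.array([0.8, 0.2])


def expected_free_energy(belief, action, eps=1e-16):
    """One-step expected free energy of an action: risk + ambiguity."""
    qs = B[action] @ belief   # predicted hidden states under the action
    qo = A @ qs               # predicted observations under the action
    # Risk: KL divergence between predicted and preferred observations.
    risk = np.sum(qo * (np.log(qo + eps) - np.log(C + eps)))
    # Ambiguity: expected entropy of the likelihood p(o | s) under qs.
    likelihood_entropy = -np.sum(A * np.log(A + eps), axis=0)
    ambiguity = likelihood_entropy @ qs
    return risk + ambiguity


belief = np.array([0.1, 0.9])  # current belief over hidden states
G = np.array([expected_free_energy(belief, a) for a in range(len(B))])

# Actions with lower expected free energy receive higher probability.
q_action = np.exp(-G) / np.exp(-G).sum()
```

Here the agent believes it is probably in state 1, which mostly generates the dispreferred observation, so switching states scores a lower expected free energy than staying.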

Surveys and Tutorials

Active inference on discrete state-spaces: a synthesis , (2020) by Da Costa, Lancelot, Parr, Thomas, Sajid, Noor, Veselic, Sebastijan, Neacsu, Victorita and Friston, Karl [bib]

This is a great and thorough tutorial on discrete-state-space active inference. I would recommend it to everybody new to the field.

Discrete State Space Formulation

Active inference and epistemic value , (2015) by Friston, Karl, Rigoli, Francesco, Ognibene, Dimitri, Mathys, Christoph, Fitzgerald, Thomas and Pezzulo, Giovanni [bib]

Introduces the main intuitions behind active inference, as well as the crucial epistemic foraging behaviour of the expected free energy. Illustrated on a simple T-maze task.

Active inference and learning , (2016) by Friston, Karl, FitzGerald, Thomas, Rigoli, Francesco, Schwartenbeck, Philipp, Pezzulo, Giovanni and others [bib]
Active inference and agency: optimal control without cost functions , (2012) by Friston, Karl, Samothrakis, Spyridon and Montague, Read [bib]

The first (I think) discrete-state-space paper on active inference. Notable for using the standard variational free energy as objective function and not the expected free energy. Describes some of the intuitions behind active inference.

Active inference: a process theory , (2017) by Friston, Karl, FitzGerald, Thomas, Rigoli, Francesco, Schwartenbeck, Philipp and Pezzulo, Giovanni [bib]

Provides a very good and thorough description of discrete-state-space active inference and ties its updates closely to neural physiology. I would recommend this after the Da Costa introduction.

Uncertainty, epistemics and active inference , (2017) by Parr, Thomas and Friston, Karl J [bib]
Deep temporal models and active inference , (2018) by Friston, Karl J, Rosch, Richard, Parr, Thomas, Price, Cathy and Bowman, Howard [bib]
Sophisticated Inference , (2020) by Friston, Karl, Da Costa, Lancelot, Hafner, Danijar, Hesp, Casper and Parr, Thomas [bib]

Introduces the next stage of active inference: 'sophisticated' active inference, where agents make decisions based not just on their beliefs about the future, but on how their beliefs will change in the future. Allows the simulation of real epistemic value, i.e. acting so as to change your beliefs in the future.

Active inference: demystified and compared , (2019) by Sajid, Noor, Ball, Philip J and Friston, Karl J [bib]
The relationship between dynamic programming and active inference: The discrete, finite-horizon case , (2020) by Da Costa, Lancelot, Sajid, Noor, Parr, Thomas, Friston, Karl and Smith, Ryan [bib]

Discusses the relationship between active inference and dynamic programming solutions to reinforcement learning problems (i.e. Q learning, value functions etc). Shows that they are largely equivalent except with different objectives (Expected Free Energy vs Expected Discounted Reward).

Continuous Time Formulation

Reinforcement learning or active inference? , (2009) by Friston, Karl J, Daunizeau, Jean and Kiebel, Stefan J [bib]

An active inference implementation of phototaxis , (2017) by Baltieri, Manuel and Buckley, Christopher L [bib]

Active inference in plants!!!

PID control as a process of active inference with linear generative models , (2019) by Baltieri, Manuel and Buckley, Christopher L [bib]

Active inference under a linear Gaussian generative model can replicate PID control, and also provides a natural method for learning the tuning coefficients (by understanding them as precisions).

On Kalman-Bucy filters, linear quadratic control and active inference , (2020) by Baltieri, Manuel and Buckley, Christopher L [bib]

A key step towards understanding how active inference relates to classical control theory methods such as Kalman Filters and LQR control.

Application of the Free Energy Principle to Estimation and Control , (2019) by van de Laar, Thijs, Özçelikkale, Ayça and Wymeersch, Henk [bib]

Another approach to understanding how active inference relates to and extends classical control theory methods.

The State Space Formulation of Active Inference: Towards Brain-Inspired Robot Control , (2019) by Grimbergen, Sherin [bib]

An excellent overview and fantastic piece of work on the linear time-independent formulation of active inference and its relation to classical control theory.

Hierarchical active inference: A theory of motivated control , (2018) by Pezzulo, Giovanni, Rigoli, Francesco and Friston, Karl J [bib]

Message Passing and Free Energies

The graphical brain: belief propagation and active inference , (2017) by Friston, Karl J, Parr, Thomas and de Vries, Bert [bib]

Introduces the general factor-graph message passing viewpoint on active inference. Also introduces hierarchical active inference models.

Neuronal message passing using Mean-field, Bethe, and Marginal approximations , (2019) by Parr, Thomas, Markovic, Dimitrije, Kiebel, Stefan J and Friston, Karl J [bib]

Discusses in depth the different potential message passing inference algorithms which can be used to implement active inference on factor graphs.

Active inference, belief propagation, and the bethe approximation , (2018) by Schwöbel, Sarah, Kiebel, Stefan and Marković, Dimitrije [bib]

Introduces the Bethe free energy, as a result of making the Bethe approximation instead of the mean-field variational assumption to derive the message passing algorithms.

Generalised free energy and active inference , (2019) by Parr, Thomas and Friston, Karl J [bib]
Whence the Expected Free Energy? , (2020) by Millidge, Beren, Tschantz, Alexander and Buckley, Christopher L [bib]

Discusses whether we can derive the expected free energy objective function on principled ground from the FEP, and discusses different potential objective functions for active inference.

On the Relationship Between Active Inference and Control as Inference , (2020) by Millidge, Beren, Tschantz, Alexander, Seth, Anil K and Buckley, Christopher L [bib]

Discusses the relationship between Active Inference and Control as Inference, a variational framework for understanding action selection which has emerged from RL.

Active Inference for Control Theory/Robotics

Active inference and robot control: a case study , (2016) by Pio-Lopez, Léo, Nizard, Ange, Friston, Karl and Pezzulo, Giovanni [bib]
Active inference body perception and action for humanoid robots , (2019) by Oliver, Guillermo, Lanillos, Pablo and Cheng, Gordon [bib]
End-to-end pixel-based deep active inference for body perception and action , (2019) by Sancaktar, Cansu and Lanillos, Pablo [bib]
Active inference for robot control: A factor graph approach , (2019) by Vanderbroeck, Mees, Baioumy, Mohamed, van der Lans, Daan, de Rooij, Rens and van der Werf, Tiis [bib]
A novel adaptive controller for robot manipulators based on active inference , (2020) by Pezzato, Corrado, Ferrari, Riccardo and Corbato, Carlos Hernández [bib]

Deep Active Inference

Reinforcement Learning through Active Inference , (2020) by Tschantz, Alexander, Millidge, Beren, Seth, Anil K and Buckley, Christopher L [bib]

Demonstrates that the exploration afforded by the Expected Free Energy Objective is useful in a deep reinforcement learning setting. Also maintains uncertainty through model ensembles applied in a model-based RL setting.

Scaling active inference , (2020) by Tschantz, Alexander, Baltieri, Manuel, Seth, Anil K and Buckley, Christopher L [bib]

Implements Deep Active Inference in a model-based RL setting using explicit planning with a transition model.

Deep active inference as variational policy gradients , (2020) by Millidge, Beren [bib]

Implements deep active inference in a model-free policy gradient setting by amortising the learning of the expected-free-energy value function. Uses a transition model for the state-information gain term in the expected free energy.

Deep active inference , (2018) by Ueltzhöffer, Kai [bib]

The first paper to try combining active inference with deep neural networks. Demonstrates the importance of the exploratory terms of the EFE to solve the mountain-car problem.

Deep active inference agents using Monte-Carlo methods , (2020) by Fountas, Zafeirios, Sajid, Noor, Mediano, Pedro AM and Friston, Karl [bib]

Acknowledgements

Many thanks to @conorheins for his helpful suggestions.

Contributing

To contribute, please make pull requests adding entries to the bibtex file.

The README file was generated from bibtex using the bibtex_to_md.py file. The keywords to use for each classification (Survey, Discrete-state-space etc) can be found at the bottom of the .py file.
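For contributors curious about the general idea, the sketch below shows one way a bibtex entry could be turned into a README line. It is not the contents of bibtex_to_md.py: the regular expression, the function names, and the `survey` keyword in the sample entry are hypothetical, and it assumes one `field = {value}` pair per line.

```python
import re

# Matches one "field = {value}," line; assumes each field sits on its own line.
FIELD = re.compile(r"^\s*(\w+)\s*=\s*\{(.*)\},?\s*$", re.MULTILINE)


def parse_entry(entry):
    """Return the fields of a single bibtex entry as a lower-cased dict."""
    return {key.lower(): value for key, value in FIELD.findall(entry)}


def format_authors(authors):
    """Turn bibtex 'A and B and C' into 'A, B and C'."""
    names = [name.strip() for name in authors.split(" and ")]
    if len(names) == 1:
        return names[0]
    return ", ".join(names[:-1]) + " and " + names[-1]


def to_markdown(fields):
    """Render one entry in a 'Title, (year) by Authors' style."""
    authors = format_authors(fields["author"])
    return f"{fields['title']}, ({fields['year']}) by {authors}"


# Sample entry; the citation key and the 'survey' keyword are illustrative only.
sample = """@article{gershman2019does,
  title = {What does the free energy principle tell us about the brain?},
  author = {Gershman, Samuel J},
  year = {2019},
  keywords = {survey},
}"""

fields = parse_entry(sample)
line = to_markdown(fields)
```

A real converter would also need to group entries by keyword and handle multi-line fields, which a dedicated bibtex parser does more robustly than a regular expression.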

This code and structure are heavily inspired by https://github.com/optimass/continual_learning_papers.