Meta-training neural networks to control themselves

Animals learn to adapt to levels of uncertainty in the environment by monitoring errors and engaging control processes. Recently, deep networks have been proposed as theories of animal perception, cognition and learning, but there is little theory that allows us to incorporate error monitoring or control into neural networks. Here, we asked whether it was possible to meta-train deep reinforcement learning (RL) agents to adapt to the level of controllability of the environment. We found that this was only possible if we encouraged them to compute action prediction errors (APEs), error signals similar to those generated in the mammalian medial prefrontal cortex (PFC). In an “observe vs. bet” bandit task, APE-trained networks meta-learned policies that closely resembled those of humans. We also show that biases in this error computation lead the network to display pathologies of control characteristic of psychological disorders, such as compulsivity and learned helplessness.
Date: 16 May 2024, 14:30
Venue: Sherrington Building, off Parks Road OX1 3PT
Venue Details: Blakemore Lecture Theatre
Speakers: Chris Summerfield (University of Oxford), Kai Sandbrink (University of Oxford)
Organising department: Medical Sciences Division
Organiser: Dr Rui Ponte Costa (University of Oxford)
Part of: Oxford NeuroAI Forum
Booking required?: Not required
Audience: Members of the University only
Editor: Rui Costa