Reinforcement Learning in a Prisoner's Dilemma

OxTalks is Changing

OxTalks will soon be transitioning to Oxford Events (full details are available on the Staff Gateway). A two-week publishing freeze is expected to start before the end of Hilary Term to allow all future events to be migrated to the new platform. During this period, you will not be able to submit or edit events on OxTalks. The exact freeze dates will be confirmed on the Staff Gateway and via email to identified OxTalks users.

If you have any questions, please contact halo@digital.ox.ac.uk

Reinforcement Learning in a Prisoner's Dilemma

I fully characterize the outcomes of a wide class of model-free reinforcement learning algorithms in a prisoner’s
dilemma. The behavior is studied in the limit as players explore their options sufficiently and eventually stop experimenting.
Whether the players learn to cooperate or defect can be determined in a closed form from the relationship between the learning rate and the payoffs of the game. The results generalize to asymmetric learners and many experimentation rules with implications for the issue of algorithmic collusion.

Zoom link: us02web.zoom.us/j/83496520603
18 January 2022, 12:00-12:45pm (Tuesday)

To sign up for a 30-minute meeting with the speaker, please add your name at this link: docs.google.com/spreadsheets/d/1Ux9g5nXtbFmqIA3DWZYUljZthkvRd-Qg/edit#gid=1190663177

Date: 18 January 2022, 12:00
Venue: Online
Speaker: Dr Arthur Dolgopolov (European University Institute)
Organising department: Department of Economics
Organiser: Calvin CHIU (Global Priorities Institute)
Organiser contact email address: gpi-office@philosophy.ox.ac.uk
Host: Rossa O'Keeffe-O'Donovan (University of Oxford)
Part of: Global Priorities Institute (GPI) - Seminar Series
Booking required?: Not required
Audience: Public
This talk features in the following public collections:
- Events of interest to Social Sciences
Editors: Rossa O'Keeffe-O'Donovan, Wai Chiu