The challenge of AI alignment centres on what goals or values to encode in AI systems to govern their behaviour. A number of answers have been proposed, including the notion that AI must be aligned with human intentions, or that it should aim to be helpful, honest and harmless. Nonetheless, both accounts suffer from critical weaknesses. On the one hand, they are incomplete: neither specification provides adequate guidance to AI systems deployed across varied domains involving multiple parties. On the other hand, the justification for these approaches is questionable and, I shall argue, of the wrong kind. More specifically, neither approach takes seriously the need to justify the operation of AI systems to those affected by their actions – or what this requirement means for pluralistic societies in which people hold different underlying beliefs about value. To address these limitations, I will develop an alternative account of AI alignment that focuses on fair processes, holding that the principles produced by such processes are the appropriate target for alignment. This new approach meets the necessary standard of public justification, generates a fuller set of principles for AI that are sensitive to variation in context, and has explanatory power insofar as it identifies a set of previously underappreciated ways in which AI systems may cease to be aligned.