The threat of analytic flexibility in using large language models to simulate human data: A call to attention

OxTalks Change Freeze Starts 2 March

Oxford Events, the new replacement for OxTalks, will launch on 16th March. The two-week OxTalks freeze period starts on Monday 2nd March. During this time, there will be no facility to publish or edit events. The existing OxTalks site will remain available to view during this period. Once Oxford Events launches, you will need a Halo login to submit events. Full details are available on the Staff Gateway.

The threat of analytic flexibility in using large language models to simulate human data: A call to attention

Social scientists are now using large language models to create “silicon samples” – synthetic datasets intended to stand in for human respondents, aimed at revolutionising human subjects research. However, there are many analytic choices which must be made to produce these samples. Though many of these choices are defensible, their impact on sample quality is poorly understood. I map out these analytic choices and demonstrate how a very small number of decisions can dramatically change the correspondence between silicon samples and human data. Configurations (N = 252) varied substantially in their capacity to estimate (i) rank ordering of participants, (ii) response distributions, and (iii) between-scale correlations. Most critically, configurations were not consistent in quality: those that performed well on one dimension often performed poorly on another, implying that there is no “one-size-fits-all” configuration that optimises the accuracy of these samples. I call for greater attention to the threat of analytic flexibility in using silicon samples.

Join online here: teams.microsoft.com/l/meetup-join/19%3ameeting_MjVlNjc3ZDgtMTg3NS00MzQyLWE1MjEtODE0M2UwYWQxNWY0%40thread.v2/0?context=%7b%22Tid%22%3a%22cc95de1b-97f5-4f93-b4ba-fe68b852cf91%22%2c%22Oid%22%3a%226d9f2d4d-3f0c-45ba-a72b-1f6e0a6c2234%22%7d

Date: 28 October 2025, 14:00
Venue:
Manor Road Building
Manor Road OX1 3UQ
See location on maps.ox

Details: MRB Skills Lab and Online
Speaker: Jamie Cummins (University of Bern)
Organiser: Emma Madden (University of Oxford)
Part of: Synthetic Social Science Workshop
Booking required?: Not required
Audience: Public
Editor: Emma Madden