OxTalks is Changing
OxTalks will soon move to the new Halo platform and will become 'Oxford Events.' There will be a need for an OxTalks freeze. This was previously planned for Friday 14th November – a new date will be shared as soon as it is available (full details will be available on the Staff Gateway).
In the meantime, the OxTalks site will remain active and events will continue to be published.
If staff have any questions about the Oxford Events launch, please contact halo@digital.ox.ac.uk
Zero-shot numerical reasoning in dual stream neural networks and the primate visual system
Human viewers learn abstract concepts corresponding to visual relational properties and then can generalize these concepts zero-shot to new objects and contexts. Numerosity is a prime example of this ability. Once a child has learned the abstract concept of ”threeness”, she will forever be able to recognize groups of three objects, even novel objects in novel contexts, without any additional learning. This is not the case for modern neural network-based computer vision systems, which, while highly proficient at object recognition, struggle to generalize relational properties like cardinality. Here, we show that a recurrent dual-stream neural network, inspired by the role of the dorsal stream in primate vision, which apprehends an image via a sequence of foveated glimpses, displays zero-shot generalization in numerical reasoning tasks. This zero-shot generalization behaviour is not observed in parameter-matched control models which receive the entire image as input. Neither stream of the dual-stream model is sufficient to solve the task alone. The dual-stream model replicates several neural and behavioural phenomena associated with human and monkey enumeration. Analyzing the activity of the recurrent layer revealed several response properties associated with posterior parietal cortex (PPC): log-normal number coding, an over-representation of units selective to the extremes of the number range, and place-selective spatial receptive fields. Inspection of the pattern of errors made at different levels of proficiency revealed that, like human learners, the dual-stream model masters smaller numerosities first, gradually refining larger numerosities. Our characterization of the computational principles that support zero-shot numerical reasoning are consistent with converging theories of the role of PPC in visual reasoning. According to this theory, efferent copies of motor or attention signals are received as an additional input to a visual reasoning system, enabling abstractions that are grounded in action, rather than purely in the sensory domain. The success of our model suggests that “attention” may be an inactive mechanism for relational inference, rather than (merely) a spatial prioritisation scheme.
Date:
22 February 2024, 14:30
Venue:
Sherrington Library, off Parks Road OX1 3PT
Speaker:
Jessica Thompson (University of Oxford)
Organising department:
Medical Sciences Division
Organisers:
Dr Rafal Bogacz (University of Oxford),
Dr Rui Ponte Costa (University of Oxford)
Host:
Dr Rui Ponte Costa (University of Oxford)
Part of:
Oxford NeuroAI Forum
Booking required?:
Not required
Audience:
Members of the University only
Editor:
Rui Costa