Dask is a framework for distributed computing that can scale pandas workloads across a cluster. This talk will start with an overview of the pandas 2.0 release and where pandas is headed, before diving into Dask DataFrames. Dask DataFrames offer an API based on the pandas API. They can step in when pandas struggles with the size of your data, by parallelizing computations over a cluster with many workers. We will look at the core concepts of Dask DataFrames through some examples.