During Michaelmas Term, OxTalks will be moving to a new platform (full details are available on the Staff Gateway).
For now, continue using the current page and event submission process (freeze period dates to be advised).
If you have any questions, please contact halo@digital.ox.ac.uk
Dask is a framework for distributed computing that can scale pandas workloads in a distributed environment. This talk will start with an overview of the pandas 2.0 release and where pandas is headed in the future before diving into Dask DataFrames. Dask DataFrames offer an API that is based on the pandas API. It can jump in when pandas struggles with the size of your data through parallelizing the computations over a cluster with many workers. We will look at the core concepts of Dask DataFrames based on some examples.