Time Zone: UTC

21 May 16:00 – 21 May 18:00 in Tutorials / Workshops 2

Xarray User Forum

Deepak Cherian, Anderson Banihirwe

Audience level:
Intermediate

Description

Xarray provides metadata-rich data structures that wrap array-like objects such as Dask arrays. This two-part session will highlight recent exciting advances in Xarray’s capabilities, and present user stories of Xarray+Dask usage across a wide variety of domains.

Abstract

Xarray adds dimension names, coordinate labels, and arbitrary metadata on top of Dask arrays allowing scalable, convenient, and expressive analysis code (think pandas for n-dimensional arrays). The combination of Xarray and Dask for labelled distributed data analytics has seen wide adoption across scientific fields, particularly the geosciences.

This will be a two-part session. First, an hour of updates from multiple developers on exciting progress at extending Xarray's flexibility over the past year funded by the Chan Zuckerberg Initiative. Highlights include support for array objects that follow the __array_function__ protocol; a new plugin interface for ingesting data from storage backends; and ongoing work to supercharge label-based indexing through custom indexes (e.g. those that represent grid cell intervals). Each talk will be followed by a 5-minutediscussion and Q&A on how these new features work to unlock new domain-specific functionality and use-cases. Our goal is to spark a conversation in the user community on how best to adapt and use these new features for domain-specific solutions.

Name UTC Topic
Aureliana Barghini 4.00pm New backend plugin infrastructure
Benoit Bovy 4.20pm Indexing Xarray datasets and data arrays: beyond Pandas indexes
Justus Magin 4.40pm Duck arrays in Xarray

The second half will focus on Xarray+Dask user stories from non-geoscience domains, and associated challenges. We will reach out to speakers from domains such as condensed matter physics, radio astronomy, finance, and the life sciences to put together a compelling series of three 15 minute talks that highlight use of Xarray and Dask. Each talk will be followed by 5 minutes of moderated Q&A with the speaker.

Name UTC Topic
Jack Leland 5.00pm Xarray in fusion energy research
Ravin Kumar 5.20pm Bayesian Modeling with ArViz & Xarray
Davis Bennett 5.40pm Xarray & Dask in the Life Sciences