xarray: N-D labeled Arrays and Datasets in Python

By: Stephan Hoyer and Joe Hamman
Open Access | Apr 2017

Abstract

xarray is an open-source project and Python package that provides a toolkit and data structures for N-dimensional labeled arrays. Our approach combines an application programming interface (API) inspired by pandas with the Common Data Model for self-described scientific data. Key features of the xarray package include label-based indexing and arithmetic, interoperability with the core scientific Python packages (e.g., pandas, NumPy, Matplotlib), out-of-core computation on datasets that don’t fit into memory, a wide range of serialization and input/output (I/O) options, and advanced multi-dimensional data manipulation tools such as group-by and resampling. xarray, as a data model and analytics toolkit, has been widely adopted in the geoscience community but is also used more broadly for multi-dimensional data analysis in physics, machine learning and finance.

DOI: https://doi.org/10.5334/jors.148 | Journal eISSN: 2049-9647
Language: English
Submitted on: Sep 7, 2016
Accepted on: Feb 23, 2017
Published on: Apr 5, 2017
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2017 Stephan Hoyer, Joe Hamman, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.