gcamdata: An R Package for Preparation, Synthesis, and Tracking of Input Data for the GCAM Integrated Human-Earth Systems Model

Ben Bond-Lamberty; Kalyn Dorheim; Ryna Cui; Russell Horowitz; Abigail Snyder; Katherine Calvin; Leyang Feng; Rachel Hoesly; Jill Horing; G. Page Kyle; Robert Link; Pralit Patel; Christopher Roney; Aaron Staniszewski; Sean Turner; Min Chen; Felip Feijoo; Corinne Hartin; Mohamad Hejazi; Gokul Iyer; Sonny Kim; Yaling Liu; Cary Lynch; Haewon McJeon; Steven Smith; Stephanie Waldhoff; Marshall Wise; Leon Clarke

doi:10.5334/jors.232

Volume 7 (2019): Issue 1

Open Access

Mar 2019

Figures & Tables

Figure 1

**High-level view of the code-data dependencies in the *gcamdata* package.** This plot of the system architecture shows nodes (“chunks”, units of code charged with processing data and producing specific outputs) and edges (data flows between chunks). Nodes are colored by discipline, e.g., agriculture and land use-related code is black, energy system code is blue, etc. For clarity, neither the initial data inputs nor the final XML outputs (i.e., the GCAM input files) are shown, so seemingly isolated nodes or groups of nodes actually contribute data directly into the model.

Figure 2

**An example of tracing data flow.** Here the user has requested a data trace on a particular data object “L100.FAO_ag_Exp_t” (FAO agricultural exports by country, item, and year). The package prints detailed information about this object and its upstream and downstream dependencies, and graphs these relationships to show data flow (arrows). Raw data inputs are at the top, and the final XML product that flows into the GCAM model is at the bottom. Explanatory notes describe each step.
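The idea behind such a trace can be illustrated with a minimal, language-agnostic sketch in Python. This is not the package's own R implementation or API: the `trace` function and every node name except “L100.FAO_ag_Exp_t” are hypothetical, chosen only to show how upstream and downstream dependencies might be collected by walking a graph of data flows.

```python
from collections import deque

# Hypothetical dependency edges (producer -> consumer). Only the object name
# "L100.FAO_ag_Exp_t" comes from the caption; every other name is invented.
EDGES = [
    ("raw_fao_exports.csv", "L100.FAO_ag_Exp_t"),
    ("raw_country_codes.csv", "L100.FAO_ag_Exp_t"),
    ("L100.FAO_ag_Exp_t", "L109.ag_trade_example"),
    ("L109.ag_trade_example", "example_output.xml"),
]


def trace(edges, start, direction):
    """Breadth-first walk from `start`; direction is 'upstream' or 'downstream'."""
    if direction not in ("upstream", "downstream"):
        raise ValueError(f"unknown direction: {direction}")
    # Build an adjacency map that points the way we want to walk.
    neighbours = {}
    for src, dst in edges:
        key, value = (dst, src) if direction == "upstream" else (src, dst)
        neighbours.setdefault(key, []).append(value)
    seen, queue = [], deque([start])
    while queue:
        node = queue.popleft()
        for nxt in neighbours.get(node, []):
            if nxt not in seen:
                seen.append(nxt)
                queue.append(nxt)
    return seen


upstream = trace(EDGES, "L100.FAO_ag_Exp_t", "upstream")
downstream = trace(EDGES, "L100.FAO_ag_Exp_t", "downstream")
print("upstream:", upstream)
print("downstream:", downstream)
```

In this toy graph the upstream walk ends at raw input files and the downstream walk ends at an XML product, mirroring the top-to-bottom layout the caption describes.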

Table 1

Automatic package-level checks performed on the gcamdata data-handling functions (termed “chunks”) and their outputs.

Category	Test
Behavior	Chunk responds to required messages from driver (DECLARE_INPUTS, DECLARE_OUTPUTS, MAKE)
Behavior	Chunk doesn’t make forbidden calls (e.g., slow or deprecated R routines)
	Chunk handles changes in model time settings
	Chunk (package-level) constants are correctly formatted
Data	Chunk declares a (possibly empty) list of inputs that can all be found, either as the product of another chunk or as a file input
	Chunk declares a valid list of outputs
	Chunk uses only its declared inputs
	Chunk produces exactly its declared outputs
	All file inputs have metadata headers and are encoded (e.g., standard line endings) correctly
	All chunk outputs have title, description, units, comments, and precursor information attached
	All declared precursors are in the chunk input list, and each chunk input is the precursor of at least one output
	Chunk outputs match known good output set
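Several of the checks in Table 1 can be pictured with a small sketch. The Python below is an illustration only, not gcamdata's R code: the message names DECLARE_INPUTS, DECLARE_OUTPUTS, and MAKE come from the table, while `example_chunk`, `run_chunk`, and the data names are hypothetical. It shows a driver that asks a chunk for its declarations, hands it only its declared inputs, and verifies that it produces exactly its declared outputs.

```python
# Message names taken from Table 1; everything else here is hypothetical.
DECLARE_INPUTS = "DECLARE_INPUTS"
DECLARE_OUTPUTS = "DECLARE_OUTPUTS"
MAKE = "MAKE"


def example_chunk(command, data=None):
    """Hypothetical chunk: doubles the values of one input table."""
    if command == DECLARE_INPUTS:
        return ["example_input"]
    if command == DECLARE_OUTPUTS:
        return ["example_output"]
    if command == MAKE:
        return {"example_output": [2 * x for x in data["example_input"]]}
    raise ValueError(f"unknown command: {command}")


def run_chunk(chunk, available):
    """Hypothetical driver applying three of the Table 1 checks."""
    inputs = chunk(DECLARE_INPUTS)
    outputs = chunk(DECLARE_OUTPUTS)
    # Check: every declared input can be found.
    missing = [name for name in inputs if name not in available]
    if missing:
        raise KeyError(f"inputs not found: {missing}")
    # Check: pass only declared inputs, so nothing else can be used silently.
    result = chunk(MAKE, {name: available[name] for name in inputs})
    # Check: the chunk produces exactly its declared outputs.
    if set(result) != set(outputs):
        raise ValueError(f"declared {sorted(outputs)}, got {sorted(result)}")
    return result


produced = run_chunk(
    example_chunk,
    {"example_input": [1, 2, 3], "unrelated_table": [9]},
)
print(produced)
```

Restricting the data passed with MAKE to the declared inputs is one simple way to enforce "uses only its declared inputs"; the actual package may enforce this differently.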

DOI: https://doi.org/10.5334/jors.232 | Journal eISSN: 2049-9647

Language: English

Submitted on: Jun 2, 2018

Accepted on: Feb 18, 2019

Published on: Mar 14, 2019

Published by: Ubiquity Press

In partnership with: Paradigm Publishing Services

Publication frequency: 1 issue per year

Keywords: Human-earth system modeling

© 2019 Ben Bond-Lamberty, Kalyn Dorheim, Ryna Cui, Russell Horowitz, Abigail Snyder, Katherine Calvin, Leyang Feng, Rachel Hoesly, Jill Horing, G. Page Kyle, Robert Link, Pralit Patel, Christopher Roney, Aaron Staniszewski, Sean Turner, Min Chen, Felip Feijoo, Corinne Hartin, Mohamad Hejazi, Gokul Iyer, Sonny Kim, Yaling Liu, Cary Lynch, Haewon McJeon, Steven Smith, Stephanie Waldhoff, Marshall Wise, Leon Clarke, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.
