One of the ways we promote open science and FAIR data is to highlight unique MSD-relevant datasets or models that have embraced the FAIR principles. If you created or work with such a dataset and would like to share your work with the community please reach out to Casey or Adam.


Stay tuned for upcoming webinars. We have had community feedback that there is a desire to learn more about the following topics:

  • Open source licenses: Which one is right for your work?
  • Data repositories: Which one is right for your work?


June 2020:

  • Title: The FEWSION Food-Energy-Water Dataset
  • Author: Benjamin Ruddell, Northern Arizona University
  • Abstract: FEWSION is a significant NSF-funded research effort focused on mesoscale food-energy-water systems data synthesis in the United States. This effort involves engineering a coupled natural human systems data model for commodity (and other) inputs and outputs, including adoption of database structures and standards, controlled vocabularies and crosswalks, metadata, documentation, scientific workflows, high performance computing optimization, and data fusion/linking strategies to integrate heterogeneous spatiotemporal data sources. Visualization is also a challenge, and patent-pending visual analytics strategies have been developed and prototyped in the FEW-View system available since April 2019 on the website https://fewsion.us. The first version of the dataset is already available to collaborators, although unpublished, and the second version is scheduled for a more complete publication and release in 2020-2021.

September 2020:

  • Title: Open Source Historical Emissions Data Generation With The Community Emissions Data System (CEDS)
  • Author: Steve Smith, Pacific Northwest National Laboratory
  • Abstract: Anthropogenic emissions of reactive gases, aerosols, and aerosol precursor compounds have substantially changed atmospheric composition. Increased particulate and tropospheric ozone concentrations since pre-industrial times have altered the radiative balance of the atmosphere, increased human mortality and morbidity, and impacted terrestrial and aquatic ecosystems. Central to studying these effects are historical trends of emissions. Historical emissions data and consistent emissions time series are particularly important for Earth System Models (ESMs) and atmospheric chemistry and transport models, which use emissions time series as key model inputs. Emissions data are also used by multi-sector models of the human-Earth system as a historical starting point for analysis and projection of future emissions scenarios. For example, in the Coupled Model Intercomparison Project (CMIP), a key international benchmarking exercise for ESMs, a common “forcing” dataset is required for historical emissions and other inputs. In previous phases of CMIP historical emissions data were compiled from different sources using inconsistent methodologies between emission species and over time that also lacked uncertainty estimates and reproducibility.

February 2021:

  • Title: Applying FAIR Principles in the Capacity Expansion Regional Feasibility Model
  • Authors: Kristian Nelson, Chris Vernon, and Jennie Rice – Pacific Northwest National Laboratory
  • Time: Tuesday, February 16 2021 at 9:30 am Pacific
  • Abstract: The FAIR guidelines require data to be findable, accessible, interoperable, and reusable and are key to making scientific datasets available to the community in a way that facilitates efficient data-intensive research. It is important for scientists to follow these guidelines and solidify them as standard practice in the community, so that data products can be properly organized and accessible for use in other research. As an exercise in implementing these principles in the Capacity Expansion Regional Feasibility Model (CERF), we amalgamate, archive, and distribute energy expansion data into a commonly used format, to provide to the scientific community as a reusable service. We describe the entire process of planning, discovery, harvesting, documenting, archiving, publishing, and serving data for the CERF model. The process of using these guidelines in this model can be an example for other scientists to make their data FAIR. Like many data-intensive projects, this work requires a multitude of datasets that encompass a variety of sectors, scales, and formats. Abiding by the FAIR principles ensures that this data service can be accessible and reusable. These guidelines are a useful tool to help scientists organize and store future work, and it is important to make it a standard practice in the data-science world.

May 2021:

  • Title: Assessing Compounding and Co-Evolving Risks Across Multiple Systems and Sectors: A Triage-based Approach
  • Authors: Adam Schlosser, Cypress Frankenfeld, and Shelli Orzach – Massachusetts Institute of Technology
  • Time: Tuesday, May 4 2021 at 9:30 am Pacific
  • Abstract: Physical and transition risks across the natural, managed, and built environments are becoming increasingly complex, multi-faceted, and compounding. Changes to the future landscapes of socio-economic and environmental stressors and drivers may co-evolve, interact, and lead to salient tipping points and instabilities. Yet – in order to provide resilient and adaptive solutions these situations require in-depth, risk-based analyses and model assessments. However, these deep-dive efforts are intensely consumptive in terms of time, computational, and investigative resources. In practical terms, such exercises cannot be performed extensively – but rather selectively in terms of priority, scale, and/or the critical nature of the threat. Therefore, an efficient analysis platform through which the variety of multi-systems/sector observational and simulated data can be readily incorporated, harmonized, combined, diagnosed, visualized, and shared with the community-at-large could prove a useful screening-level capability.

    In view of this, a “triage-based” visualization and data-sharing platform has been constructed that has the ability to survey, overlay, and combine supporting data on the current landscapes of risk, demographics, and infrastructure. A total multi-stress risk index can then be created (and downloaded) from a weighted combination of all the indicators deemed important and can be based on their relevance to user specifications and areas-of-concern. While the platform is under continual development, the risk metrics that are currently considered include: land-, water-, and energy systems, as well as demographics, environmental justice, and transportation networks. We will demonstrate this platform’s current geographic focus over the United States (including Alaska, Hawaii, and Puerto Rico) to resolve these landscapes at the county-level scale. Our expected near-term efforts will augment the platform to develop and incorporate aspects on human health and environmental diversity. Further work will incorporate projections of future changes to the supporting data and/or risk metrics that are produced with our Integrated Global System Model (IGSM) framework. The web-based platform is an open-science tool and available to the community-at-large. We view the continued development and enhancement of its capabilities through an open, accessible, and interactive approach with the academic, research, public, and industry communities.