POSE: Phase I: Towards an open-source energy data ecosystem

$319,873 · FY2024 · TIP · NSF

Catalyst Cooperative, Philadelphia PA

Abstract

With this Pathways for Open-Source Ecosystems (POSE) project, Catalyst Cooperative is laying the groundwork for an open-source ecosystem that produces freely available, openly licensed, analysis-ready energy system data for use by researchers, policymakers, and small and medium-sized enterprises. The emerging energy grid will be more dynamic and interactive than the one built during the 20th century, making computer models an increasingly important part of grid planning. Unfortunately, the high-quality data those models will need to ensure that the energy transition is smooth, equitable, and cost-effective is not currently available to the public, slowing progress and excluding many stakeholders. This NSF POSE Phase I project builds on the success of the Public Utility Data Liberation (PUDL) project, making its governance, infrastructure, and user community more robust so that PUDL can attract and retain new users and open-source contributors from a variety of disciplines. A vibrant open-source ecosystem producing and curating open energy system data will reduce duplicative data preparation, increase research productivity, improve reproducibility, and make it easier for community members to share best practices and build upon each other's work.

In this POSE Phase I project, Catalyst Cooperative will chart a path towards a sustainable open-source ecosystem model and address foundational barriers to open-source contributions. Through surveys, improved usage tracking, and community discussions, the team will seek to better understand the range of potential users of open-source energy system data in the U.S., their motivations for contributing to or using the data, and the technical or institutional barriers that may prevent them from doing so. Through interactive tutorials, university brown-bags, and other outreach efforts, the project aims to identify accessible pathways for those with energy system domain expertise to acquire the software and data engineering skills needed to collaborate across disciplines in a distributed open-source ecosystem. At the same time, improving PUDL’s existing software testing and data validation frameworks will ensure that contributors can immediately understand the impacts of any changes they make and can easily contribute without negatively impacting data quality. To plan for a growing user base, Catalyst Cooperative is also identifying appropriate governance models for the project that can help it become a self-sustaining ecosystem.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
