The Public Utility Data Liberation (PUDL) Project


https://catalyst.coop/pudl/
Boulder, CO

PUDL (pronounced puddle) is a data processing pipeline created by Catalyst Cooperative that cleans, integrates, and standardizes some of the most widely used public energy datasets in the US.

The data serves researchers, activists, journalists, and policy makers who might not have the technical expertise to access it in its raw form, the time to clean and prepare it for bulk analysis, or the means to purchase it from existing commercial providers. Electric utilities report a huge amount of information to the US government and other public agencies. This includes yearly, monthly, and even hourly data about fuel burned, electricity generated, operating expenses, power plant usage patterns, and emissions. Unfortunately, much of this data is not released in well-documented, ready-to-use, machine-readable formats. Data from different agencies tends not to be standardized or easily used in tandem. Several commercial data services clean, package, and resell this data, but at prices that are too high for many smaller stakeholders.

PUDL cleans, links, and standardizes this data, all for free!

Organization Type: For-profit business / social enterprise / B Corp
Status: Active
Related Links:
Claimed Status: Claimed
Parent Organization: Catalyst Cooperative
Open Source License: https://github.com/catalyst-cooperative/pudl/blob/main/LICENSE.txt
Last Modified: 4/10/2026
Added on: 11/8/2025

Project Categories

Evidence of this project's impact:

In 2025, 577 users registered for the newly launched PUDL Data Viewer. Our data was accessed by researchers at over 70 academic institutions and was cited 14 times in academic journals, pre-prints, and graduate theses. We were interviewed by the MIT Technology Review, the New York Times, Salon, and Les Echos. (Source, 2025-12-31)
