PUDL (pronounced puddle) is a data processing pipeline created by Catalyst Cooperative that cleans, integrates, and standardizes some of the most widely used public energy datasets in the US.
The data serve researchers, activists, journalists, and policy makers that might not have the technical expertise to access it in its raw form, the time to clean and prepare the data for bulk analysis, or the means to purchase it from existing commercial providers. Electric utilities report a huge amount of information to the US government and other public agencies. This includes yearly, monthly, and even hourly data about fuel burned, electricity generated, operating expenses, power plant usage patterns and emissions. Unfortunately, much of this data is not released in well documented, ready-to-use, machine readable formats. Data from different agencies tends not to be standardized or easily used in tandem. Several commercial data services clean, package, and re-sell this this data, but at prices which are too high to be accessible to many smaller stakeholders.
PUDL cleans, links, and standardizes this data, all for free!
| Organization Type: | For-profit business / social enterprise / B Corp |
|---|---|
| Status: | Active |
| Related Links: | |
| Claimed Status: | Claimed |
| Parent Organization: | Catalyst Cooperative |
| Open Source License: | https://github.com/catalyst-cooperative/pudl/blob/main/LICENSE.txt |
| Last Modified: | 4/10/2026 |
| Added on: | 11/8/2025 |
In 2025, we had 577 users register for the newly launched PUDL Data Viewer. Our data was accessed by researchers at over 70 academic institutions and was cited 14 times in academic journals, pre-prints and graduate theses. We were interviewed in the MIT Technology Review, the New York Times, Salon and Les Echos. (Source, 2025-12-31 )