This repository has been archived by the owner on Jan 12, 2024. It is now read-only.
Automate generation of EPA CEMS metadata for data catalog export #2
Labels
epacems
The EPA's Continuous Emissions Monitoring System hourly dataset
inframundo
intake
Intake data catalogs
metadata
Data about our liberated data
We want to integrate column and table metadata (e.g. text descriptions) into the source definition in
pudl_catalog.yaml
so that users can understand what data is available when browsing the catalog. This information is currently being written into the column and table metadata within the Parquet files during ETL, so it could be read from there. It could be exported from our Pydantic metadata models when we generatepudl_catalog.yaml
.pudl_catalog.yaml
. This should include at least:Resource.to_intake_data_source()
method that can generate the Intake data source levelmetadata
entry.The text was updated successfully, but these errors were encountered: