implement pacta.data.validation
#185
AFAIK, those are valid ISINs, they are just ISINs of private securities (venture capital, and things like that). I had a look at this when I was looking into private equity, and it's sorta cool that we have them actually! Personally, I would loosen the restrictions on the expectation there (but maybe ask Nick for a second opinion?). In any case, TM doesn't publicize that it can cover private equity, so I think it's totally fine to just filter them out.
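For reference, whether an identifier is structurally a valid ISIN can be checked independently of whether the security is public or private. The following is a minimal Python sketch of the standard ISIN check (two-letter prefix, nine alphanumeric characters, one Luhn check digit). It is not the expectation used in this PR's validation code, and the function name `is_valid_isin` is made up for illustration.

```python
import re

# Two uppercase letters, nine alphanumeric characters, one numeric check digit.
ISIN_PATTERN = re.compile(r"[A-Z]{2}[A-Z0-9]{9}[0-9]")


def is_valid_isin(isin: str) -> bool:
    """Return True if `isin` has the ISIN shape and a correct Luhn check digit."""
    if not ISIN_PATTERN.fullmatch(isin):
        return False
    # Expand letters to numbers (A=10 ... Z=35); digits stay as they are.
    digits = "".join(str(int(ch, 36)) for ch in isin)
    total = 0
    # Luhn: walking from the right, double every second digit.
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


print(is_valid_isin("US0378331005"))  # structurally valid example
print(is_valid_isin("US0378331006"))  # same identifier with a wrong check digit
```

A check like this only says that an identifier is well formed. Deciding whether private-security ISINs should pass the expectation or be filtered out remains a separate data decision.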
#> Error in base::tryCatch(base::withCallingHandlers({: 4 assertions failed:
#> * Variable 'data$technology': must contain only valid technology
#> * names, but has additional element "ICE Hydrogen_HDV".
#> * Variable 'data$ald_production_unit': must contain only valid
#> * production units, but has additional elements "# vehicles", "dwt
#> * km", "t cement", "t coal", and "t steel".
#> * Variable 'data$ald_emissions_factor_unit': must contain only valid
#> * emissions factor units, but has additional elements "tCO2/dwt km",
#> * "tCO2/km", "tCO2/pkm", "tCO2/tkm", "tCO2e/GJ", "tCO2e/MWh", "tCO2e/t
#> * cement", "tCO2e/t coal", and "tCO2e/t steel".
#> * Variable 'data$technology': must contain only valid technology names
#> * for HDV, but has additional elements %s.
This part is pretty important. Could be coming from two places, either:
I guess we have a decision to make regarding:
Of the two, I guess I would probably prefer the former, which will require a PR to:
Hopefully, we just need to adjust the strings themselves, and don't need to change the actual data/assumptions at all.
This seems to just be an issue with HDV, which I don't really use at all (it's still not supported in
What technology name were you using for
#> * Variable 'data$technology': must contain only valid technology names
#> * for HDV, but has additional elements %s.
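The literal `%s` in the message above suggests a format placeholder that was never filled in with the offending values. As a minimal sketch of the kind of subset check these messages describe, written in Python rather than the package's R code, the function below builds the message with the extra elements interpolated. The function name `check_subset` and the allowed technology name are hypothetical placeholders; only "ICE Hydrogen_HDV" comes from the output above.

```python
def check_subset(values, allowed, what):
    """Return an error message if `values` has elements outside `allowed`, else None."""
    extra = sorted(set(values) - set(allowed))
    if not extra:
        return None
    # Interpolate the offending values so no raw placeholder reaches the user.
    quoted = ", ".join(f'"{v}"' for v in extra)
    noun = "element" if len(extra) == 1 else "elements"
    return f"must contain only valid {what}, but has additional {noun} {quoted}."


# Hypothetical allowed set for one sector; the name is a placeholder, not real data.
ALLOWED_TECHNOLOGY = {"HDV": {"Placeholder A_HDV"}}

message = check_subset(
    ["Placeholder A_HDV", "ICE Hydrogen_HDV"],
    ALLOWED_TECHNOLOGY["HDV"],
    "technology names for HDV",
)
print(message)
```

Keeping the allowed names per sector in one lookup means that resolving a mismatch like this one is a change to the strings in that lookup, not to the check itself.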
closes #18
Most validation errors originally found (below) have been resolved. Validation of financial_data and abcd_flags_equity has been removed for now and will be added in the future.

investigation issues:

relevant fixes in pacta.data.validation:
"ICE Hydrogen_HDV" in sector "HDV" pacta.data.validation#65

relevant fix in pacta.data.preparation:
factset_entity_id that are NA and therefore not useful pacta.data.preparation#18

validation of financial_data and abcd_flags_equity has been removed from this PR, and future intended implementation is tracked here:
financial_data and abcd_flags_equity when ready #222

currently, a few of the validation checks fail using 2023Q4 config