Some elements dropped while encoding to mod_pettifor representation #21
Comments
Yes, certain elements have non-unique codes in some of the encodings (hence the warning in #15). I can look into making a version of mod_pettifor that removes this issue. TBH, I haven't yet looked into whether it is a bug or expected behavior.
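For anyone who wants to check which symbols share a code, here is a minimal sketch. The table below is a toy with made-up numbers, not the real mod_pettifor values, and `find_collisions` is a hypothetical helper rather than part of element-coder.

```python
from collections import defaultdict

# Toy table with made-up codes (NOT the real mod_pettifor values).
toy_encoding = {"H": 1, "He": 2, "Li": 3, "Be": 3, "B": 4, "C": 4}


def find_collisions(encoding):
    """Group symbols by code and keep only codes shared by several symbols."""
    by_code = defaultdict(list)
    for symbol, code in encoding.items():
        by_code[code].append(symbol)
    return {code: symbols for code, symbols in by_code.items() if len(symbols) > 1}


collisions = find_collisions(toy_encoding)
n_symbols = len(toy_encoding)                # number of distinct symbols
n_codes = len(set(toy_encoding.values()))    # number of distinct codes

print(collisions)          # codes that map back to more than one symbol
print(n_symbols, n_codes)  # a gap between the two counts signals collisions
```

Running the same kind of check against the actual encoding table should show which entries account for the gap between the symbol count and the unique-code count.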
Worked around it in the code. I just needed to remove the "symbol" column from the DataFrame I made. I wasn't using the symbol data anyway.

    mod_petti_df = pd.DataFrame(
        dict(symbol=_data.keys(), mod_petti=mod_petti_comp.keys(), contribution=mod_petti_comp.values()),
    ).sort_values("mod_petti")

changed to:

    mod_petti_df = pd.DataFrame(
        dict(mod_petti=mod_petti_comp.keys(), contribution=mod_petti_comp.values()),
    ).sort_values("mod_petti")
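A rough sketch of why dropping the symbol column helps, assuming the encoded composition is a dict keyed by code (the thread does not show how `mod_petti_comp` was built, so `encode_composition` and the numbers below are hypothetical). When two symbols share a code, their entries merge, so the encoded dict ends up with fewer keys than the original composition and the columns no longer line up.

```python
# Hypothetical composition (fraction per symbol) and made-up codes,
# NOT the real mod_pettifor values.
composition = {"Li": 0.25, "Be": 0.25, "B": 0.5}
toy_encoding = {"Li": 3, "Be": 3, "B": 4}


def encode_composition(comp, encoding):
    """Re-key a composition by code, summing fractions of colliding symbols."""
    encoded = {}
    for symbol, fraction in comp.items():
        code = encoding[symbol]
        encoded[code] = encoded.get(code, 0.0) + fraction
    return encoded


mod_petti_comp = encode_composition(composition, toy_encoding)

n_symbol_rows = len(composition)    # rows a "symbol" column would have
n_code_rows = len(mod_petti_comp)   # rows the code and contribution columns have

print(mod_petti_comp)
print(n_symbol_rows, n_code_rows)   # unequal lengths cannot share one table
```

Summing on collision is one possible choice here; overwriting would also shrink the key count but would silently lose part of the composition.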
Sorry for coming back to this so late. Do you have a preferred way of solving this? I also do not like that (see element-coder/src/element_coder/data/raw/mod_petti.json, lines 105 to 120 in fa6a025).
The following produces a list of 118 unique elements (disclaimer: contains unrealistic entries):
However, when encoding these in the "mod_pettifor" representation, there are only 103 unique values.
Not sure if #15 is related.
This is a blocker for using matbench-genmetrics with xtal2png+imagen-pytorch in sparks-baird/xtal2png#204, but not super time-sensitive. The fact that it's producing values from all 118 periodic elements despite not all elements being represented in the training dataset (pretty sure) is a concern from the generative modeling standpoint.
For context, the script I'm running is https://github.com/sparks-baird/matbench-genmetrics/blob/main/scripts/load_imagen_pytorch_generated.py.