Model comparison #152
Replies: 8 comments 4 replies
-
Beta Was this translation helpful? Give feedback.
-
The site that improves the most using the metab_dense model is It looks like the baseline model is predicting a big anomaly in winter that doesn't exist, and the metab_dense model is not, that probably accounts for some of the model improvement. Other than that anomaly it actually looks like the baseline model does better at this site particularly with do_max. Also meta_dense under predicts do_min worse than the baseline, although both under predict the troughs. |
Beta Was this translation helpful? Give feedback.
-
This is the same style time series from site Interestingly, it looks like the metab_dense model does a better job with the lowest values, such as the troughs in do_min. The baseline model looks like it pegs out, but the metab_dense model seems to follow the curve much better. It also looks like it tracks the rising limb out of the trough much better. |
Beta Was this translation helpful? Give feedback.
-
Just for fun here is another one, this is for site This one also shows an improvement in the bias, particularly for do_mean, and better tracking of the rising limb, particularly in do_min. |
Beta Was this translation helpful? Give feedback.
-
Adding in tmin as a dynamic predictorThe plots above were made using tmax, precip, and solar radiation as the dynamic predictors. Adding in tmin doesn't have a huge effect, but it does seem to improve the validation sites for do_min and do_max in general: Might be easier to see the differences when the sites are aggregated together: |
Beta Was this translation helpful? Give feedback.
-
My thought was that adding tmin would have the greatest effect on do_min because tmin would set the baseline for how much ER occurred during the night time? Not sure if that is the right reasoning, but then I looked at how the addition of tmin affected the prediction of metab model outputs. This could only be done for the So adding tmin improves prediction of ER, but decreases performance for GPP across the board, and is pretty mixed for the other variables. |
Beta Was this translation helpful? Give feedback.
-
Also, in this run the large anomaly in the baseline model for site 01481500 was not present. This makes me want to run more replicates and compare ensembles instead of single runs: |
Beta Was this translation helpful? Give feedback.
-
Following up on this, I ran 5 replicates of the baseline LSTM model with and without tmmn as a predictor. Here's the RMSE ("val" partition only) from the overall_metrics file, so averaging across observations and sites: The global RMSE values suggest that adding tmmn probably results in a small improvement across the three target variables, perhaps more so for DO-min and DO-max (consistent w/ Galen's assessment from 1 rep). In our meeting on 10/5/22, I mentioned that it seemed like the model w/ tmmn was actually performing worse for 4 out of the 6 validation sites based on 1 replicate model run. After running more reps, I don't think the tmmn model necessarily performs worse but rather, the differences are generally small relative to the variability among model reps. What stands out to me is the large variability among reps for Here's the timeseries for I was hoping to get a sense for how the model might be using tmmn (aside from what is already gleaned from tmmx), so I plotted global permutation feature importance across reps as well as the expected gradients for tmmn and tmmx for one 12-month sequence (6 months are shown below; from model rep 4). If I'm interpreting the EG plots correctly, it seems like for this site ( |
Beta Was this translation helpful? Give feedback.
-
This discussion is about comparing model runs and visualizing results. All models are run using the NHD fabric on the medium observed data which sets a threshold of at least 100 DO observations per site, and results in 14 sites.
Sites
Models
do_min
,do_mean
, anddo_max
do_min
,do_mean
,do_max
,GPP
, andER
(also predictK
,Temp
,Depth
but don't include in loss func)do_min
,do_mean
,do_max
,GPP
,ER
,K
,Temp
,Depth
GPP
,ER
,K
,Temp
,Depth
use dense layer to then predictdo_min
,do_mean
,do_max
,Beta Was this translation helpful? Give feedback.
All reactions