-
-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Guidance for adding the 1PL-IRT model #147
Comments
That is very unlikely to occur. IDs are generate from UNIX timestamps with millisecond resolution, so the only way for two cards to have the same ID so for them to be created at exactly the same time, down to 1/1000 of a second. That being said, if they are from the same shared deck, yes, it's possible. But I also want to confirm this with @L-M-Sherlock, I'm not 100% sure.
|
I see. Given these two points, I think the benchmark deviates from the literature introducing models like like 1PL-IRT and the DASH family. Those models are usually trained across user collections. But I see the point of further optimizing for each individual user. The benchmark includes "FSRS-5 default param.", which is trained on the entire 10k collections. Would it be possible to do the same for other models? |
It's not exactly "trained on all 10k collections": we don't combine them into one giant collection. Instead, we optimize FSRS on every collection individually, and then take the median of each parameter, and then use those median parameters. |
I'd be curious to see the benchmark results of models trained on the "one giant collection". Is there anything in the training/testing setup blocking this? |
@L-M-Sherlock there are some questions here that you should be able to answer better than me |
My device's RAM will cry. |
Do you have any other suggestion or further question? |
Not right now |
Hi,
I want to start contributing to this repo by adding the 1PL-IRT model (also known as Rasch model), which I think is a great baseline for SRS.
The 1PL-IRT model estimates:
The model assumes the probability of a correct response is:
$$P(correct) = \sigma(\theta - \beta)$$ $\sigma$ is the logistic function.
where
I have a few questions:
other.py
, in particular to the DASH family of models which pretty much extend 1PL-IRT with review history data. The per-user and per-card parameters seem to be missing, was that intentional? Why?The text was updated successfully, but these errors were encountered: