Take this text from the tooltip Anki uses for Retention stats (formerly “True Retention”):
If you are using FSRS, your retention is expected to be close to your desired retention.
Now, that should be generally true, but —
- We can have backlogs
- We study outside the regular schedule in filtered decks
- We change DR from time to time
And even if none of that applies, you will at least have reviews of cards that weren’t rescheduled after the last regular optimisation.
The point is that retrievability at the time of review can be very different from the DR you’ve set in Anki. This makes comparison harder: you can’t simply compare your actual retention with whatever desired retention value you’ve configured.
Therefore, Anki should store retrievability for every review, just like it does for difficulty. That way, we could compute the average retrievability across all the reviews we have done and compare it to the actual retention.
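To make the comparison concrete, here is a minimal sketch of what that averaging would look like. It assumes the FSRS-4.5 power forgetting curve (with its decay of -0.5 and factor of 19/81, chosen so that R = 90% when the elapsed time equals the stability); the review data is made up for illustration.

```python
DECAY = -0.5
FACTOR = 19 / 81  # chosen so retrievability is 0.9 when elapsed == stability

def retrievability(elapsed_days: float, stability: float) -> float:
    """Predicted recall probability after elapsed_days, per the FSRS-4.5 curve."""
    return (1 + FACTOR * elapsed_days / stability) ** DECAY

# Hypothetical (elapsed_days, stability_at_review) pairs from a revlog
reviews = [(10, 10), (3, 12), (30, 15)]

avg_r = sum(retrievability(t, s) for t, s in reviews) / len(reviews)
print(f"average predicted retrievability: {avg_r:.3f}")
```

That `avg_r` is the number you would put next to the measured retention for the same set of reviews.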
I’ve also started to realise (and this is what prompted this post) that storing difficulty isn’t very useful for the user. Apart from being a really unintuitive number (what is 86.7% difficulty?), it is also not comparable across presets and collections. So what value does it provide? I really think storing retrievability is the better choice.
Retrievability depends on the FSRS parameters. And they change with every optimization. Why record information that becomes obsolete over time? Moreover, if necessary, it can be calculated.
If you change True Retention, it will no longer be True Retention. That this table lets you indirectly evaluate how well FSRS is working is only a secondary function.
To evaluate FSRS, there is already the "Check health when optimizing (slow)" option.
If you need some kind of numerical measure of FSRS accuracy, you could suggest returning log loss and RMSE.
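For reference, both metrics can be computed directly from predicted retrievability and actual outcomes (1 = recalled, 0 = forgotten). A small sketch with made-up numbers; note that the FSRS benchmark reports a binned variant, "RMSE (bins)", whereas this shows the plain per-review RMSE:

```python
import math

# Hypothetical predicted retrievabilities and actual review outcomes
preds = [0.9, 0.85, 0.7, 0.95]
outcomes = [1, 1, 0, 1]

# Log loss: average negative log-likelihood of the outcomes under the predictions
log_loss = -sum(y * math.log(p) + (1 - y) * math.log(1 - p)
                for p, y in zip(preds, outcomes)) / len(preds)

# Plain RMSE between predictions and outcomes (FSRS itself bins reviews first)
rmse = math.sqrt(sum((p - y) ** 2 for p, y in zip(preds, outcomes)) / len(preds))

print(f"log loss: {log_loss:.4f}, RMSE: {rmse:.4f}")
```

Lower is better for both; either would give users a single number for how well FSRS is fitting their reviews.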
Do you mean we can retroactively calculate all the previous R values? That would be awesome then.
As for recording R values in the revlog, I agree it’s weird to record something that can become obsolete. But it felt like the only option; besides, Anki records difficulty anyway, which is unhelpful and can even be confusing (people get confused when the D value suddenly changes after an optimisation).
Rather, the goal is to evaluate individual performance. This is also useful if you’re comparing different hours of the day:
They’re similar graphs, but take a look at this for what I’m thinking about:
Similarly, I think Anki could show “expected retention” in the retention table (formerly “true retention”). Then you’d have both “actual retention” and “expected retention” to contrast.
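As a sketch of what that contrast could look like for one row of the table (all numbers hypothetical): actual retention is the pass rate, while expected retention is the mean predicted retrievability over the same reviews.

```python
# Hypothetical (predicted retrievability, outcome) pairs for one period,
# where outcome is 1 for a pass and 0 for a lapse
reviews = [(0.92, 1), (0.88, 1), (0.81, 0), (0.95, 1)]

actual = sum(y for _, y in reviews) / len(reviews)    # pass rate
expected = sum(r for r, _ in reviews) / len(reviews)  # mean predicted R

print(f"actual: {actual:.0%}, expected: {expected:.0%}")
```

A large, persistent gap between the two columns would be exactly the signal that FSRS predictions and real performance have drifted apart.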