Update simulator for FSRS to compare with Anki's built-in scheduler

L.M.Sherlock · October 10, 2022, 11:50am

Continuing the discussion from New progress in implement the custom algorithm:

I updated the simulator based on the DSR memory model and compared Anki and FSRS.

The setting of experiment is:

# parameters for FSRS
w = [1.014, 2.2933, 4.9588, -1.1608, -0.9954, 0.0234, 1.3923, -0.0484, 0.7363, 1.6937, -0.4708, 0.6032, 0.9762]
requestRetention = 0.9  # recommended setting: 0.8 ~ 0.9

# parameters for Anki
graduatingInterval = 1
easyInterval = 4
intervalModifier = 1
newInterval = 0
minimumInterval = 1

# common parameters
maximumInterval = 36500
easyBonus = 1.3
hardInterval = 1.2

new_cards_limits = 20
review_limits = 1000
learn_days = 450
deck_size = 6000

# get the true repetitions from review logs
filename = "ALL__Learning.apkg"

# smooth curves
moving_average_period = 14

The parameters for FSRS are generated from my collection.
The parameters for Anki is the default setting.
New cards/day: 20
Maximum reviews/day: 1000
Days to simulate: 450
Deck size: 6000

Note: The simulator hasn’t supported (re)learning steps. And I won’t implement it because steps bigger than one day are complicated to handle.

Here are some interesting figures I want to introduce:

In the number of due cards per day, FSRS reduced by 10% compared with Anki.

In the number of repetitions per day, FSRS reduced by 8% compared with Anki.

The retention of Anki is too higher in early stage.

You can try it at Google Colab:

biodecus · October 10, 2022, 12:30pm

Nice comparison, thanks for putting this together.

Would be really interesting to be able to get SM-18 into the comparison, but obviously not possible since it’s closed source.

It’s not possible to get much hard data on SM-18, but Guillem Palau, who was a heavy Anki user for 11+ years has mentioned that he’s ended up with slightly higher retention from SM-18 but with roughly half the repetitions of Anki SM-2.

L42 · October 24, 2022, 1:55pm

This simulator is a great idea. I’ve been using a custom scheduler based on a speculation of how a certain app works, but could never get the retention above 90%. This simulator has helped me work out some quirks. Thanks.

FSRS looks promising. However, it requires the right weights for each study subject. This means more tuning and customization from the user’s side which has always been a headache for those new to SRS. I would not advice users to go above 20% FI (below 89.62% retention). This has proven to harm learning leading to problems down the line. It was written extensively in SM blogs.

[quote]
Setting the forgetting index above 20% would be like giving up SuperMemo altogether and coming back to remembering only that what is easy to remember…Nevertheless, if you want to maximize the speed of learning with little control over what actually stays in your memory, set the forgetting index to 20%[/quote]

I think if we have more shared profiles, we could analysis these in depth. Maybe create a tool to wipe the card data and export them for anonymous sharing? I didn’t want to take up the task myself, any brave souls? I’ll leave you with some screenshots comparing different algos, it took forever to generate these. (Seems like I’m limited to 5 images.)

kuroahna · October 24, 2022, 3:34pm

I’ve been using a custom scheduler based on a speculation of how a certain app works, but could never get the retention above 90%

Are you able to share the details of the custom scheduler that you’re referring to here?

These graphs that you shared are interesting. I never heard of the “kensho” algorithm before (I can’t find anything on google either). It seems strange that it’s able to maintain a near 95% retention with the lowest amount of repetitions compared to all algorithms. Generally, the higher the retention rate, the more reviews you need to do (smaller intervals), so how is this possible?

L.M.Sherlock · October 24, 2022, 4:46pm

The latest version of FSRS optimizer can calculate the optimal retention, and you can try it with the simulator.

L.M.Sherlock · October 24, 2022, 5:02pm

It only counts the retention on reviewed cards per day. If an algorithm only schedule the review on a small set of cards, the retention could be higher. To avoid this cheating, I will add a figure to show the retention on all cards.

L.M.Sherlock · October 25, 2022, 2:39am

I update the simulator in v3.8.0 with counting the expected memorization.

L42 · October 25, 2022, 5:07am

@kuroahna

Sorry for the misunderstanding. Kensho is a code name for a scheduler I made for personal use. Nothing published. It’s a no brainer which app I was referring to.

@L.M.Sherlock

Your link above still points to the old version using google colaboratory.

Calling it cheating is a bit rude don’t you think, the goal is to get to the cause of the problem. It would be nice if we can use a common apkg file for these tests. Can you provide a link to an apkg that I can use? I don’t really trust this collection. It’s full of old scheduling data, so it might be throwing off your simulator.

Full disclaimer: For the images below, I took the changes for “expected_memorization_per_day” and patched it to the old version v3.3.2 for this test. It would be too much work otherwise.

L.M.Sherlock · October 25, 2022, 5:29am

Sorry for that, I meant the indicator is misguiding. It is interesting to do some research on your scheduler.

L.M.Sherlock · October 25, 2022, 5:41am

Can you use the collection file in this issue?

github.com/open-spaced-repetition/fsrs4anki-helper

Converting from Anki SM-2 to FSRS: Rescheduling gives a lot of cards?

opened 09:03PM - 16 Oct 22 UTC

kuroahna

Anki Version 2.1.55 Beta 2 FSRS Scheduler: v3.4.0 FSRS Optimizer: v3.5.0 FSRS… Helper Addon: v3.4.0 I'm rescheduling all my cards to use the FSRS algorithm now, but I'm not 100% sure if this is intended or if there's perhaps a bug? 1. Open Anki 2. Click File -> Switch Profile 3. Click Add 4. Name it `Testing` (or any other name) 5. Click ok 6. Import my collection [collection2.zip](https://github.com/open-spaced-repetition/fsrs4anki-helper/files/9795587/collection2.zip) 7. See 499 cards due ![image](https://user-images.githubusercontent.com/85209455/196056486-730fb04f-01d8-4276-914f-2972b431dedc.png) 8. Click Gear icon for the 日本語 deck 9. Click Options 10. Add Custom scheduling code with my parameters (v3.4.0) [scheduler.zip](https://github.com/open-spaced-repetition/fsrs4anki-helper/files/9795597/scheduler.zip) Note I've only changed the following parameters ```js var w = [1.3027, 0.5276, 5.1668, -1.3924, -1.0168, 0.0089, 1.2972, -0.0173, 0.6662, 1.8265, -0.4096, 0.688, 0.4756]; let requestRetention = 0.86; ``` 11. Click Save 12. Click Tools -> Reschedule All Cards 13. See 7615 cards due ![image](https://user-images.githubusercontent.com/85209455/196056628-2f2c623b-f563-4a0b-8427-593caff5c484.png) Perhaps this is intended and I really do need to review all 7615 cards due to my parameters, but I'd like some confirmation whether or not this is expected behaviour? I've also noticed some interesting values with the `customData` after rescheduling. 1. Right click anywhere in anki in the deck screen 2. Click Inspect (Install webview inspector addon) 3. Click the 復習 filtered deck 4. Click Rebuild 5. Click Study Now 6. Anki shows me the card with the Word 女中 (card id is `1648356923674`) 7. Click Browse 8. Right click highlighted card (it is 女中 for me) 9. Click Info... 10. See these values ![image](https://user-images.githubusercontent.com/85209455/196057442-2167c3c1-70f1-4a0b-8967-4efc5d648ca3.png) Note that lapses is 0 (I resetted the card before on 2022-07-05 by right clicking the card and clicking Forget Card..., which resets the ease factor + lapses count) 12. In the WebView Inspector, I see the values ![image](https://user-images.githubusercontent.com/85209455/196057531-53bea224-73e8-45e6-ab23-05998b06c547.png) Note that `interval=60`. This means that 60 days have been elapsed since my last review, which was on 2022-08-17 for this card, and Anki SM-2 schedules it in 2.37 months from 2022-08-17, which should be about in 72.08 days, or on 2022-10-28. However, rescheduling the card using FSRS Helper assigns it to me today instead. I assume this is because of my parameters? However, it is interesting to see that `last_s=11.806` and `last_d=9.8721` and `retrievability=0.585400431176371` Shouldn't `last_s` be a much higher value than `11.806`? From my review history, when I resetted the card on 2022-07-05 and relearned the card, I've always pressed Good on the card and haven't failed the card. Of course, before 2022-07-05, I've failed the card 4 times before resetting the card. In Anki, I set Lapse Threshold set to 4 and Auto Suspend the card. This allows me to find leeches fast, which forces me to reformulate the card and make it easier for me to understand. Then I forget the card and relearn it as if it was completely new. Does this affect how the FSRS optimizer/scheduler/helper addon works? Is this why the `last_s` is `11.806` instead of a much higher value? Similarly for `last_d`? Shouldn't `last_d` be a much lower value than `9.8721` since I haven't failed the card at all yet? I feel like `last_s` and `retrievability` should be much higher values for this card, and `last_d` should be a much lower value. Perhaps this is also why the helper addon is scheduling me +7000 extra cards? It'd be much more understandable if it was maybe ~1000 cards or so, but ~7000 seems quite a lot In the code, https://github.com/open-spaced-repetition/fsrs4anki-helper/blob/6e58aa90e786f7581e9db12346cb5a6052792669/reschedule.py#L87-L111 I see that it loops through the revlog history and replays the memory states, which takes into account the revlog history _before_ I resetted the card. Maybe it should start from the latest reset point? It'll probably need to loop through the revlog in reverse (starting from the end), looking for when it was last reset, then it'll start going through the memory states with that as the starting point. Otherwise, if the card wasn't reset at all, then we use the whole revlog as previously done. Also, another question, for the Optimizer, does it make sense to handle this case as well? Should we ignore all the data points in the revlog before the card reset since it's no longer "valid" data? Or does it make sense to keep it because if we ignore it, we'll have bias in the data set? I'm not too knowledgeable in this field, but just wanted to get your thoughts on this.

L.M.Sherlock · October 25, 2022, 6:16am

I don’t know the details of your own scheduler. So I have some problems to these figures:

L42 · October 25, 2022, 9:35pm

That’s probably because I was messing around with different startup intervals and forgot to change it back for this test?

The result is less pronounced on a regular user’s collection, so it’s definitely something in my collection that’s giving the algo an advantage. I’ll dig into it later. This is addicting, but also mentally distracting.

kuroahna · October 25, 2022, 10:37pm

Are you interested in making your scheduler algorithm open source for others to see and provide feedback? There’s not going to be much discussion here if all we can see is just the graphs

Vitorvlv · January 3, 2023, 9:35pm

Hi! I was just using the simulator and could use some help interpreting the results.

I’ve tried to simulate the anki vs fsrs using the deck I used during 2022 with the options I was using. These were the changes I made:

W = 0.286, 1.01, 5.143, -0.6442, -0.597, 0.0331, 1.3379, -0.2, 0.7318, 1.6617, -0.5291, 0.7266, 0.5825 (value the optimizer gave me for this deck)

requestRetention = 0.9
graduatingInterval = 3
newInterval = 0.2
easyBonus = 1.5
leechSuspend = False
new_cards_limits = 100
review_limits = 300
learn_days = 364
deck_size = 3835

and I added my deck to the simulator.

To my surprise I didn’t find the same results you did on your simulation. FSRS did managed to stick to the target retention rate (90%), but it gave so much more repetitions that I’m not sure if it was worth it.

Here are the results with the 0.9 retention:
0.9
0.9 1
0.9 2
0.9 3
0.9 4
0.9 5

Settings the target retention to 85% did result in a better performance from FSRS. But what does this means? If I’m targeting 90% retention should I just stick with default anki?

Results with 0.85:
Captura de tela de 2023-01-03 18-33-55

85 1
85 2
0.9 3
85 4
85 5

L.M.Sherlock · January 4, 2023, 2:25am

If your target retention is 90%, using default Anki will not achieve the retention.

Vitorvlv · January 4, 2023, 4:27am

That’s right, forgot about that, sorry.

I played around with the interval modifier until SM-2 was giving a similar retention to FSRS and FSRS did achieve it in less repetitions. Sorry for the confusion!

Captura de tela de 2023-01-04 01-24-48
0.78

krstoevan · April 29, 2023, 5:34pm

hi,
i seldom hear people talk about priority, but i heard from your supermemo lecture that priority is a great point in supermemo.

so, will frsr bring priority to anki? or something like that?
thanks

system · May 29, 2023, 5:34pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Anki simulator and FSRS Scheduling	1	584	July 6, 2023
Big update in FSRS4Anki v3.0.0 Scheduling	24	3472	May 1, 2023
Anki SM2 beating FSRS in FSRS Simulator Scheduling	4	1500	October 23, 2023
Why does FSRS provide fewer review than the original Anki SM-2? Scheduling	6	1787	November 9, 2023
FSRS4Anki 3.14.0 Pre-release Add-ons	2	672	February 24, 2023

Update simulator for FSRS to compare with Anki's built-in scheduler

Related topics