Refactor code #75

vlopezferrando · 2024-12-31T00:20:26Z

Hi, I've followed this repository for a while and thought I'd contribute some refactors of the code. I have tried to keep the logic the same (haven't changed the tests) but simplified the logic wherever possible and reduced the amount of repeated code.

This is a work in progress, but I'd like to know if work like this would be welcome and take any suggestions.

My plan was to simplify the code as much as possible and then port it to my other project simple-spaced-repetition, which has a more minimalistic API (single Card class).

Update: this ended up being an almost complete rewrite of the code, but keeping the same functionality. Let me summarize the changes and the logic behind them.

Code split in 3 files

I have split the code in 3 files:

models.py: which has the Rating, State, Card and ReviewLog classes. These classes just serve the purpose of holding the data and its serialization, but there is no business logic.
fsrs_v5.py: a new FSRSv5 class that encapsulates all the FSRS v5 logic. To put it in another way, this is the place where all the 19 weights are used, and only the 19 weights are used.
scheduler.py: the Scheduler class implements the top level class that users should use. It is a wrapper around FSRS v5 that adds learning/relearning steps, fuzzing and interval clamping.

Test changes

The tests have stayed the same except for two cases:

The scheduler now has a fsrs attribute with a FSRSv5 object. The attributes' comparison when serializing/deserializing failed because objects had a different address.
Instead of raising a ValueError when using a naive datetime, I believe it is more convenient to just add a UTC timezone to the datetime. Also, if the datetime is timezone aware but has a different timezone, we can use it directly.

Important refactors

The logic stays the same for the rest of the code, but I would highlight some of the improvements on the new approach:

Centralizing all the FSRS v5 algorithm in a single class: it makes it much easier to understand the algorithm, plus I added docstrings with explanations from the algorithm spec.
Refactor of review_card. Before it was a huge function, now it is much smaller and easier to understand.
Extraction of update_from_steps function, that is used both for learning and relearning steps.
Reduced the lines of code by simplifying a lot of conditionals and other logic, while increasing the amount of comments to make the code more understandable.

I know this is a huge rewrite, but hopefully you find it useful. Even if you don't feel like merging all of it, I think it can provide some ideas on how to improve the code.

joshdavham · 2024-12-31T02:25:28Z

Hey Victor, I should be able to look at this PR in about two or three days, since I'm currently travelling at the moment.

But as for initial thoughts: we are likely going to be adding an optimizer to this package in the coming month which may change the codebase a fair bit so I'd be a bit hesitant to do another refactoring right away. I'd feel more comfortable to implement the optimizer first, before any refactoring.

vlopezferrando · 2024-12-31T09:10:52Z

Hi Josh, thanks for your reply. I'll take this couple days to clean up the PR and implement some other improvements I still have in mind.

I understand that adding the optimizer is the priority, but if the refactor could cut code size by half and simplify the logic, it may actually be a good idea to do the refactor before adding new code. Your call!

I noticed when tweaking the code that tests are not completely exhaustive: some logic changes resulted in tests passing anyway. I wonder: is there some related project (from another programming language) that we could port tests from? If so, I'd be happy to contribute some tests too to make the refactor safer.

vlopezferrando added 23 commits December 30, 2024 23:16

extract initial stability from parameters

aafb71a

precompute initial difficulty

d9a9df9

compact fuzz ranges

9e32148

join functions

ae2bf1a

extract variable

b61551a

Extract hard_penalty and easy_bonus from parameters

00f9cac

Send card to _next_stability

9d9a209

simplified _next_difficulty

2edcee0

Simplify _next_interval

5148591

Reuse code. is this logic actually correct?

2fe0f5f

Remove unused code

c51a4ff

Join short and long term stability update

3fe2bd5

Simplify logic

70da887

make _next_interval return timedelta

d2aad37

Unify steps logic

82040a7

Send full card to _next_difficulty

7220e7a

Simplify conditions

1080968

Simplify

6bd8234

simplify conditions

f9413bd

Join condifionts

3a915da

Simplify logic for steps

e081d3c

Simplify initialization of stability and difficulty

c3dc0ec

Reorder comparison

52b860e

vlopezferrando added 5 commits December 31, 2024 10:29

Generalize steps for hard rating

60b6e6e

Add extra functions

efddaff

Inline function

8048a1e

Skip function call

8828a71

remove methods

20b4ffa

vlopezferrando added 30 commits January 1, 2025 01:02

Extract initial_difficutly method

9b4128f

Refactor next_difficulty

1be505b

Make explicit the usage of the first 4 parameters

5e6a285

Be more consistent

921d257

Remove docstring

851d372

Be more concise

1412d8c

Join ifs

c7d11e4

Small tidy

0dbde85

small tidy

7df9de3

small tidy

e43f0e4

Recover original formula

708dd37

extract algorithm to separate file

46133c0

split code in multiple files

fb449c0

rename file

a5874c3

add file

2a06860

Add docs to fsrs v5

5e2f45c

simplify model code

a960b4b

simplify comparison

9c0a393

convert datetimes to utc instead of raising an error

78ef106

join ifs

c86a522

remove decay as parameter

0201af2

remove clamp helper function

3829e03

simpler serialization

6450411

keep tz

f055a5d

slightly more robust

5f3ddc9

tidy

1119b8b

improve docstrings

73f03b3

remove outdated doc comment

17a5e54

rename variables for concuseness

24392cd

Recover get_retrievability for backwards compatibility

30b5386

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor code #75

Refactor code #75

vlopezferrando commented Dec 31, 2024 •

edited

Loading

joshdavham commented Dec 31, 2024

vlopezferrando commented Dec 31, 2024

Refactor code #75

Are you sure you want to change the base?

Refactor code #75

Conversation

vlopezferrando commented Dec 31, 2024 • edited Loading

Code split in 3 files

Test changes

Important refactors

joshdavham commented Dec 31, 2024

vlopezferrando commented Dec 31, 2024

vlopezferrando commented Dec 31, 2024 •

edited

Loading