ranked reward alernatives? #24

pstjohn · 2020-10-04T17:16:09Z

Might be worth exploring our own versions of ranked rewards. I don't expect that paper is a canonical standard at this point, so there might be further optimizations we could look into.

Do loss rewards need to be -1? Might be easier to make a loss 0 and win 1.

A couple ideas:

pareto front optimization
Here, we'd just check to see if a molecule is pareto-optimal, i.e., whether its dominated by any other recent solution currently in the game buffer.
select count(*) > 0 from game_table where buried_vol > %s and max_spin < %s and atom_type = %s
percentile scoring
Similar to ranked rewards but perhaps less sparse, just return 0->1 rewards depending on the percentile score of the game w.r.t a game buffer
anything else from the literature?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ranked reward alernatives? #24

ranked reward alernatives? #24

pstjohn commented Oct 4, 2020

ranked reward alernatives? #24

ranked reward alernatives? #24

Comments

pstjohn commented Oct 4, 2020