Simplify Skill implementation #3737

xefoci7612 · 2021-10-10T12:15:07Z

Currently we handle UCI_Elo with a double randomization. This
seems unnecessary and a bit convoluted.

This patch removes the first randomization and unifies the 2 cases.

No functional change.

snicolet · 2021-10-18T19:04:11Z

Any feedback on this PR?

vondele · 2021-10-18T20:23:38Z

This looks reasonable, but it would be good to see whether things are actually equivalent: for a given UCI_Elo, it is a functional change. A match between master and the patch at a given UCI_Elo could check that this doesn't introduce a large change.

Note that it also overlaps with what is discussed in #3635

xefoci7612 · 2021-10-23T07:45:29Z

The current implementation maps Elo values to a (float) level:

double floatLevel = Options["UCI_LimitStrength"] ?
                      std::clamp(std::pow((Options["UCI_Elo"] - 1346.6) / 143.4, 1 / 0.806), 0.0, 20.0) :
                        double(Options["Skill Level"]);

It then rounds the level up or down to an integer at random, where the probability of rounding up is given by how far the float lies above the lower bounding integer:

int intLevel = int(floatLevel) +
                 ((floatLevel - int(floatLevel)) * 1024 > rng.rand<unsigned>() % 1024  ? 1 : 0);
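To make the two steps concrete, here is a minimal standalone sketch of the mapping and the rounding, using the constants quoted above. The function names are hypothetical, `r1024` stands in for `rng.rand<unsigned>() % 1024`, and the guard against a negative base is an assumption added for the sketch (it is not in the quoted code). Averaging over all 1024 possible draws shows that the expected integer level equals the float level whenever the fractional part is a multiple of 1/1024.

```cpp
#include <algorithm>
#include <cmath>

// Map a UCI_Elo value to a fractional skill level (constants from the snippet above).
// Assumption: the base is clamped to 0 first, because std::pow of a negative
// base with a non-integer exponent yields NaN.
double elo_to_float_level(double elo) {
    double base = std::max((elo - 1346.6) / 143.4, 0.0);
    return std::clamp(std::pow(base, 1 / 0.806), 0.0, 20.0);
}

// Round floatLevel up with probability (roughly) equal to its fractional part.
// r1024 is a hypothetical stand-in for rng.rand<unsigned>() % 1024.
int stochastic_level(double floatLevel, unsigned r1024) {
    int lower = int(floatLevel);
    return lower + ((floatLevel - lower) * 1024 > r1024 ? 1 : 0);
}

// Average integer level over all 1024 equally likely values of r1024.
double expected_level(double floatLevel) {
    double sum = 0.0;
    for (unsigned r = 0; r < 1024; ++r)
        sum += stochastic_level(floatLevel, r);
    return sum / 1024;
}
```

For example, a float level of 10.25 becomes 11 for 256 of the 1024 draws and 10 for the rest, so the average is again 10.25.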

If you agree, I'd propose the following.

Verify smoothness: run tests among new patched engines (not vs master) with different UCI_Elo settings and verify that the new formula is consistent, i.e. the winning ratio increases smoothly with UCI_Elo.
Verify mapping: run a test of the new patch against master without enabling the float-to-int statistical tweak, because it would just add noise for the purpose of this test, and verify that the two engines have comparable strength. To do this we could test patch vs master with a UCI_Elo for which floatLevel - int(floatLevel) == 0, so that the second formula becomes moot.
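As a rough sketch of how such UCI_Elo values could be found, the mapping quoted earlier can be inverted: an integer level L corresponds to Elo = 1346.6 + 143.4 * L^0.806. The function names below are hypothetical. Assuming UCI_Elo only takes integer values, the inverse will usually not land on an exact integer Elo, so the fractional part can only be made approximately zero by rounding to the nearest Elo.

```cpp
#include <algorithm>
#include <cmath>

// Forward mapping, with the constants from the formula quoted earlier in the thread.
// Assumption: a negative base is clamped to 0 so std::pow never returns NaN.
double elo_to_float_level(double elo) {
    double base = std::max((elo - 1346.6) / 143.4, 0.0);
    return std::clamp(std::pow(base, 1 / 0.806), 0.0, 20.0);
}

// Inverse mapping: the (generally non-integer) Elo at which floatLevel == level.
double elo_for_level(int level) {
    return 1346.6 + 143.4 * std::pow(double(level), 0.806);
}

// Fractional part of the float level at a given integer UCI_Elo setting;
// the closer to 0 (or to 1), the less the random rounding matters.
double fractional_part_at(int elo) {
    double level = elo_to_float_level(double(elo));
    return level - std::floor(level);
}
```

Calling `fractional_part_at(int(std::lround(elo_for_level(L))))` for each candidate level L gives an idea of how close to "moot" the second formula actually gets at that setting.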

vondele · 2021-10-23T09:39:55Z

I would suggest running just 3 matches of patch against master, both engines with the same UCI_Elo at fixed values (e.g. 1800, 2000, 2200), each for e.g. 20k games, and checking that the Elo difference is 'small'. This can be done on fishtest by passing additional options.
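For reference, a match result can be turned into an Elo difference with the standard logistic formula, diff = -400 * log10(1/score - 1). The sketch below is only that textbook conversion with hypothetical function names; it is not how fishtest computes or reports its statistics, and it does not define what counts as 'small'.

```cpp
#include <cmath>

// Fraction of points scored by the first engine: a win counts 1, a draw 0.5.
double match_score(long wins, long draws, long losses) {
    return (wins + 0.5 * draws) / double(wins + draws + losses);
}

// Standard logistic Elo model: Elo difference implied by a score in (0, 1).
// A score of exactly 0 or 1 gives an infinite result.
double elo_diff(double score) {
    return -400.0 * std::log10(1.0 / score - 1.0);
}
```

An even score of 0.5 maps to a difference of 0, and a score above 0.5 maps to a positive difference in favour of the first engine.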

xefoci7612 closed this Oct 30, 2021

xefoci7612 force-pushed the master branch from 1089b9e to 7262fd5 on October 30, 2021 18:28
