Deterministic benchmarking for randomized games #104

casper2002casper · 2022-03-02T13:52:49Z

When game environments are randomized, some instances will result in larger rewards than others, resulting in noise in the benchmarks. Using enough games can average this out, but this is inefficient for games which are slow to solve. This PR adds a random generator to the init function, which can be used to construct the environment. As the seed of the RNG is using the sim_id the same environments will be generated for each iteration benchmark.
Ideally, the implementation influences non-randomized games as little as possible.

casper2002casper added 5 commits March 2, 2022 14:41

Add param

0057c92

Use in benchmark

3bab31a

Add common_rl method

e9f201f

Add to games

e887381

enable for gridworld

8b6d79e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Deterministic benchmarking for randomized games #104

Deterministic benchmarking for randomized games #104

Uh oh!

casper2002casper commented Mar 2, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Deterministic benchmarking for randomized games #104

Are you sure you want to change the base?

Deterministic benchmarking for randomized games #104

Uh oh!

Conversation

casper2002casper commented Mar 2, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant