Quick Start Benchmarking
Execution
To run an experiment, simple execute the following command with configuration file path input
python3 light_malib/main_pb.py --config expr_configs/xxx/.../xxx.yaml
All the configuration files can be found in .expr_configs/, including those for academy scenarios, full-game scenarios and a PSRO trial.
Configuration file
Each configuration file define detailed settings for rollout, training, data storage, logging, model setup, population and more.
Framework Config
The framework section defines type of learning
|
maximum number of generation (only in PBT) |
|
type of meta solver (only in PBT) |
|
synchronous if True else asynchronous mode |
|
maximum number of rollout iteration |
Rollout Manager Config
The rollout manager section defines rollout settings
|
number of parallel rollout worker |
|
size of data batch collected |
|
rollout episode maximum length |
|
truncation length (no trucation if 0) |
|
configs of the environment |
Training Manager Config
The training manager section defines training settings
|
training batch size |
|
number of trainers (or GPU number) |
|
rollout episode maximum length |
|
optimizer type |
|
learning rate of the actor |
|
learning rate of the critic |
DataServer Config
The data server section defines data storage
|
data table size |
|
data sampling scheme |
|
maximum reusage of each sample |
Population Config
The population section define the whole population, including the trainable policies. For each algorithm:
Model config:
|
model type (actor critic type, feature encoder type) |
|
model initialization |
|
actor network setting |
|
critic network setting |
Custom config:
|
configs for feature encoder |
|
discount value |
|
training epoch |
Policy init cfg
|
population pool for agent 0 |
|
how agent 0 is initialize in each condition (random, pretrained or interit) |
|
population pool for agent 1 |
Monitoring & Saving
The learning statistics will be recorded in ./logs path, including the tensorboard file, saved config file, saved policies and others.