The Chasm Arena
MAX (Traveler)
0
Active Turn
MAX
MIN (Specter)
0
Chronicle of Moves
Chasm environment initialized.
The Decision Tree Explorer
States Checked: 0
Branches Cut: 0
Value at Root: 0.0
Heuristics & Learning Labs
Initialize Environment Parameters
Bridge Hazards Key:
Treasure (+2)
Void Trap (-3)
Slippery (Double Slip)
Trains a model-free temporal difference agent to learn optimal actions without tree lookahead.
Agent status: Untrained (Using default policies).
Evolves Heuristic evaluation weights via tournament round-robin matches.
Best Heuristic Weights Evolved:
Gold Weight: 2.00
Trap Weight: -3.00
Progress Weight: 1.50
Risk Weight: -2.00