Specter's Bridge

A First-Principles Laboratory of Stochastic Adversarial Search, RL, and Evolution

Planks: 12 Slip Prob: 0.20 Mode: Normal Play

The Chasm Arena

MAX (Traveler) 0
Active Turn MAX
MIN (Specter) 0

Chronicle of Moves

Chasm environment initialized.

The Decision Tree Explorer

Star2 Pruning
States Checked: 0 Branches Cut: 0 Value at Root: 0.0
MAX Node MIN Node Chance Node Pruned

Heuristics & Learning Labs

Initialize Environment Parameters

Bridge Hazards Key:

Treasure (+2)
Void Trap (-3)
Slippery (Double Slip)
Trains a model-free temporal difference agent to learn optimal actions without tree lookahead.
Agent status: Untrained (Using default policies).
Evolves Heuristic evaluation weights via tournament round-robin matches.

Best Heuristic Weights Evolved:

Gold Weight: 2.00
Trap Weight: -3.00
Progress Weight: 1.50
Risk Weight: -2.00