DeepSky ATC: 50-Agent Coordination Verified

March 21, 2026

Milestone: GAT Successfully Scaled to 50 Agents

Reward validation:

Theoretical maximum: +25,000 total (+350 progress for 70km diagonal flights + 150 bonus/solo × 50 agents)
Empirical success: episode_reward_max reaching +24,130
Mean reward climbing to +16,000

Performance metrics:

This proves the GAT architecture successfully generalized the “Zipper Merge” logic from 8 agents to 50.

High mean reward indicates the policy is robust across random spawn geometries. Not just cherry-picked success cases.

Random scenario, 50 agents:

Agents spawn at random positions with random headings. Must navigate to center goal zone while maintaining separation (5 NM minimum).

GAT attention mechanism enables coordination at scale. The policy learned to:

Prioritize high closure-rate threats over nearby parallel traffic
Coordinate zipper merge in dense airspace
Maintain 20% observability (K=10 neighbors out of 49) without catastrophic performance degradation

Final GAT production run complete. Ready for baseline comparisons.