An AlphaGo Moment for Urban Simulation: Training Reinforcement Learning Agents on the Authentic Micropolis Engine