Reinforcement-learning pricing algorithms sometimes converge to supra-competitive prices even in markets where collusion is impossible by design or cannot be an equilibrium outcome. We analyze when such spurious collusion may arise, and when instead the algorithms learn genuinely collusive strategies, focusing on the role of the rate and mode of exploration.