Laterre, A; Fu, Y; Jabri, MK; Cohen, A-S; Kas, D; Hajjar, K; Dahl, TS; ... Beguir, K; + view all (2018) Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization. Advances in Neural Information Processing Systems 31 (NeurIPS 2018) Green open access