Laterre, A;
Fu, Y;
Jabri, MK;
Cohen, A-S;
Kas, D;
Hajjar, K;
Dahl, TS;
... Beguir, K; + view all
(2018)
Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization.
Advances in Neural Information Processing Systems 31 (NeurIPS 2018)