Reinforcement learning enhanced heuristic search for combinatorial optimization door