A generic multi-agent reinforcement learning approach for scheduling problems door