Handbook Of Learning And Approximate Dynamic Progr Amming door Andrew G. Barto, Jennie Si & Andy Barto