SP16: REINFORCEMENT LEARNING FOR AI: 32711