Tag: MDP

MDP is the abbreviation for Markov Decision Process, the formal model used in Reinforcement Learning to describe fully observable decision-making problems defined by states, actions, transitions, rewards, and a discount factor.