Tag: MDP (Markov Decision Process)

A Markov decision process (MDP) is the mathematical model used to formalize reinforcement learning (RL) environments. Posts under this tag offer clear explanations, diagrams, and examples showing how states, actions, rewards, and transition probabilities fit together, and why most RL algorithms are built on this framework.
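
As a rough illustration of how states, actions, rewards, and transitions combine, here is a minimal sketch of a toy MDP solved by value iteration. The two states ("cool", "warm"), the two actions ("slow", "fast"), the reward numbers, and the discount factor are all made up for this example and do not come from any particular post or library.

```python
# Hypothetical two-state MDP, for illustration only.
# TRANSITIONS[state][action] is a list of (probability, next_state, reward).
TRANSITIONS = {
    "cool": {
        "slow": [(1.0, "cool", 1.0)],
        "fast": [(0.5, "cool", 2.0), (0.5, "warm", 2.0)],
    },
    "warm": {
        "slow": [(0.5, "cool", 1.0), (0.5, "warm", 1.0)],
        "fast": [(1.0, "warm", -10.0)],
    },
}
GAMMA = 0.9  # assumed discount factor


def q_value(values, state, action, gamma=GAMMA):
    """Expected return of taking `action` in `state`, then following `values`."""
    return sum(
        prob * (reward + gamma * values[next_state])
        for prob, next_state, reward in TRANSITIONS[state][action]
    )


def value_iteration(gamma=GAMMA, tol=1e-8):
    """Repeat Bellman optimality backups until the values stop changing."""
    values = {state: 0.0 for state in TRANSITIONS}
    while True:
        new_values = {
            state: max(q_value(values, state, action, gamma)
                       for action in TRANSITIONS[state])
            for state in TRANSITIONS
        }
        change = max(abs(new_values[s] - values[s]) for s in TRANSITIONS)
        values = new_values
        if change < tol:
            return values


def greedy_policy(values, gamma=GAMMA):
    """Pick, in each state, the action with the highest expected return."""
    return {
        state: max(TRANSITIONS[state],
                   key=lambda action: q_value(values, state, action, gamma))
        for state in TRANSITIONS
    }


if __name__ == "__main__":
    optimal_values = value_iteration()
    print(optimal_values)
    print(greedy_policy(optimal_values))
```

The nested dictionary is only one possible encoding. It keeps the four ingredients visible in one place, which suits a small example, but larger problems would more likely use arrays or a simulator that samples transitions.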