Model of the Environment
Model of the Environment
A model of the environment is a function which predicts state transitions and rewards. One of the most important branching points in an RL algorithm is the question of whether the agent has access to (or learns) a model of the environment. Algorithms which use a model are called model-based methods, and those that don’t are called model-free.
- Pros of Model-Based Methods:
- Enables agents to simulate potential outcomes before acting
- Allows strategic thinking and systematic option evaluation
- Improves decision-making efficiency
- Can substantially enhance sample efficiency
- Cons of Model-Based Methods:
- Ground-truth environmental models are rarely available
- Requires learning models purely from experience
- High risk of model bias leading to suboptimal real-world performance
- Model-learning is inherently challenging
- Significant time and computational investment may not yield results