Preventing AI Cheating
01
Training Basics
Key Concept
When AI Finds Loopholes
- AI learns by earning points for good behavior.
- Sometimes it finds 'cheats' to get points without doing the actual work.
- Researchers call this problem 'Reward Hacking' or 'Specification Gaming'.
1 / 8
