This is a reinforcement learning adaptation of the Harry Potter Snitch game. The goal is for the Harry Potter figure to grab the Snitch. When the game begins, the agent explores by issuing the grab command at random. Most of these attempts miss, but each successful grab gives the agent a distance from the Snitch at which grabbing works, and it averages the first 5 hits into an ideal grabbing distance. After 5 hits, it has learned the ideal distance and will not miss the Snitch again.
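The explore-then-exploit behaviour described here can be sketched in a few lines of Python. This is only an illustration, not the game's actual code: the `SnitchAgent` class, the `run` helper, the 50% exploration rate, the `GRAB_RANGE` threshold, and the uniformly random distances are all assumptions made for the example. Only the "average the first 5 hits" rule comes from the description.

```python
import random
from statistics import mean

HITS_NEEDED = 5    # from the description: the first 5 hits are averaged
GRAB_RANGE = 1.0   # assumed: a grab succeeds only within this distance


class SnitchAgent:
    """Hypothetical agent: grabs at random until it has 5 hits, then uses their mean."""

    def __init__(self, rng):
        self.rng = rng
        self.hit_distances = []     # distances recorded at successful grabs
        self.ideal_distance = None  # set once HITS_NEEDED hits are collected

    def wants_to_grab(self, distance):
        if self.ideal_distance is None:
            # Exploration: grab at random (the 50% rate is an assumption).
            return self.rng.random() < 0.5
        # Exploitation: grab only when at or inside the learned distance.
        return distance <= self.ideal_distance

    def observe(self, distance, hit):
        # Only hits made during exploration contribute to the average.
        if hit and self.ideal_distance is None:
            self.hit_distances.append(distance)
            if len(self.hit_distances) == HITS_NEEDED:
                self.ideal_distance = mean(self.hit_distances)


def run(steps=2000, seed=0):
    """Toy environment: each step the Snitch is at a random distance in [0, 5]."""
    rng = random.Random(seed)
    agent = SnitchAgent(rng)
    misses_after_learning = 0
    for _ in range(steps):
        distance = rng.uniform(0.0, 5.0)
        if agent.wants_to_grab(distance):
            hit = distance <= GRAB_RANGE
            if agent.ideal_distance is not None and not hit:
                misses_after_learning += 1
            agent.observe(distance, hit)
    return agent, misses_after_learning


if __name__ == "__main__":
    agent, misses = run()
    print("learned distance:", agent.ideal_distance)
    print("misses after learning:", misses)
```

In this toy setup every recorded hit lies within `GRAB_RANGE`, so their average does too, which is why the agent stops missing once the ideal distance is fixed. A game with a moving Snitch or noisy grabs would not necessarily give the same guarantee.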