As the other examples we visualize a state
WWWWWWWWWWWWOOOOOOOYOOOYOOOYOOOYGGGGGGGGGGGGGGGGWRRRWRRRWRRRWRRRBBBBBBBBBBBBBBBBRRRRYYYYYYYYYYYY frontCounterClockwise(2) 100 WWWWWWWWWWWWWWWWOOOOOOOOOOOOOOOOGGGGGGGGGGGGGGGGRRRRRRRRRRRRRRRRBBBBBBBBBBBBBBBBYYYYYYYYYYYYYYYY WWWWWWWWWWWWRRRROOOWOOOWOOOWOOOWGGGGGGGGGGGGGGGGYRRRYRRRYRRRYRRRBBBBBBBBBBBBBBBBOOOOYYYYYYYYYYYY frontClockwise(1) 100 WWWWWWWWWWWWWWWWOOOOOOOOOOOOOOOOGGGGGGGGGGGGGGGGRRRRRRRRRRRRRRRRBBBBBBBBBBBBBBBBYYYYYYYYYYYYYYYY RRRRWWWWWWWWWWWWWOOOWOOOWOOOWOOOGGGGGGGGGGGGGGGGRRRYRRRYRRRYRRRYBBBBBBBBBBBBBBBBYYYYYYYYYYYYOOOO backCounterClockwise(4) 100 WWWWWWWWWWWWWWWWOOOOOOOOOOOOOOOOGGGGGGGGGGGGGGGGRRRRRRRRRRRRRRRRBBBBBBBBBBBBBBBBYYYYYYYYYYYYYYYY OOOOWWWWWWWWWWWWYOOOYOOOYOOOYOOOGGGGGGGGGGGGGGGGRRRWRRRWRRRWRRRWBBBBBBBBBBBBBBBBYYYYYYYYYYYYRRRR backClockwise(3) 100 WWWWWWWWWWWWWWWWOOOOOOOOOOOOOOOOGGGGGGGGGGGGGGGGRRRRRRRRRRRRRRRRBBBBBBBBBBBBBBBBYYYYYYYYYYYYYYYY WGWWWGWWWGWWWGWWOOOOOOOOOOOOOOOOGYGGGYGGGYGGGYGGRRRRRRRRRRRRRRRRBBWBBBWBBBWBBBWBYBYYYBYYYBYYYBYY innerLeftDown(6) 100 WWWWWWWWWWWWWWWWOOOOOOOOOOOOOOOOGGGGGGGGGGGGGGGGRRRRRRRRRRRRRRRRBBBBBBBBBBBBBBBBYYYYYYYYYYYYYYYY
Syntax: state action reward next-state
Results (virtually it can solve the problem, you just need a huge amount of storage):
As we observed our agent can solve problems but depends with the data given. next we will apply qlearning in a famous game flappy bird.
Lets use q-learning for flappy bird game