I find either theories or python example which is not satisfactory as a beginner. I just need to understand a simple example for understanding the step by step iterations. Could anyone please show me the 1st and 2nd iterations for the Image that I have uploaded for value iteration? Grid world problem
I recommend this PDF: http://www.cis.upenn.edu/~cis519/fall2015/lectures/14_ReinforcementLearning.pdf, which is very clear about the grid world problem. And there are codes on github:
https://github.com/kevlar1818/grid-world-rl
Hope those help.