* startingState -> MOVE_LEFT : very first state action in the episode i = 1 - * image the agent does not collect the food and drops it to the start, the agent will receive + * image the agent does not collect the food and does not drop it onto start, the agent will receive * -1 for every timestamp hence (startingState -> MOVE_LEFT) will get a value of -10; *
* BUT image moving left from the starting position will have no impact on the state because
@@ -30,12 +30,12 @@ import java.util.*;
*
* @param
*/
-public class MonteCarloControlEGreedy extends EpisodicLearning {
+public class MonteCarloControlFirstVisitEGreedy extends EpisodicLearning {
private Map