Commit Graph

8 Commits

Author SHA1 Message Date
Jan Löwenstrom a40e279f48 change reward function for antgame to match BA 2020-04-04 14:41:58 +02:00
Jan Löwenstrom 740289ee2b add constant for default reward 2020-04-02 14:01:37 +02:00
Jan Löwenstrom f4b50627d1 add antGame analysis data and R Scripts and images 2020-03-29 17:22:47 +02:00
Jan Löwenstrom ee1d62842d split Antworld into episodic and continuous task
- add new simple state for jumping dino, to see if convergence is guarenteed with with state representation
- changed reward structure for ant game
2020-03-15 16:58:53 +01:00
Jan Löwenstrom 7db5a2af3b add fix RNG, add extended interface EpsilonPolicy and move rewardHistory to model instead of view
- only setting the seed of RNG once at the beginning and not reseeding it afterwards. Deep copying
the initial AntWorld to use as blueprint for resetting the world instead of reseeding and creating pesudo random again. Reseeding the RNG has influence action selecting to always
choose the same trajectory.
- instance of is used to determine if policy has epsilon or not and the view will adopt to this, only showing epsilon slider if policy has epsilon
2019-12-20 16:51:09 +01:00
Jan Löwenstrom 584d6a1246 add javaFX gradle plugin and switch to java11 and add system.outs for error detecting
- The current implementation will not converge to the correct behaviour. See comment in MonteCarlo class for more details
2019-12-10 15:37:20 +01:00
Jan Löwenstrom 55d8bbf5dc add Random-, Greedy and EGreedy-Policy and first implementation of monte carlo method
- fixed bug regarding wrong generation of hashCode. hashCodes needs to be equal across equal objects. Compute hashCode on final states once and return this value instead of computing it every time .hashCode() gets called.
-
2019-12-09 23:21:48 +01:00
Jan Löwenstrom db9b62236c add logic to handle ant action and compute rewards
- ant world will handle and compute action received by the agent
- first try to convert observations to markov states
- improved .equals() methods
2019-12-08 16:03:00 +01:00