Jan Löwenstrom
|
b0ca634b64
|
add every visit no jump results
|
2020-04-02 17:07:15 +02:00 |
Jan Löwenstrom
|
eca0d8db4d
|
create Dino Sampling state
|
2020-03-26 19:22:50 +01:00 |
Jan Löwenstrom
|
ee1d62842d
|
split Antworld into episodic and continuous task
- add new simple state for jumping dino, to see if convergence is guarenteed with with state representation
- changed reward structure for ant game
|
2020-03-15 16:58:53 +01:00 |
Jan Löwenstrom
|
6613e23c7c
|
Fixed new method name for MC
|
2020-03-02 23:19:54 +01:00 |
Jan Löwenstrom
|
584d6a1246
|
add javaFX gradle plugin and switch to java11 and add system.outs for error detecting
- The current implementation will not converge to the correct behaviour. See comment in MonteCarlo class for more details
|
2019-12-10 15:37:20 +01:00 |
Jan Löwenstrom
|
db9b62236c
|
add logic to handle ant action and compute rewards
- ant world will handle and compute action received by the agent
- first try to convert observations to markov states
- improved .equals() methods
|
2019-12-08 16:03:00 +01:00 |
Jan Löwenstrom
|
66ee33b77f
|
init the gradle project
|
2019-12-06 13:11:29 +01:00 |