kono/refo - refo - Lukas Mahler

Author	SHA1	Message	Date
Jan Löwenstrom	b0ca634b64	add every visit no jump results	2020-04-02 17:07:15 +02:00
Jan Löwenstrom	eca0d8db4d	create Dino Sampling state	2020-03-26 19:22:50 +01:00
Jan Löwenstrom	ee1d62842d	split Antworld into episodic and continuous task - add new simple state for jumping dino, to see if convergence is guarenteed with with state representation - changed reward structure for ant game	2020-03-15 16:58:53 +01:00
Jan Löwenstrom	6613e23c7c	Fixed new method name for MC	2020-03-02 23:19:54 +01:00
Jan Löwenstrom	584d6a1246	add javaFX gradle plugin and switch to java11 and add system.outs for error detecting - The current implementation will not converge to the correct behaviour. See comment in MonteCarlo class for more details	2019-12-10 15:37:20 +01:00
Jan Löwenstrom	db9b62236c	add logic to handle ant action and compute rewards - ant world will handle and compute action received by the agent - first try to convert observations to markov states - improved .equals() methods	2019-12-08 16:03:00 +01:00
Jan Löwenstrom	66ee33b77f	init the gradle project	2019-12-06 13:11:29 +01:00