Commit Graph

12 Commits

Author SHA1 Message Date
Jan Löwenstrom b1d06293fe add shadowJar 2020-03-05 12:25:42 +01:00
Jan Löwenstrom 1f743cf8f2 fix eps/sec stat 2020-03-05 12:09:36 +01:00
Jan Löwenstrom 18d6e32f64 split DinoWorld between simple and advanced example 2020-03-05 11:58:57 +01:00
Jan Löwenstrom cffec63dc6 apply threading changes to master branch and clean up for tag version
- no testing or epsilon testing stuff
2020-03-05 11:49:51 +01:00
Jan Löwenstrom 0e4f52a48e first epsilon decaying method 2020-02-27 15:29:15 +01:00
Jan Löwenstrom cff1a4e531 add isJumping info to dinoState 2020-02-26 17:14:28 +01:00
Jan Löwenstrom 77898f4e5a add TD algorithms and started adopting to continous tasks
- add Q-Learning and SARSA
- more config variables
2020-02-17 13:56:55 +01:00
Jan Löwenstrom f4f1f7bd37 add QTableFrame and clickable states that display a gui
- remove org.javaTuple in favour of org.apache.common for tuples and circleQueue
- remove ViewListener from non-GUI Controller
- stateActionTable saves the last 10 states that changed. They will get displayed in QTable Frame
in JTextAreas
2020-01-01 23:54:18 +01:00
Jan Löwenstrom 518683b676 split GUI parts from controller into sub class 2019-12-31 14:43:40 +01:00
Jan Löwenstrom 195722e98f enhance save/load feature and change thread handling
- saving monte carlo did not include returnSum and returnCount, so it the state would be wrong after loading. Learning, EpisodicLearning and MonteCarlo classes are all overriding custom save and load methods, calling super() each time but including fields that are necessary to replace on runtime.
- moved generic episodic behaviour from monteCarlo to abstract top level class
- using AtomicInteger for episodesToLearn
- moved learning-Thread-handling from controller to model. Learning got one extra Leaning thread.
- add feature to use custom speed and distance for dino world obstacles
2019-12-29 01:12:11 +01:00
Jan Löwenstrom b2c3854b3a change RL-Controller initialization process and action space iterable
- no fake builder pattern anymore, moved needed fields into constructor
- add serializeUID
- action space extends iterable interface to simplify looping over all actions (and not returning the actual list)
2019-12-24 19:38:35 +01:00
Jan Löwenstrom 5a4e380faf add dino jumping environment, deterministic/reproducable behaviour and save-and-load feature
- add feature to save and load learning progress (Q-Table) and current episode count
- episode end is now purely decided by environment instead of monte carlo algo capping it on 10 actions
- using linkedHashMap on all locations to ensure deterministic behaviour
- fixed major RNG issue to reproduce algorithmic behaviour
- clearing rewardHistory, to only save the last 10k rewards
- added google dino jump environment
2019-12-22 23:33:56 +01:00