| dc.contributor.author | Hewawasam, G. H. T. M. P | |
| dc.date.accessioned | 2022-03-07T06:27:55Z | |
| dc.date.available | 2022-03-07T06:27:55Z | |
| dc.date.issued | 2021 | |
| dc.identifier.citation | Hewawasam, G. H. T. M. P (2021) Boosting the Exploration of Reinforcement Learning Agent Through Human Demonstrations. BSc. Dissertation Informatics Institute of Technology | en_US |
| dc.identifier.issn | 2017414 | |
| dc.identifier.uri | http://dlib.iit.ac.lk/xmlui/handle/123456789/855 | |
| dc.description.abstract | " Exploration is a challenging task in reinforcement learning environments with sparse reward signals. This problem is exponential in long horizon tasks with complex state and action spaces which makes the applicability of reinforcement learning in real world problems out of practical use. This research work is to introduce a novel learn from demonstration approach to address this problem by combining human expertise and speed up the training process by minimizing the ignorant learning process of the reinforcement learning agent. Our method is built upon the Deep Deterministic Policy Gradient(DDPG) algorithm and Hindsight Experience replay(HER). This novel approach is successfully proven to outperform the learning efficiency of the baseline algorithm and account for the suboptimality of expert demonstration and overfitting bias of current approaches. " | en_US |
| dc.language.iso | en | en_US |
| dc.subject | Learning from Demonstrations | en_US |
| dc.subject | Reinforcement Learning | en_US |
| dc.title | Boosting the Exploration of Reinforcement Learning Agent Through Human Demonstrations | en_US |
| dc.type | Thesis | en_US |