dc.contributor.author |
Dąbrowski, Bartosz |
|
dc.date.accessioned |
2023-01-26T11:45:16Z |
|
dc.date.available |
2023-01-26T11:45:16Z |
|
dc.date.issued |
2023-01-26 |
|
dc.identifier.issn |
2022/I/F/14 |
|
dc.identifier.uri |
https://repin.pjwstk.edu.pl/xmlui/handle/186319/2336 |
|
dc.description.abstract |
The aim of this article is to present the project about the use of Deep Q-learning and neuroevolution used by agent to learn how to control a lander for OpenAI Gym library, in LunarLander-v2 environment.
The beginning of the article concerns theoretical knowledge about machine learning, neural networks, reinforcement learning and the algorithms used by agent to learn how to control the lander. Next, the article is about the effects resulting from work on the project:
Lander’s environment, the script written in Python capable of learning how to control lander. |
pl_PL |
dc.language.iso |
other |
pl_PL |
dc.relation.ispartofseries |
;Nr 6846 |
|
dc.subject |
Evolutionary algorithm |
pl_PL |
dc.subject |
Supervised learning |
pl_PL |
dc.subject |
Unsupervised learning |
pl_PL |
dc.subject |
Back propagation |
pl_PL |
dc.subject |
Markov decision process |
pl_PL |
dc.subject |
Q-learning |
pl_PL |
dc.subject |
Q-network |
pl_PL |
dc.subject |
Deep Q-network |
pl_PL |
dc.subject |
Deep Q-learning |
pl_PL |
dc.subject |
Double DQN (Double DQN) |
pl_PL |
dc.subject |
Dueling DQN (DQN) |
pl_PL |
dc.subject |
Prior Experience Replay (PER) |
pl_PL |
dc.subject |
Neural Networks |
pl_PL |
dc.subject |
Reinforcement Learning |
pl_PL |
dc.subject |
Machine Learning |
pl_PL |
dc.subject |
Genetic Algorithms |
pl_PL |
dc.subject |
Genetic Programming |
pl_PL |
dc.subject |
Evolutionary Strategies |
pl_PL |
dc.subject |
OpenAI Gym |
pl_PL |
dc.subject |
Jacobians |
pl_PL |
dc.subject |
Hessians |
pl_PL |
dc.subject |
Neuroevolution |
pl_PL |
dc.title |
Porównanie efektywności Double DQN i neuroewolucji dla prostych problemów sterowania |
pl_PL |
dc.title.alternative |
Comparison of effectiveness of Double DQN and neuroevolution for simple control problems |
pl_PL |
dc.type |
Thesis |
pl_PL |