Code implemented for my BSc thesis "Optimal planning under Model Uncertainty" at TU Darmstadt, 2018
Abstract
Bayesian model-based reinforcement learning is an elegant formulation for learning and planning optimal behavior under model uncertainty. This work studies an extension of the Markov decision process (MDP) model used throughout the field of reinforcement learning. The formalism of Bayes-Adaptive Markov decision processes (BAMDPs) provides an intrinsic representation of model uncertainty and of the information gathered for action selection. Solving a BAMDP is therefore equivalent to finding an optimal exploration/exploitation trade-off in the underlying MDP. I reviewed two approaches to solving BAMDPs: an offline approach based on policy improvement for stochastic finite-state controllers, and an online approach using sample-based tree search with given heuristics. I applied both approaches to two problems well known in the literature: the chain problem and the four-dimensional queueing network.
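For intuition, below is a minimal Python sketch (not the thesis code) of the core BAMDP idea on the classic chain problem: the agent maintains Dirichlet pseudo-counts over transition outcomes, so the effective "hyperstate" is the pair (physical state, counts). The environment constants, slip probability, and reward values are illustrative assumptions, and the planning component (tree search or finite-state-controller improvement) is deliberately omitted.

```python
import numpy as np

N_STATES, N_ACTIONS = 5, 2  # assumed: chain of 5 states; actions: 0 = forward, 1 = back

def true_transition(state, action, slip, rng):
    """Chain dynamics: 'forward' moves right along the chain, 'back' resets
    to state 0; with probability `slip` the action's effect is inverted."""
    if rng.random() < slip:
        action = 1 - action
    return min(state + 1, N_STATES - 1) if action == 0 else 0

def reward(state, action):
    # Assumed payoffs: small reward for resetting, large reward at the chain's end.
    if action == 1:
        return 2.0
    return 10.0 if state == N_STATES - 1 else 0.0

def posterior_mean(counts):
    """Posterior-mean transition model P(s' | s, a) under the Dirichlet prior."""
    return counts / counts.sum(axis=2, keepdims=True)

# Dirichlet hyperparameters: one pseudo-count per (s, a, s') triple.
# A uniform prior of 1 encodes the initial model uncertainty.
counts = np.ones((N_STATES, N_ACTIONS, N_STATES))

rng = np.random.default_rng(0)
state = 0
for _ in range(1000):
    action = int(rng.integers(N_ACTIONS))      # placeholder: random exploration
    next_state = true_transition(state, action, slip=0.2, rng=rng)
    counts[state, action, next_state] += 1     # Bayesian belief update
    state = next_state

print(posterior_mean(counts)[0])  # learned transition model for state 0
```

A BAMDP planner would select actions by planning over these hyperstates rather than at random, which is exactly where the exploration/exploitation trade-off arises: actions are valued both for their immediate reward and for the information their outcomes add to the counts.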