
Modelling real-life systems and phenomena using mathematical based formalisms is ubiquitous in science and engineering. The reason is that mathematics oer a suitable framework to carry out formal and rigorous analysis of these systems. For instance, in software engineering, formal methods are among the most ecient tools to identify aws in software. The behavior of many real-life systems is inherently stochastic which require stochastic models such as labelled Markov processes (LMPs), Markov decision processes (MDPs), predictive state representations (PSRs), etc. This thesis is about quantifying the dierence between stochastic systems. The important point of the thesis is that reinforcement learning (RL), a branch of articial intelligence particularly ecient in presence of uncertainty, can be used to quantify eciently the divergence between stochastic systems. The key idea is to dene an MDP out of the systems to be compared and then to interpret the optimal value of the MDP as the divergence between them. The most appealing feature of the proposed approach is that it does not rely on the knowledge of the internal structure of the systems.
Page Count:
164
Publication Date:
2012-02-01
ISBN-10:
3847339710
ISBN-13:
9783847339717
No comments yet. Be the first to share your thoughts!