a model-based approach for continuous-time policy evaluation with unknown lévy process dynamics

printable pdf

比利时vs摩洛哥足彩 ,
university of california san diego

****************************

math 278a - center for computational mathematics seminar

qihao ye

ucsd

a model-based approach for continuous-time policy evaluation with unknown lévy process dynamics

abstract:

this research presents a framework for evaluating policies in a continuous-time setting, where the dynamics are unknown and represented by lévy processes. initially, we estimate the model using available trajectory data, followed by solving the associated pde to conduct the policy evaluation. our approach encompasses not only the conventional brownian motion but also the non-gaussian and heavy-tailed lévy processes. we have developed an algorithm that demonstrates enhanced performance compared to existing techniques tailored for brownian motion. furthermore, we provide a theoretical guarantee regarding the error in policy evaluation given the model error. experimental results involving both light-tailed and heavy-tailed data will be presented. this research provides a first step to continuous-time model-based reinforcement learning, particularly in scenarios characterized by irregular, heavy-tailed dynamics.

november 28, 2023

11:00 am

ap&m 2402 and zoom id 915 4615 4399

****************************

比利时vs摩洛哥足彩 , university of california san diego

math 278a - center for computational mathematics seminar

qihao ye

ucsd

a model-based approach for continuous-time policy evaluation with unknown lévy process dynamics

abstract:

november 28, 2023

11:00 am

ap&m 2402 and zoom id 915 4615 4399

比利时vs摩洛哥足彩 ,
university of california san diego