Abstract: Markov decision processes (MDPs) are widely used for modeling sequential decision-making problems under uncertainty. We propose an online algorithm for solving a class of average-reward MDPs ...
University of Pennsylvania researchers tweaked an AI tutor to tailor the difficulty of practice problems for each student.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results