Optimal Behavior is Easier to Learn than the Truth.
Ronald OrtnerPublished in: Minds and machines (2016)
We consider a reinforcement learning setting where the learner is given a set of possible models containing the true model. While there are algorithms that are able to successfully learn optimal behavior in this setting, they do so without trying to identify the underlying true model. Indeed, we show that there are cases in which the attempt to find the true model is doomed to failure.
Keyphrases