The principle of "optimism in the face of uncertainty" is proposed to solve the exploration-exploitation trade-off. Many algorithms in reinforcement learning that solve the tradeoffs follow this principle. (*8)
In the case of optimistic misunderstanding, they play that slot machine again. After several times of play. They find that the probability is not as good as the experience.
They do not have a chance to correct pessimistic misunderstandings, but they have a chance to correct optimistic misunderstanding later. In other words, pessimistic misunderstandings are more severe than optimistic misunderstandings. Therefore, by making judgments optimistic, they are well-balanced. It is the principle of "optimism in the face of uncertainty."
*8: Bubeck, S. and Cesa-Bianchi, N., 2012. Regret analysis of stochastic and nonstochastic multi-armed bandit problems. Foundations and Trends® in Machine Learning, 5(1), pp.1-122.
→task×choose_a_task×prioritize×prioritization×(2.2.1)_a_burden_of_sorting×(2.2.2)_we_can_not_compare_magnitude_unless_it_is_one_dimension×(2.2.3) We can not compare magnitude when there is an uncertain factor?×(2.2.4)_prioritize_important_tasks×(2.2.5)_you_do_not_have_to_determine_the_priority_now→