- Concepts used in [reinforcement learning](/en/reinforcement%20learning).
- exploitation-exploration trade-offs
- There are variations in whether "use" or "exploration" comes first and in how "exploitation" is translated.
- [Trade-off between exploration and exploitation](/en/Trade-off%20between%20exploration%20and%20exploitation) (My own notation has been inconsistent as well.)
- Trade-off between use and exploration 62200 98
- Trade-off between search and use 50900 294
- The Dilemma of Search and Knowledge Use
If one always chooses the option that currently seems most useful, one loses the opportunity to discover that other options are even more useful.
On the other hand, if one keeps searching for more useful options, one never benefits from the useful options already found.
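The dilemma above can be sketched with a multi-armed bandit and an epsilon-greedy policy, one common way to mix the two behaviors. This is a minimal illustration, not something taken from the note itself: the function name `run_bandit`, the parameters `true_means`, `epsilon`, `steps`, and `seed`, and the Bernoulli reward model are all assumptions made for the example.

```python
import random


def run_bandit(true_means, epsilon, steps, seed=0):
    """Play a Bernoulli bandit with an epsilon-greedy policy (illustrative sketch).

    true_means: hypothetical success probability of each option (arm).
    epsilon:    probability of exploring a random arm on each step.
    Returns (total_reward, counts), where counts[i] is how often arm i was chosen.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running average reward per arm
    total_reward = 0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Exploration: try an arm at random, even one that looks bad.
            arm = rng.randrange(n_arms)
        else:
            # Exploitation: pick the arm with the best estimate so far
            # (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = 1 if rng.random() < true_means[arm] else 0
        counts[arm] += 1
        # Incremental update of the average reward for the chosen arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return total_reward, counts


if __name__ == "__main__":
    means = [0.0, 0.0, 1.0]  # assumed: only the last option ever pays off
    print("pure exploitation:", run_bandit(means, epsilon=0.0, steps=1000))
    print("10% exploration:  ", run_bandit(means, epsilon=0.1, steps=1000))
```

With `epsilon=0.0` the agent in this sketch never leaves the first option it tried, so it never learns that another option is better. With a small positive `epsilon` it spends some steps on apparently worse options, which is the cost of exploration, but it can then find and exploit the better one.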
The concept developed significantly in the field of reinforcement learning, but it first appeared much earlier. - George Edward Pelham Box
Also used in the area of organizational learning
- Related
  - filter bubble [filter bubble - Wikipedia https://ja.wikipedia.org/wiki/%E3%83%95%E3%82%A3%E3%83%AB%E3%82%BF%E3%83%BC%E3%83%90%E3%83%96%E3%83%AB]
    - By getting information through a filter that matches your ideology, you get stuck in a local minimum of that ideology.
  - selection and concentration
    - Often used as an excuse to reduce search costs
#No pictured blind spot card yet 1031
BELOW_IS_AI_GENERATED
2023-09-05 01:15
This section describes the "trade-off between search and use" in reinforcement learning. By choosing a useful alternative, one may miss other possibilities, and by searching for new alternatives, one may not benefit from known useful alternatives. This concept has been applied in other areas such as organizational learning and information filtering.
The fragment "Revised Differences for the Fourth Printing" describes the trade-off between exploration and exploitation and is directly related to this note. Specifically, it states, "If you only choose the option that you think is best based on past experience, you will never find a better option. That is not enough exploration." This passage is consistent with the main theme of the note.
The trade-off between exploration and exploitation represents a balance between pursuing new possibilities and maximizing the benefits from known useful alternatives. This is an important consideration when choosing how to obtain information and learn.
["Revision differences for the fourth printing", "Proofreading for the English version (chapters 2 and 3)", "Utilization-exploitation tradeoff", "(2.2.3.1) Exploration-exploitation tradeoff", "Disabling 🌀nominalization", "Flow and utilization and exploration", " Intellectual production techniques for engineers All hierarchical table of contents", "(6.2.2.2) Advantages and disadvantages of framework"]
generated: 2023-09-05 01:15
This page is auto-translated from /nishio/利用と探索のトレードオフ using DeepL. If you find something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I'm very happy to spread my thoughts to non-Japanese readers.
Trade-offs between use and exploration: see Trade-offs between use and exploration