The page "reinforcement learning from human feedback" does not exist in ja language.
Try English version