Human trainers provide conversations and rank the responses. These reward models help determine the best answers. To keep training the chatbot, users can upvote or downvote its response by clicking the thumbs-up or thumbs-down icons beside the answer. Users can also provide additional written feedback to further improve and fine-tune