MLGdańsk #137 – Overview of Reinforcement Learning from Human Feedback

Serdecznie zapraszamy na 137. spotkanie MLGdańsk – Poniedziałek, 6 marca 2023 o godzinie 18.00 CET.

Spotkanie w języku angielskim. Szczegóły poniżej.

link do spotkania:
https://meet.jit.si/MLGdansk_06032023_nb137

Prelegentem będzie:
Nikita Pavlichenko
Toloka.AI

Temat spotkania:
“Overview of Reinforcement Learning from Human Feedback

Nikita Pavlichenko is a Research Scientist at Toloka, where he works on connecting deep learning models with the crowdsourcing and human evaluation of generative models.

Opis prelekcji:
Recent progress on Large Language Models introduced a new machine learning technique: reinforcement learning from human feedback (RLHF). RLHF powers models such as InstructGPT and ChatGPT.

In this talk, we are going to provide an introduction to why this technique is necessary, the basics of reinforcement learning, why RL needs human feedback, and how to solve practical tasks with RLHF.

Serdecznie zapraszamy – spotkanie otwarte dla wszystkich zainteresowanych!