Reinforcement learning with human feedback (Q2571): Difference between revisions
(Created a new Item: Reinforcement learning with human feedback, RLHF)
(Changed [de] label: Reinforcement learning with human feedback)
(One intermediate revision by the same user not shown)
label / de: Reinforcement learning with human feedback
Latest revision as of 12:50, 13 October 2025
RLHF
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Reinforcement learning with human feedback | RLHF | |