Hacker News
Reinforcement Learning from Human Feedback (rlhfbook.com)
131 points by onurkanbkrc 4 days ago | 5 comments




Last time I saw Nathan say something about the book, he was actively working on the next version and looking for feedback. Check his socials.

You could say he's also learning from human feedback

Related. Others?

RLHF Book - https://news.ycombinator.com/item?id=42902936 - Feb 2025 (37 comments)


Web version with links, etc:

https://rlhfbook.com/


Thanks! We've switched to that above from https://arxiv.org/abs/2504.12501, and put the latter in the toptext.


