Discussion about this post

Daniel Popescu / ⧉ Pluralisk

This article comes at the perfect time. The issues you highlight with pre-trained LLMs and objective misalignment are crucial. Thanks for this excellent, detailed guide on RLHF; it's truly important work.
