LLM reward models represent powerful tools, but they're imperfect. Snorkel AI researcher Tom Walshe explains what happened in one Snorkel AI experiment, and why you should never fully trust LLM reward models.
#largelanguagemodels #ai #rewardmodels