Fine-Tuning Language Models From Human Preferences

This post is a brief summary of the paper Fine-Tuning Language Models from Human Preferences (Ziegler et al., arXiv 2020), which I read for study and out of curiosity.
Tags: LLM, Feedback, Reward