I'm Hyunyoung2

Reinforcing Large Language Model Perforemance through Retrieval-Augmented Generation with Multiple Partitions

M-RAG

Posted on August 20, 2024

This post is a brief summary about the paper that I read for my study and curiosity, so I shortly arrange the content of the paper, titled M-RAG: Reinforcing Large Language Model Perforemance through Retrieval-Augmented Generation with Multiple Partitions (Wang et al., ACL 2024), that I read and studied. [Read More]

Tags: LLM, Reward, RAG

Self-Alignment with Instruction Backtranslation

Self-Alignment

Posted on August 6, 2024

This post is a brief summary about the paper that I read for my study and curiosity, so I shortly arrange the content of the paper, titled Self-Alignment with Instruction Backtranslation (Li et al., ICLR 2024), that I read and studied. [Read More]

Tags: LLM, Feedback, Reward

Self-Rewarding Language Models

Self-Rewarding

Posted on August 5, 2024

This post is a brief summary about the paper that I read for my study and curiosity, so I shortly arrange the content of the paper, titled Self-Rewarding Language Models (Yuan et al., arXiv 2024), that I read and studied. [Read More]

Tags: LLM, Feedback, Reward

Meta-Rewarding Language Models - Self-Improving Alignment with LLM-as-a-Meta-Judge

Meta-Rewarding

Posted on August 5, 2024

This post is a brief summary about the paper that I read for my study and curiosity, so I shortly arrange the content of the paper, titled Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge (Wu et al., arXiv 2024), that I read and studied. [Read More]

Tags: LLM, Feedback, Reward

Length-Controlled AlpacaEval - A Simple Way to Debias Automatic Evaluators

alpacaeval 2.0

Posted on July 20, 2024

This post is a brief summary about the paper that I read for my study and curiosity, so I shortly arrange the content of the paper, titled Length-Controleed AlpacaEval: A Simple Way to Debias Automatic Evaluators (Dubois et al., arXiv 2024), that I read and studied. [Read More]

Tags: LLM, LLMEval, Reward