(ITI) Inference-Time Intervenction- Eliciting Truthful answers from a Language Model

ITI

Posted on October 14, 2023

(ITI) Inference-Time Intervenction- Eliciting Truthful answers from a Language Model

ITI

Posted on October 14, 2023

This post is a brief summary about the paper that I read for my study and curiosity, so I shortly arrange the content of the paper, titled Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (Li et al., arXiv 2023), that I read and studied.

They propose adapting the truthfulness of LLMs using attetnion head.

Li et al. ArXiv 2023

ITI (Inference-time Intervention) is an alternative form of MHA, where:

Li et al. ArXiv 2023

But, ITI is supervised learning, they used the activation editing.

You can see the detailed empirical analysis and experiemtn in the paper, titled Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (Li et al., arXiv 2023)

For detailed experiment and explanation, refer to the paper, titled Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (Li et al., arXiv 2023)

Download URL:
The paper: Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (Li et al., arXiv 2023)

Reference

Paper
- NuerIPS Version: Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (Li et al., NeurIPS 2023)
- ArXiv Version: Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (Li et al., arXiv 2023)
How to use html for alert
- how to use icon
How to use MathJax
- MathJax basic tutorial and quick reference in StackExchange

Tags: LLM, Deconding, Factuality