This post is a brief summary of a paper I read out of study and curiosity: Search-R1 - Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning (Jin et al. arXiv 2025).

The following is an example of the prompt used for reinforcement learning in Search-R1.

Jin et al. arXiv 2025

For detailed experiments and explanations, refer to the paper, Search-R1 - Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning (Jin et al. arXiv 2025).

Reference