"In the ever-evolving landscape of Natural Lan-
guage Generation (NLG) evaluation, a noteworthy
paradigm shift is underway as researchers increas-
ingly turn their attention towards fine-tuning open-
source language models (e.g., LLaMA), in lieu of
traditional closed-based LLMs like ChatGPT and
GPT-4. This transformative shift is propelled by a
thorough exploration of key perspectives, including
the expenses associated with API calls, the robust-
ness of prompting, and the pivotal consideration of
domain adaptability."<p>This paper was written by an LLM. Probably Claude-3.