We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我们在进行测试后,发现这样的评估方法和位置关系相当密切,把答案进行换位重评后的结果与先前有很大差异。 在使用GPT4等进行评估时,需要进行多次样本翻转来去除位置敏感性。
The text was updated successfully, but these errors were encountered:
您说的很对,我们当时太穷了,没有那么多资源来跑gpt4/chatgpt。如果可能的话,最好还是翻转位置关系,求平均。
Sorry, something went wrong.
想请问一下您为什么想到用ChatGPT做评估呢?考虑RW曾在额外的偏好数据上训练过,效果应当更好呀
No branches or pull requests
我们在进行测试后,发现这样的评估方法和位置关系相当密切,把答案进行换位重评后的结果与先前有很大差异。
在使用GPT4等进行评估时,需要进行多次样本翻转来去除位置敏感性。
The text was updated successfully, but these errors were encountered: