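How does the evaluation work? Is it scored manually? For objective questions, is the returned answer string compared directly, and for subjective questions, are the answers judged by humans? #47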

Open
starplatinum3 opened this issue Jul 26, 2024 · 2 comments

Comments

@starplatinum3

No description provided.

@elmliu

elmliu commented Aug 16, 2024

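See the Evaluation Metrics part of the experiments section in the project paper. Closed-ended multiple-choice questions are scored by accuracy. For open-ended questions, GPT-4 judges which of two models gave the better answer in pairwise comparisons, and a win rate is computed for each model.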
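A minimal sketch of the two metrics described above, for illustration only. It is not the repository's evaluation script: the function names, the case-insensitive comparison of option letters, the tie handling, and `toy_judge` (a placeholder standing in for the GPT-4 call) are all assumptions.

```python
from collections import defaultdict
from itertools import combinations
from typing import Callable, Dict, List


def accuracy(predictions: List[str], gold: List[str]) -> float:
    """Fraction of multiple-choice predictions that match the gold option."""
    if len(predictions) != len(gold):
        raise ValueError("predictions and gold must have the same length")
    if not gold:
        return 0.0
    # Assumption: answers are option letters, compared case-insensitively.
    correct = sum(
        p.strip().upper() == g.strip().upper()
        for p, g in zip(predictions, gold)
    )
    return correct / len(gold)


def win_rates(
    answers: Dict[str, List[str]],
    judge: Callable[[str, str], str],
) -> Dict[str, float]:
    """Win rate of each model over all pairwise comparisons.

    `answers` maps a model name to its answers, one per question.
    `judge(a, b)` is a hypothetical stand-in for the GPT-4 judge and
    returns "A", "B", or "tie".
    """
    wins: Dict[str, int] = defaultdict(int)
    games: Dict[str, int] = defaultdict(int)
    for m1, m2 in combinations(sorted(answers), 2):
        for a, b in zip(answers[m1], answers[m2]):
            verdict = judge(a, b)
            games[m1] += 1
            games[m2] += 1
            if verdict == "A":
                wins[m1] += 1
            elif verdict == "B":
                wins[m2] += 1
            # Assumption: a tie counts as a comparison with no winner.
    return {m: (wins[m] / games[m] if games[m] else 0.0) for m in answers}


def toy_judge(a: str, b: str) -> str:
    """Placeholder judge: prefers the longer answer. Not a real quality metric."""
    if len(a) == len(b):
        return "tie"
    return "A" if len(a) > len(b) else "B"
```

In practice `toy_judge` would be replaced by a function that sends both answers to GPT-4 and parses its verdict; the prompt and the parsing are not shown in this thread, so they are left out here.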

@Cloud-Iris

The original paper describes this in detail, here:

image

The appendix also has a part on Evaluation Metrics. I think this issue can be closed.
