We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
For the qa_prompt:
qa_prompt
Q: What is my office's zip code? Options: A) 94107 B) 94106 C) cheesecake D) Insufficient information to answer this question E) -8
Q: What is my office's zip code?
Options: A) 94107 B) 94106 C) cheesecake D) Insufficient information to answer this question E) -8
And the qa_answer (where correct is A):
qa_answer
the answer is 94106 or 94107
The LLM evaluation says:
The single letter answer is B.
Which gets converted to "T", and declares it as LitQAEvaluation.INCORRECT.
"T"
LitQAEvaluation.INCORRECT
Even though LitQAEvaluation.INCORRECT is actually the right output, we got it right here for the wrong reasons.
The text was updated successfully, but these errors were encountered:
LitQAEvaluation
No branches or pull requests
For the
qa_prompt
:And the
qa_answer
(where correct is A):The LLM evaluation says:
Which gets converted to
"T"
, and declares it asLitQAEvaluation.INCORRECT
.Even though
LitQAEvaluation.INCORRECT
is actually the right output, we got it right here for the wrong reasons.The text was updated successfully, but these errors were encountered: