Opik integration for manual annotation of LLM judge #134
jacobthebanana wants to merge 8 commits into main
Jacob-Chmura
left a comment
Presumably we'll need to run `uv add opik`.
I have not used this framework before, but the integration seems reasonable. I will read more about it.
Tests are fixed. Ready for review.
# Log to Opik if provided.
for _dataset_idx, (dataset, stats) in enumerate(zip(datasets, results)):
    for _sample_idx, sample in enumerate(dataset.samples):
        if opik_client is not None:
Nit, but the `opik_client is not None` check can go outside the loop.
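A minimal sketch of what hoisting the check could look like, assuming the loop body only does Opik logging. `Dataset`, `RecordingClient`, its `log` method, and `log_samples` are hypothetical stand-ins for illustration; they are not the PR's actual types and not the Opik SDK's API.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class Dataset:
    """Hypothetical stand-in for the PR's dataset type."""

    samples: list = field(default_factory=list)


class RecordingClient:
    """Hypothetical stand-in for an Opik client; it only records what it receives."""

    def __init__(self) -> None:
        self.logged: list = []

    def log(self, dataset_idx: int, sample_idx: int, sample: Any) -> None:
        # Assumed method name, not a real Opik call.
        self.logged.append((dataset_idx, sample_idx, sample))


def log_samples(
    datasets: list,
    results: list,
    opik_client: Optional[RecordingClient] = None,
) -> int:
    """Log every sample if a client is provided; return how many were logged."""
    # Check once before iterating, instead of once per sample.
    if opik_client is None:
        return 0

    count = 0
    for dataset_idx, (dataset, _stats) in enumerate(zip(datasets, results)):
        for sample_idx, sample in enumerate(dataset.samples):
            opik_client.log(dataset_idx, sample_idx, sample)
            count += 1
    return count
```

With the guard at the top, the nested loops are skipped entirely when no client is configured, and the loop body loses one level of indentation.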
Jacob-Chmura
left a comment
Are there any other scripts missing here, e.g. ones that loop through and set the env variables? Given the results of our manual annotation trial, I think this merge may be postponed in favor of automated eval via Mechanical Turk etc. (for rebuttals). Will keep this open for the short term.
Jacob-Chmura
left a comment
Thanks for updating everything. I'm not sure if we'll merge this as is, but we can keep this branch for the time being.
Log LLM Judge perspective to Comet Opik for manual annotation.
New environment variables:
TODO: