Add GPT-OSS-20B benchmark results for EHRSQL dataset #57

Merged
rajna-fani merged 1 commit into main from gpt-oss-20b-benchmark on Oct 1, 2025
@rafiattrach
Owner

  • Add GPT-OSS-20B benchmark CSV with model answers extracted from conversations
  • Add 100 conversation JSON files (2.conversation.json to 101.conversation.json)
  • Reorganize Claude Sonnet 4 benchmark into separate folder
  • Update README with benchmark overview and structure
  • Include correct/incorrect annotations with detailed notes
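
For readers who want to reproduce the "model answers extracted from conversations" step, a minimal sketch follows. It is not the script used in this PR. The message schema (a JSON list of objects with `role` and `content` keys), the column names `id` and `model_answer`, and the helpers `conversation_id`, `final_answer`, and `build_csv` are all assumptions made for illustration; only the `N.conversation.json` file naming comes from the description above.

```python
import csv
import json
import re
from pathlib import Path


def conversation_id(path: Path) -> int:
    """Parse the leading number from a name like '2.conversation.json'."""
    match = re.fullmatch(r"(\d+)\.conversation\.json", path.name)
    if match is None:
        raise ValueError(f"unexpected file name: {path.name}")
    return int(match.group(1))


def final_answer(messages: list) -> str:
    """Return the last assistant message, or '' if there is none.

    Assumed (hypothetical) schema: a list of {"role": ..., "content": ...}.
    """
    for message in reversed(messages):
        if message.get("role") == "assistant":
            return str(message.get("content", "")).strip()
    return ""


def build_csv(conversation_dir: Path, out_path: Path) -> int:
    """Write one CSV row per conversation file; return the number of rows."""
    # Sort numerically so 10.conversation.json comes after 2.conversation.json.
    paths = sorted(conversation_dir.glob("*.conversation.json"), key=conversation_id)
    with out_path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["id", "model_answer"])  # hypothetical column names
        for path in paths:
            messages = json.loads(path.read_text(encoding="utf-8"))
            writer.writerow([conversation_id(path), final_answer(messages)])
    return len(paths)
```

Under these assumptions, `build_csv(Path("conversations"), Path("answers.csv"))` would produce one row per conversation file; the correct/incorrect annotations and notes would still be added by hand afterwards.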

@rajna-fani rajna-fani merged commit 267f2a0 into main Oct 1, 2025
3 checks passed
@rafiattrach rafiattrach deleted the gpt-oss-20b-benchmark branch October 1, 2025 13:48