feat: Update SS with evals for Grade 5-12 #5

adnanrhussain · 2026-01-10T00:52:33Z

This PR extends the Sentence Structure Evaluator (notebook) to support additional grades (5 to 12) beyond the existing grades (3 & 4).

Status: This PR is under internal review and testing and is not yet ready for release.

gary-mu

Love this integration of grades 5-12.
Two comments:

Can we also add Ariena as a reviewer? She's not a member of this repo, so I can't add her.
Can we add test results? I think running the test passages and pasting the results would be sufficient.

aychi1 · 2026-01-16T00:14:12Z

Overall LGTM. Two small callouts:

Some fields are duplicated between the Grades 3-4 and 5-12 evals (e.g. num_compound vs. num_compound_sentences, perc_simple_sentences vs. perc_simple).
The prompt definition of a simple sentence is not exactly the same in Noah's version and Wayne's version, and this isn't captured in the PR. Noah's version specifically mentions that simple sentences with relative clauses still count as simple. It likely won't make a big difference, but I'm documenting here that this is a known departure.
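
For illustration, a minimal sketch of how the overlapping fields called out above could be surfaced automatically. Only the four field names come from this PR. The description strings, the dict-based schema representation, and the helper `find_duplicate_definitions` are hypothetical stand-ins, not the notebook's actual Pydantic models.

```python
from collections import defaultdict

# Hypothetical field -> description maps. The field names are the ones
# mentioned in this review; the description text is assumed for illustration.
GRADES_3_4_FIELDS = {
    "num_compound_sentences": "Number of compound sentences in the passage.",
    "perc_simple_sentences": "Percentage of sentences that are simple.",
}
GRADES_5_12_FIELDS = {
    "num_compound": "Number of compound sentences in the passage.",
    "perc_simple": "Percentage of sentences that are simple.",
}


def find_duplicate_definitions(*schemas):
    """Group field names whose whitespace- and case-normalized descriptions match."""
    by_description = defaultdict(list)
    for schema in schemas:
        for name, description in schema.items():
            key = " ".join(description.lower().split())
            by_description[key].append(name)
    # Keep only descriptions shared by more than one field name.
    return [sorted(names) for names in by_description.values() if len(names) > 1]


duplicates = find_duplicate_definitions(GRADES_3_4_FIELDS, GRADES_5_12_FIELDS)
for group in duplicates:
    print(" / ".join(group))
```

A check like this only flags fields with identical wording. Fields that mean the same thing but are described differently would still need a manual pass.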

aychi1

Overall LGTM. Commented on some non-blocking nits.

aychi1 · 2026-01-16T00:00:38Z

evals/sentence_structure_evaluator.ipynb

    "        description=\"Max number of clauses (independent + subordinate) found in a single sentence.\"\n",
    "    )\n",
+    "    # (Grades 5-12)\n",
+    "    num_compound: int = Field(\n",


Just curious, is there a reason why we have both num_compound_sentences (line 173) and num_compound (line 259)? Asking because I noticed that the definition is the same across the Grades 3-4 and 5-12 evals.

On a similar note, the definition of a simple sentence is slightly different between Noah's prompt and Wayne's prompt. It likely does not make a big difference, but I want to call out that we've never tested it.

I think it might be good to keep the prompt for each version as untouched as possible, since we know LLM output is sensitive to prompt changes.
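
One possible way to guard against accidental prompt edits, sketched under assumptions: pin a hash of each prompt and compare against it before running an eval. The prompt text below and the helpers `prompt_fingerprint` and `prompt_unchanged` are hypothetical and are not taken from the notebook.

```python
import hashlib


def prompt_fingerprint(prompt: str) -> str:
    """Return a SHA-256 hex digest of the prompt, ignoring line-ending style."""
    normalized = prompt.replace("\r\n", "\n")
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def prompt_unchanged(prompt: str, pinned: str) -> bool:
    """True if the prompt still matches the fingerprint recorded earlier."""
    return prompt_fingerprint(prompt) == pinned


# Hypothetical prompt text standing in for one version's real prompt.
GRADES_3_4_PROMPT = "A simple sentence has exactly one independent clause.\n"

# Recorded once, when the prompt is known to match the tested version.
PINNED_FINGERPRINT = prompt_fingerprint(GRADES_3_4_PROMPT)

edited_prompt = GRADES_3_4_PROMPT + "Relative clauses still count as simple.\n"
print(prompt_unchanged(GRADES_3_4_PROMPT, PINNED_FINGERPRINT))
print(prompt_unchanged(edited_prompt, PINNED_FINGERPRINT))
```

The check is deliberately strict: any wording change alters the fingerprint, which fits the concern that even small prompt edits can shift LLM output.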

feat: Update SS with evals for Grade 5-12

58107a9

adnanrhussain requested a review from gary-mu January 10, 2026 00:52

gary-mu reviewed Jan 12, 2026

adnanrhussain requested a review from aychi1 January 13, 2026 20:30

aychi1 approved these changes Jan 16, 2026

gary-mu Jan 20, 2026
