Skip to content

Release v0.9.0: CLI improvements, single_hop rename, and custom schema demo#198

Merged
sumukshashidhar merged 7 commits intomainfrom
version-bump
Dec 29, 2025
Merged

Release v0.9.0: CLI improvements, single_hop rename, and custom schema demo#198
sumukshashidhar merged 7 commits intomainfrom
version-bump

Conversation

@sumukshashidhar
Copy link
Collaborator

Summary

This PR bumps YourBench to version 0.9.0 with significant CLI improvements and new features.

Changes

Version Bump

  • Bumped version to 0.9.0

CLI Improvements

  • Added Rich-based CLI with progress tracking
  • New commands: yourbench stages, yourbench validate, yourbench estimate, yourbench init
  • Token estimation before running pipelines using tiktoken

Naming Convention

  • Renamed single_shot to single_hop throughout the codebase for consistency
  • Updated all prompts, configs, docs, and tests

New Example

  • Added custom_schema_demo example showing Pydantic output control
  • Includes technical_qa.py and educational_assessment.py schemas
  • Demonstrates custom fields: difficulty, prerequisites, key_concepts, bloom_level

Files Changed

  • 42 files changed
  • New: yourbench/utils/token_estimation.py
  • Renamed: single_shot.pysingle_hop.py
  • New example: example/custom_schema_demo/

Testing

  • All 91 tests pass
  • Pipeline tested with multiple examples
  • Custom schema demo verified on HuggingFace Hub

Datasets Created (for testing)

  • yourbench-testing/yourbench-cli-test-1
  • yourbench-testing/yourbench-cli-test-harry-potter
  • yourbench-testing/yourbench-cli-test-custom-prompts
  • yourbench-testing/yourbench-custom-schema-demo

- Add beautiful banner and Rich tables for output
- Add `yourbench validate` command to check configs without running
- Add `yourbench init` for interactive config generation
- Add `yourbench stages` to list available pipeline stages
- Add progress bars with timing for pipeline execution
- Add `-q/--quiet` and `--no-banner` flags to run command
- Show stage timing summary after pipeline completion
- Rename single_shot to single_hop throughout codebase for clarity
- Add 'yourbench estimate' command to estimate token usage before running
- Add yourbench/utils/token_estimation.py with tiktoken-based estimation
- Update all configs, docs, and tests to use single_hop terminology
@sumukshashidhar sumukshashidhar merged commit 4680764 into main Dec 29, 2025
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant