Skip to content

TTA*: Test-Time A* Search — training-free, test-time tree search that boosts multistep reasoning in small language models (1–8B).

License

Notifications You must be signed in to change notification settings

AlexBraverman/TTASearch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Test-Time A* Search (TTA*)

Training-free test-time tree search to improve multistep reasoning in Small Language Models (SLMs)

TTA* casts reasoning as a goal-directed tree search where a single SLM iteratively generates, critiques, and refines candidate solutions. Inspired by A* search, TTA* balances path cost and a model-derived heuristic (self-evaluation) to prioritize expansions — improving correctness on math benchmarks without additional training or external teacher/reward models.


Highlights

  • Training-free: works as an inference-time wrapper — no fine-tuning or extra models required.
  • Robust self-reflection: averages multiple self-evaluations to stabilize noisy SLM critiques.
  • Practical: designed for 1–8B models (LLaMA, Qwen, etc.), enabling deployment on consumer GPUs.

Features

  • Implementation of the TTA* search loop: root generation → critique → self-evaluation → child generation → A* selection.
  • Configurable hyperparameters (temperature, children per expansion, A* weight w, max_iterations).
  • Full list of prompts

Datasets

Files

  • benchmark.py — for running benchmarks on models and datasets.
  • tta.py — implementation of our method.

About

TTA*: Test-Time A* Search — training-free, test-time tree search that boosts multistep reasoning in small language models (1–8B).

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages