A Python toolkit for physical reasoning in LLMs and VLMs. This toolkit streamlines access to various physical reasoning datasets/benchmarks and provides a unified interface to different LLM/VLM providers, enabling researchers and developers to analyze, evaluate, and build applications in the physics reasoning domain.
-
Updated
Feb 14, 2026 - Python