Welcome! This is the official implementation of our ICCV 2025 paper, "Enhancing Robustness in CLIP-Based Zero-Shot Classification via Test-Time Reasoning (TTR)."
TL;DR: We observe that CLIP makes accurate predictions in settings free of spurious features, and that it learns semantically meaningful representations. Motivated by these observations, we propose Test-Time Reasoning (TTR), a simple yet effective method that improves robustness by identifying and removing irrelevant (spurious) features at inference time. TTR requires no additional training or spurious-feature annotations and significantly enhances model robustness in zero-shot classification.
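To make the idea concrete, here is a toy sketch of removing a spurious direction from an image embedding before CLIP-style zero-shot classification. This is not the paper's implementation: the function names, the 3-D embeddings, and the "background" direction are all made up for illustration.

```python
import numpy as np

def l2_normalize(v):
    # Normalize along the last axis (works for a single vector or a batch).
    return v / np.linalg.norm(v, axis=-1, keepdims=True)

def zero_shot_predict(image_emb, class_text_embs):
    # CLIP-style zero-shot: pick the class whose text embedding has the
    # highest cosine similarity with the image embedding.
    sims = l2_normalize(class_text_embs) @ l2_normalize(image_emb)
    return int(np.argmax(sims))

def remove_spurious(image_emb, spurious_dir):
    # Project the spurious direction (e.g. background) out of the embedding.
    d = l2_normalize(spurious_dir)
    return image_emb - (image_emb @ d) * d

# Toy 3-D setup: two orthogonal "class" text embeddings and a spurious
# "background" direction that happens to correlate with class 1.
class_text_embs = np.array([[1.0, 0.0, 0.0],
                            [0.0, 1.0, 0.0]])
spurious = np.array([0.0, 0.6, 0.8])
# An image whose true signal matches class 0 but is dominated by the cue.
image = 0.5 * class_text_embs[0] + 2.0 * spurious

print(zero_shot_predict(image, class_text_embs))  # -> 1 (fooled by the cue)
print(zero_shot_predict(remove_spurious(image, spurious), class_text_embs))  # -> 0
```

Removing the spurious direction flips the prediction back to the correct class, which is the intuition behind test-time removal of irrelevant features.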
- Python 3.12
- PyTorch (installation guide)
- OpenCLIP
Clone the repository and install the required dependencies:
conda env create -f environment.yaml --name newenv
conda activate newenv

We provide Jupyter notebooks to ensure transparency and easy reproducibility. Each notebook directly corresponds to specific experimental results presented in our paper.
To reproduce the main results from Table 1, run the notebooks:
OV.ipynb
Core blk.ipynb
Core Wht.ipynb
- OV.ipynb – Baseline evaluation on the Original Variant dataset.
- Core blk.ipynb – Evaluation with a uniformly black background.
- Core Wht.ipynb – Evaluation with a uniformly white background.
To reproduce the results from Table 2, run the following notebooks (each corresponding to a method presented in the paper):
cd Ablation Introduction
GDINO.ipynb
SAM+GDINO.ipynb
PCA+K-means.ipynb
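As a rough illustration of the PCA+K-means variant, the sketch below reduces a set of patch features with PCA and clusters them with K-means to separate two groups (e.g. object vs. background patches). The data is synthetic and the pipeline is a generic assumption about how such a baseline works, not the notebook's actual code.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
# Toy "patch embeddings": two well-separated groups in 64-D,
# standing in for object patches vs. background patches.
fg = rng.normal(loc=3.0, size=(50, 64))
bg = rng.normal(loc=-3.0, size=(50, 64))
patches = np.vstack([fg, bg])

# Compress the features, then cluster in the reduced space.
reduced = PCA(n_components=8).fit_transform(patches)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(reduced)
# Each patch is now assigned to one of two groups.
```

Once patches are grouped, the cluster judged irrelevant can be discarded before classification.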
To reproduce the results from Table 3, run the following notebooks. Each notebook also displays the reasoning results.
Waterbirds.ipynb
CelebA.ipynb
Metashifts.ipynb
Urbancars.ipynb
To visualize the attention maps, run the following notebook:
Attention-TTR.ipynb

This code is adapted from Transformer-MM-Explainability.
If you find this work interesting, please consider citing our paper.
