BASHIRI: Learning Failure Oracles from Test Suites

BASHIRI is an approach to infer failure oracles from a small test suite automatically.

Abstract

Program fixes must preserve passing tests while fixing failing ones. Validating these properties requires test oracles that distinguish passing from failing runs.

We introduce BASHIRI, an approach that learns failure oracles from test suites with labeled outcomes using execution features. BASHIRI leverages execution-feature-driven debugging to collect program execution features and trains interpretable models as testing oracles.

Our evaluation shows that BASHIRI predicts test outcomes with 95% accuracy, effectively identifying failing runs.

BASHIRI is available as an open-source tool at https://github.com/smythi93/bashiri

A demonstration video is available at https://youtu.be/D2mJkCtSXtM

Installation

To install BASHIRI and its dependencies, run the following command:

python -m pip install .

Usage

For BASHIRI, you need to instrument your subject.

instrument("middle.py", "tmp.py", "middle.json")

Next, you need some tests to execute and collect their event traces. We provide two collectors, one for unit tests and one for input to the program. However, implementing another collector by inheriting the base class EventCollector and implementing its collect() method is an option. To employ the collector, use it like this:

oracle = Bashiri(
    "middle.py",
    (passing, failing),
    access="tmp.py",
    mapping="middle.json",
    work_dir=".",
)
oracle.learn()

Now, we can leverage the collector, handle, and oracle to identify the result of an unseen execution/test case with high accuracy.

result = oracle.predict(unseen_test)

Mapping

The mapping approach infers connections between the events if different versions of a program, e.g., a faulty and a fixed version. This mapping allows to continuously apply the learned oracles.

patch = PatchTranslator.build_t4p_translator(project)
creator = MappingCreator(mapping_bug)
mapping = creator.create(mapping_fix, patch)
translation_mapping = EventMapping(
    mapping_bug.mapping,
    translation=mapping.get_translation(),
    alternative_mapping=mapping_fix.mapping,
)

Human-In-The-Loop

We have additionally implemented a mechanism to incorporate a human oracle to enrich insufficient test sets in the HumanOracleRefinement class.

seeds = {i: s, for i, s in enumerate(passing + failing)}
refinement = HumanOracleRefinement(oracle, seeds, collector)
refinement.run()

Example

We have provided a short example in the example.ipynb notebook. This notebook demonstrates how to use BASHIRI with a simple example, including instrumentation, test collection, and oracle learning.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
evaluation		evaluation
src/bashiri		src/bashiri
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
example.ipynb		example.ipynb
middle.json		middle.json
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BASHIRI: Learning Failure Oracles from Test Suites

Abstract

Installation

Usage

Mapping

Human-In-The-Loop

Example

License

About

Uh oh!

Releases

Packages

Languages

License

smythi93/bashiri

Folders and files

Latest commit

History

Repository files navigation

BASHIRI: Learning Failure Oracles from Test Suites

Abstract

Installation

Usage

Mapping

Human-In-The-Loop

Example

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages