Test Data Selection by Failure Coverage (FormaliSE 2026 - Research Track)

Who

Amani Ayad, Ali Mili

Track

FormaliSE 2026 Research Track

Time Zone

The program is currently displayed in (GMT-03:00) Brasilia, Distrito Federal, Brazil.

Use conference time zone: (GMT-03:00) Brasilia, Distrito Federal, BrazilSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 13 Apr 2026 12:15 - 12:30 at Oceania VIII - Session 4: Automated Reasoning, and Program Analysis

Abstract

Statement coverage is a perfect measure of test suite effectiveness, but only if the purpose of a test suite is to cover statements. Likewise, branch coverage, condition coverage, line coverage, mutation coverage, etc. are all excellent measures of test suite effectiveness, if the purpose of a test suite is, respectively, to cover branches, cover conditions, cover lines, or kill mutants. But if we posit, as we do in this paper, that the purpose of a test suite is to expose the failures of an incorrect program (or, equivalently, to give us confidence in the correctness of a correct program), then we ought to equate the effectiveness of a test suite with its ability to expose failures. In this paper we consider a failure based definition of test suite effectiveness, which we call failure coverage, and we analyze its relationship to traditional criteria for test suite adequacy; unlike all existing measures of test suite effectiveness, failure coverage is not a number but an element of a partially ordered set, which is fitting, given that the relation of being a more effective test suite is itself a partial ordering. Not surprisingly, we find that traditional coverage metrics bear little statistical correlation to failure coverage; but we also find that it is possible to combine them to gain a better, albeit still insufficient, approximation of failure coverage. We sketch a research agenda that aims to estimate failure coverage. To the extent that it is adopted and gains acceptance, the measure of failure coverage can be used to generate novel/ original test adequacy criteria.

Amani Ayad

Mount Saint Vincent University

Ali Mili

NJIT

United States