FormaliSE 2026
Sun 12 - Mon 13 April 2026 Rio de Janeiro, Brazil
co-located with ICSE 2026

While most neural network verification tools are evaluated on image-based benchmarks, verifying models trained on preprocessed data that bears a complex, nonlinear relationship to the raw input poses additional challenges. We take speech recognition as an example, where spectrogram-based preprocessing obscures the correspondence between waveform perturbations and model inputs. We consider two convolutional neural networks for spoken digit classification, one operating directly on raw waveform inputs and the other on Mel spectrogram representations, each adversarially trained with Projected Gradient Descent (PGD) for varying perturbation budgets $\epsilon$. The models are evaluated for robustness using three state-of-the-art neural network verifiers: $\alpha,\beta$-CROWN, Marabou, and NeuralSAT. Results show that $\alpha,\beta$-CROWN outperforms the other verifiers; however, verification success declines significantly as $\epsilon$ increases, from approximately 94% of samples verified at $\epsilon = 0$ to only 21% at $\epsilon = 0.02$. Our case study highlights several practical challenges in neural network verification for this domain. By documenting these challenges, this work guides the development of verification-friendly neural networks and preprocessing design for audio and other domains that involve complex data preprocessing.
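To make the PGD adversarial setting concrete, the following is a minimal NumPy sketch of an $L_\infty$-bounded PGD attack, not the authors' training code: it repeatedly steps an input in the gradient-sign direction and projects back into the $\epsilon$-ball around the original input. The toy logistic model, its gradient, and all parameter values here are illustrative assumptions.

```python
import numpy as np

def pgd_attack(x, y, grad_fn, epsilon, alpha, steps):
    """L-infinity PGD: ascend the loss via gradient-sign steps,
    projecting back into the epsilon ball around the clean input x."""
    x_adv = x.copy()
    for _ in range(steps):
        g = grad_fn(x_adv, y)                       # gradient of loss w.r.t. input
        x_adv = x_adv + alpha * np.sign(g)          # signed ascent step
        x_adv = np.clip(x_adv, x - epsilon, x + epsilon)  # project onto epsilon ball
    return x_adv

# Toy stand-in model (illustrative, not the paper's CNNs):
# f(x) = sigmoid(w . x), cross-entropy loss for label y in {0, 1}.
w = np.array([1.0, -2.0, 0.5])

def grad_fn(x, y):
    p = 1.0 / (1.0 + np.exp(-w @ x))
    return (p - y) * w  # d(loss)/dx for logistic cross-entropy

x = np.array([0.2, 0.1, -0.3])
x_adv = pgd_attack(x, y=1.0, grad_fn=grad_fn, epsilon=0.02, alpha=0.01, steps=5)
print(np.max(np.abs(x_adv - x)))  # perturbation stays within epsilon
```

During adversarial training, each minibatch is perturbed this way before the gradient update, so the network learns to classify correctly inside the entire $\epsilon$-ball, which is the same property the verifiers are later asked to prove.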