
Generate repro test errors when model does not run successfully#173

Merged
jo-basevi merged 5 commits into main from 170-raise-error-for-model-did-not-run
Dec 11, 2025
Conversation

@jo-basevi
Contributor

This PR:

  • Adds more stdout/stderr logs to exception messages so they are displayed as annotations in the PR workflow logs, making it easier to find out what went wrong
  • Moves the check that the experiments ran without errors into a setup fixture rather than the test body. Any exceptions are then reported as test setup errors rather than test failures, which should make it clearer when a test failure is a genuine repro failure rather than the model failing to run.

Closes #170
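The fixture-based check described above can be sketched roughly as follows. This is a minimal illustration, not the actual code from this PR: the names `ModelRunError`, `run_model`, and `model_output` are hypothetical stand-ins for the real helpers in model-config-tests.

```python
import subprocess
import sys

import pytest


class ModelRunError(RuntimeError):
    """Model exited with a non-zero status; message carries stdout/stderr."""


def run_model(cmd):
    """Run the model command, raising with full logs on a non-zero exit."""
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode != 0:
        raise ModelRunError(
            f"Model run failed with exit status {result.returncode}\n"
            f"--- stdout ---\n{result.stdout}"
            f"--- stderr ---\n{result.stderr}"
        )
    return result


@pytest.fixture(scope="class")
def model_output():
    # An exception raised here is reported by pytest as a test ERROR
    # (setup failure), not a FAILED reproducibility test.
    return run_model([sys.executable, "-c", "print('checksums')"])


class TestBitReproducibility:
    def test_repro(self, model_output):
        # The test body only compares output; run errors never reach it.
        assert "checksums" in model_output.stdout
```

Because pytest distinguishes errors raised during fixture setup (reported as ERROR) from assertion failures in the test body (reported as FAILED), a broken model run no longer looks like a reproducibility regression.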

@codecov

codecov bot commented Jul 29, 2025

Codecov Report

❌ Patch coverage is 84.21053% with 6 lines in your changes missing coverage. Please review.
✅ Project coverage is 85.69%. Comparing base (b881d91) to head (9432980).
⚠️ Report is 2 commits behind head on main.

Files with missing lines Patch % Lines
...fig_tests/config_tests/test_bit_reproducibility.py 78.94% 4 Missing ⚠️
src/model_config_tests/exp_test_helper.py 89.47% 2 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##             main     #173       +/-   ##
===========================================
+ Coverage   68.39%   85.69%   +17.29%     
===========================================
  Files          14       22        +8     
  Lines         829     1244      +415     
===========================================
+ Hits          567     1066      +499     
+ Misses        262      178       -84     


… to exceptions

Add stdout and stderr to model run exceptions so they are displayed in the test output on GitHub
…ixture

This means any errors from a non-zero exit status in model runs are reported as test setup errors rather than reproducibility test failures
@jo-basevi force-pushed the 170-raise-error-for-model-did-not-run branch from 5fed4da to 5a53d1a on December 2, 2025 23:52
@jo-basevi
Contributor Author

Rebased changes onto main and resolved conflicts.

@jo-basevi
Contributor Author

The latest version of pytest-cov dropped support for subprocess measurement by default (see https://pytest-cov.readthedocs.io/en/latest/subprocess-support.html). As a number of this package's own tests invoke the tests via subprocess, that Python code coverage wasn't being measured. Added coverage measurement for the subprocess calls back in 9432980
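Per the coverage.py and pytest-cov documentation, measuring coverage inside a subprocess requires the child to call `coverage.process_startup()` early (typically via a `.pth` file), which only activates when the `COVERAGE_PROCESS_START` environment variable names a coverage configuration file. A rough sketch of how a test might set this up when spawning pytest in a subprocess (the helper names here are hypothetical, not the ones used in commit 9432980):

```python
import os
import subprocess
import sys


def coverage_env(coverage_rc=".coveragerc"):
    """Build a child-process environment with subprocess coverage enabled.

    coverage.process_startup() in the child only starts measurement when
    COVERAGE_PROCESS_START points at a coverage configuration file.
    """
    env = os.environ.copy()
    env["COVERAGE_PROCESS_START"] = coverage_rc
    return env


def run_tests_in_subprocess(test_path):
    # Hypothetical helper: invoke pytest in a subprocess while keeping
    # the parent's coverage session able to see the child's measurements.
    return subprocess.run(
        [sys.executable, "-m", "pytest", test_path],
        env=coverage_env(),
        capture_output=True,
        text=True,
    )
```

The child also needs `parallel = true` (or similar) in the coverage config so each process writes its own data file, which `coverage combine` can later merge.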

Member

@CodeGat left a comment


We ran through this on a call, looks good!

@jo-basevi merged commit cb6a51e into main Dec 11, 2025
8 checks passed


Development

Successfully merging this pull request may close these issues.

Raise error in repro tests if model did not run

2 participants