The workflow incorporating compare-results and analyze should also be testing coverage (pytest-cov) during the test run, and fail if the coverage percentage drops.
We should also have a summary screen whether it passes or fails, not just on fail, and should incorporate all permutations (fixed discovery warnings (rename current Discovery Warnings to "New Discovery Warnings"), fail to pass, skip to pass, xfail to pass and new tests.