Currently, the tests are done on a (very small) collection of ZX-diagrams drawn from arxiv:2012.13966. This approach needs to be replaced by automated testing on a representative collection of ZX-diagrams.
Might be useful to complete #1 to make this simpler.