Add synthetic convergence benchmark test #638

Draft
Borda wants to merge 8 commits into develop from ci/benchmark

Conversation

Borda (Member) commented Feb 4, 2026

This pull request adds a new benchmark test to ensure that training on a synthetic dataset measurably improves model performance: mean average precision (mAP@50) should rise and validation loss should fall. The test generates a synthetic COCO-style dataset, evaluates the model before and after training, and asserts both improvements.

New synthetic benchmark test:

  • Added a slow-running test test_synthetic_training_improves_map50 to tests/benchmarks/test_synthetic_convergence.py that:
    • Generates a synthetic COCO-format dataset with basic shapes.
    • Evaluates an untrained RFDETRNano model's baseline mAP@50 and validation loss.
    • Trains the model for 2 epochs on the synthetic data.
    • Evaluates the trained model and asserts that mAP@50 increases and validation loss decreases.
    • Saves diagnostic results to synthetic_benchmark.json.

This commit introduces a new test case to benchmark synthetic dataset training convergence, verifying improvements in mAP@50 and validation loss after training.
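
The check itself reduces to a before/after comparison. Below is a minimal, framework-agnostic sketch of that pattern, not the PR's actual code: `run_convergence_check`, `train_fn`, and `eval_fn` are hypothetical names standing in for the real RF-DETR training and evaluation calls, and the JSON layout only mirrors the idea of `synthetic_benchmark.json`.

```python
import json
from pathlib import Path
from typing import Callable, Tuple

Metrics = Tuple[float, float]  # (mAP@50, validation loss)


def run_convergence_check(
    train_fn: Callable[[], None],
    eval_fn: Callable[[], Metrics],
    out_path: Path,
) -> None:
    # Baseline metrics from the untrained model.
    base_map50, base_loss = eval_fn()

    # Short training run (2 epochs in the PR description).
    train_fn()

    # Metrics after training.
    new_map50, new_loss = eval_fn()

    # Persist diagnostics so a CI failure is debuggable.
    out_path.write_text(json.dumps({
        "baseline": {"map50": base_map50, "val_loss": base_loss},
        "trained": {"map50": new_map50, "val_loss": new_loss},
    }, indent=2))

    assert new_map50 > base_map50, (
        f"map50 {new_map50:.4f} did not improve over baseline {base_map50:.4f}"
    )
    assert new_loss < base_loss, (
        f"val_loss {new_loss:.4f} did not decrease from baseline {base_loss:.4f}"
    )
```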
codecov bot commented Feb 4, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 54%. Comparing base (2b3f550) to head (a81ea42).

Additional details and impacted files
@@           Coverage Diff            @@
##           develop   #638     +/-   ##
========================================
+ Coverage       25%    54%    +29%     
========================================
  Files           47     47             
  Lines         6342   6342             
========================================
+ Hits          1577   3400   +1823     
+ Misses        4765   2942   -1823     

Borda added 7 commits February 5, 2026 00:07
- Use `tqdm.auto` for better progress bar compatibility.
- Extract synthetic dataset generation logic into a reusable pytest fixture (a sketch follows below).
- Simplify benchmark test by integrating the fixture.
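
A reusable fixture along those lines could look like the sketch below, assuming PIL is available. The fixture name `synthetic_shape_dataset_dir` is taken from a later commit in this PR; the image size, the shape-drawing logic, and the Roboflow-style `_annotations.coco.json` layout are assumptions, not the PR's actual code.

```python
import json
import random
from pathlib import Path

import pytest
from PIL import Image, ImageDraw


@pytest.fixture
def synthetic_shape_dataset_dir(tmp_path_factory) -> Path:
    """Build a tiny COCO-style dataset of white rectangles on black backgrounds."""
    root = tmp_path_factory.mktemp("synthetic_coco")
    img_dir = root / "train"
    img_dir.mkdir()
    images, annotations = [], []
    for img_id in range(8):
        img = Image.new("RGB", (256, 256), "black")
        draw = ImageDraw.Draw(img)
        # One rectangle per image; COCO bboxes are [x, y, width, height].
        x, y = random.randint(0, 160), random.randint(0, 160)
        w, h = random.randint(32, 96), random.randint(32, 96)
        draw.rectangle([x, y, x + w, y + h], fill="white")
        name = f"{img_id:06d}.jpg"
        img.save(img_dir / name)
        images.append({"id": img_id, "file_name": name, "width": 256, "height": 256})
        annotations.append({"id": img_id, "image_id": img_id, "category_id": 1,
                            "bbox": [x, y, w, h], "area": w * h, "iscrowd": 0})
    coco = {"images": images, "annotations": annotations,
            "categories": [{"id": 1, "name": "rectangle"}]}
    (img_dir / "_annotations.coco.json").write_text(json.dumps(coco))
    return root
```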
…rgence test

- Introduce `synthetic_color_dataset_dir` for color-based dataset generation.
- Rename `synthetic_dataset_dir` to `synthetic_shape_dataset_dir` for clarity.
- Refine synthetic convergence test with stricter assertions on mAP@50 and validation loss.
…vergence test configuration

- Delete `synthetic_color_dataset_dir` fixture as it is no longer used.
- Update model initialization to disable pretrain weights.
- Increase training epochs in synthetic convergence test.
- Relax mAP@50 assertion threshold and refine validation loss message.
…eneration

- Enable control over the minimum and maximum number of objects per image (see the sketch below).
- Update `generate_synthetic_sample` calls and docstrings accordingly.
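
A plausible signature for that generator is sketched below; only the `generate_synthetic_sample` name and the min/max-objects knobs are grounded in the commit message, and everything else is illustrative.

```python
import random
from typing import Dict, List, Tuple


def generate_synthetic_sample(
    size: Tuple[int, int] = (256, 256),
    min_objects: int = 1,
    max_objects: int = 3,
) -> List[Dict]:
    """Return COCO-style annotation dicts for one image with a bounded object count."""
    count = random.randint(min_objects, max_objects)
    boxes = []
    for _ in range(count):
        # Random box sizes, clamped so every box fits inside the image.
        w, h = random.randint(24, 96), random.randint(24, 96)
        x = random.randint(0, size[0] - w)
        y = random.randint(0, size[1] - h)
        boxes.append({"bbox": [x, y, w, h], "area": w * h, "category_id": 1})
    return boxes
```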
… loss tracking

- Add GPU support fallback for test execution (see the device-check sketch below).
- Include train dataset evaluation and diagnostics for better loss comparison.
- Refine loss and mAP@50 assertions for improved test accuracy.
- Update assertion messages to include variable names for better readability.
- Adjust messages to improve debugging context in test failures.
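
The commit does not show how the GPU fallback is implemented; the standard torch idiom would be a device check like the one below, offered as a guess rather than the PR's actual mechanism.

```python
import torch

# Fall back to CPU when no GPU is available so the benchmark can still run in CI.
device = "cuda" if torch.cuda.is_available() else "cpu"
```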