
Hi, authors and community members,
I recently tried running the official evaluation script provided in this project. The command I used is:
python train_net.py --eval_only --resume --num-gpus $n --config-file configs/semantic_sam_only_sa-1b_swinL.yaml COCO.TEST.BATCH_SIZE_TOTAL=$n MODEL.WEIGHTS=/path/to/weights
The evaluation completed successfully, and I obtained metrics like the following (please see the attached screenshot as well):
'noc@0.5': 8.81
'noc@0.8': 13.51
'noc@0.9': 16.86
'miou@iter1': 0.5508
....
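
For what it is worth, my rough understanding (which may well be wrong, hence this question) is that these are the standard interactive-segmentation metrics: noc@T is the average number of simulated clicks needed before the predicted mask first reaches IoU T (capped at some maximum, commonly 20 clicks), and miou@iterK is the mean IoU after the K-th click. Below is a minimal, self-contained sketch of that computation, assuming the common RITM/SimpleClick-style protocol rather than this repo's actual evaluator; `noc_at`, `MAX_CLICKS`, and the toy IoU trajectories are my own placeholders:

```python
# Sketch of NoC@T / mIoU@iterK under the common click-simulation protocol.
# Assumption only -- NOT this repo's actual evaluation code.
import numpy as np

MAX_CLICKS = 20  # common cap; samples that never reach the target IoU count as the cap

def noc_at(per_click_ious, threshold):
    """Number of clicks until IoU first reaches `threshold`, capped at MAX_CLICKS.

    `per_click_ious[k]` is the IoU of the predicted mask after k+1 clicks.
    """
    for k, iou in enumerate(per_click_ious[:MAX_CLICKS], start=1):
        if iou >= threshold:
            return k
    return MAX_CLICKS

# Toy example: per-sample IoU trajectories over successive simulated clicks.
samples = [
    [0.40, 0.62, 0.78, 0.85, 0.91],  # reaches IoU 0.9 on the 5th click
    [0.55, 0.81, 0.88, 0.92],        # reaches IoU 0.9 on the 4th click
]

print("NoC@0.9:", np.mean([noc_at(s, 0.9) for s in samples]))  # (5 + 4) / 2 = 4.5
print("mIoU@1 :", np.mean([s[0] for s in samples]))            # (0.40 + 0.55) / 2 = 0.475
```

If that reading is correct, lower noc@T and higher miou@iter1 are better, but I would appreciate confirmation.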
However, as I am relatively new to the segmentation field, I am not entirely sure how these metrics map onto the results and tables reported in the paper.
Could anyone kindly help me with:
- A brief explanation of what these metrics mean in practice
- How they align with the numbers reported in the paper (e.g., specific tables or benchmarks)
- Whether there is an officially recommended evaluation setup to exactly reproduce the reported results
Any suggestions or clarifications would be greatly appreciated. Thanks a lot in advance!