Dataset Quality Management Toolkit
A COCO-format dataset QA toolkit for validating annotations, analyzing dataset distribution, and prioritizing suspicious samples for human review.
- COCO annotation validation
- bbox, category, and reference checks
- class and object-size distribution report
- review queue design
- issue taxonomy for labeling errors
Instead of manually checking random samples, this toolkit helps prioritize high-risk samples for human review.