Public test categories and what they prove.
| Category | Test pattern | Buyer evidence |
|---|---|---|
| Scale | 50,000 SKU synthetic benchmark | Runtime, grouping stability, report generation |
| False positives | Dimension and material trap catalogs | Unsafe look-alike matches suppressed |
| Recall | Real-world abbreviation chaos catalogs | SAP-style shorthand and verbose descriptions matched |
| Governance | Tier 1 / Tier 2 / Tier 3 outputs | Human review sequence preserved |
| Data handling | Source purge after report generation | No source catalog retention |