AI audit vs rubric — Alibaba
An independent Workers AI LLM scored Alibaba against the same published rubric. The deterministic rubric result is our canonical score. The LLM's result is shown here as a sanity check — never mixed into the scoring formula.
| Dimension | Rubric | LLM | Δ (LLM − Rubric) |
|---|---|---|---|
| Pricing transparency | 72 | 20 | -52 |
| Business transparency | 85 | 80 | -5 |
| Shipping clarity | 70 | 40 | -30 |
| Public reviews | 74 | 60 | -14 |
| Product range | 95 | 70 | -25 |
| Access & onboarding | 95 | 90 | -5 |
| Support track record | 60 | 60 | 0 |
| Store integrations | 55 | 40 | -15 |
| Overall | 77 | 58 | -19 |
What this means: Moderate disagreement — rubric bands may need tightening. Median per-dimension |Δ| is between 5 and 15.
Median per-dimension |Δ| = 14.5.
Pricing transparency is low because pricing is not visible without signup. Business transparency is high because the company is publicly listed and has audited statements. Shipping clarity is low because origins and delivery windows are not clearly published per region. Review score is moderate because there are 4.0-4.3 stars across 500+ reviews. Product range is moderate because there are 100K+ SKUs. Access is high because there is a free plan with no signup required to browse. Support is moderate because there is mixed feedback. Integration is low because there is only one platform integration.This is the LLM's own explanation, not editorial commentary from SupplierSpy. The LLM result is a sanity check on the rubric — never mixed into the scoring formula.