AI audit vs rubric — Shopify Collective

An independent Workers AI LLM scored Shopify Collective against the same published rubric. The deterministic rubric result is our canonical score. The LLM's result is shown here as a sanity check — never mixed into the scoring formula.

Dimension               Rubric   LLM   Δ (LLM − Rubric)
Pricing transparency        84    20   -64
Business transparency       85    65   -20
Shipping clarity            80    40   -40
Public reviews              85     0   -85
Product range               75    70    -5
Access & onboarding         95    90    -5
Support track record        80     0   -80
Store integrations          55     0   -55
Overall                     82    43   -39
What this means: Large disagreement — investigate. The LLM read the published signals very differently from the deterministic rules. Median per-dimension |Δ| = 47.5.
Pricing transparency is low because Shopify Collective doesn't publish source prices without signup. Business transparency is mid-range due to the public parent company and audited statements. Shipping clarity is low because delivery windows per region are not clearly published. Review score is 0 because no reviews are available. Product range is mid-range due to 100K+ SKUs. Access is high because a free plan is available without signup. Support is 0 because no support feedback is available. Integration is 0 because only one platform is natively integrated.
This is the LLM's own explanation, not editorial commentary from SupplierSpy.
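The Δ column and the median per-dimension |Δ| quoted above can be reproduced from the table with a few lines of Python. This is a minimal sketch for checking the arithmetic, not SupplierSpy's actual code; the variable names are illustrative, and the scores are copied by hand from the table.

```python
from statistics import median

# (rubric score, LLM score) per dimension, copied from the table above.
# The "Overall" row is left out: it is not one of the per-dimension values.
scores = {
    "Pricing transparency": (84, 20),
    "Business transparency": (85, 65),
    "Shipping clarity": (80, 40),
    "Public reviews": (85, 0),
    "Product range": (75, 70),
    "Access & onboarding": (95, 90),
    "Support track record": (80, 0),
    "Store integrations": (55, 0),
}

# Delta is defined in the table header as LLM minus rubric.
deltas = {name: llm - rubric for name, (rubric, llm) in scores.items()}

# Median of the absolute per-dimension deltas.
median_abs_delta = median(abs(d) for d in deltas.values())

for name, delta in deltas.items():
    print(f"{name:<24}{delta:>5}")
print(f"Median |delta| = {median_abs_delta}")
```

With eight dimensions the median is the average of the two middle absolute deltas (40 and 55), which gives 47.5. Note that a plain average of the eight rubric scores is 79.875 rather than 82, so the overall figure is presumably weighted; the weights are not stated in this section.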