Topic 02 · AI Tagging

AI Tagging.

The vendor-neutral research index for AI image tagging. Capability matrices across classical computer-vision APIs (Google, AWS, Azure, Clarifai, Imagga, Hive, Cloudinary) and frontier multimodal LLMs (Anthropic Claude, OpenAI GPT-4o, Google Gemini). Same rubric. Same month. Public docs only.

1 report · 11 statistics · 4 cited from external research · 1 report in preparation

Featured report

Report 03 · Live Index · v1.0

The AI Tagging Provider Index, May 2026

10 image-tagging APIs × 6 capability dimensions, scored from public docs. Pricing, free tier, OCR, custom training, multimodal reasoning, SLA. Re-verified monthly. Vendor corrections invited.

n=10 providers · Published 2026-05-27

In preparation

Report 05 · In preparation

AI Tagging Accuracy Field Study

Precision, recall, and human-agreement rates of LLM-generated creative tags across 10,000+ human-verified assets. Per category, per model. Drops Q3 2026.

Q3 2026

What we're publishing on this topic

AI tagging is the most-claimed feature in 2026 DAM and creative-tooling marketing, and the least rigorously benchmarked in the same year's journalism. The Provider Index (Report 03) measures the capability surface of 10 major image-tagging APIs: what they can do, where they hide pricing, and where the split between classical CV and frontier LLMs shows up. The Accuracy Field Study (Report 05, in preparation) will measure the tags they actually produce, scored against human-verified ground truth. Both reports are re-verified on a published cadence, and corrections from providers are folded in within 7 days.
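To make "scored against human-verified ground truth" concrete, here is a minimal sketch of per-asset precision and recall for a set of generated tags. It is an illustration only: the Field Study's actual scoring method is not yet published, and the function name `tag_scores`, the case-insensitive exact-match rule, and the sample tags are all assumptions.

```python
def tag_scores(predicted, verified):
    """Return (precision, recall) of predicted tags against verified tags.

    Assumption: tags match only when equal after trimming and lowercasing.
    A real study might also credit synonyms or partial matches.
    """
    pred = {t.strip().lower() for t in predicted}
    truth = {t.strip().lower() for t in verified}
    hits = pred & truth  # tags the model produced that a human also verified

    # Precision: share of produced tags that are correct.
    precision = len(hits) / len(pred) if pred else 0.0
    # Recall: share of verified tags that the model produced.
    recall = len(hits) / len(truth) if truth else 0.0
    return precision, recall


# Hypothetical asset, not taken from any report.
predicted = ["Beach", "sunset", "dog", "surfboard"]
verified = ["beach", "sunset", "ocean"]
precision, recall = tag_scores(predicted, verified)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this made-up case two of the four generated tags are verified, and two of the three verified tags were generated, so precision is 0.50 and recall is about 0.67. Averaging such scores per category and per model is one plausible way to arrive at the kind of breakdown the study describes.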

If you build on a vision API and want to nominate a provider for the next index revision, or share a behavioural data point we should incorporate, mail the team via the About page.

Statistics — featured (cited from external research)

Cost spread between cheapest and priciest frontier multimodal LLM

Top MMMU-Pro multimodal score, May 2026 (GPT-5.4 Pro)

Multimodal LLMs lose 17-27 points when text shortcuts are removed

Image recognition API market: $3.12B (2025) to $7.49B (2032), 13.6% CAGR

Statistics — from the AI Tagging Provider Index (capability slices)

Vision APIs that can answer free-form questions about an image

Vision APIs publishing per-unit pricing

Vision APIs with a free tier — no credit card

Vision APIs with documented OCR / text extraction

Vision APIs with custom model training via API

Vision APIs publishing a paid-tier uptime SLA

Average AI-generated tags per creative asset
