How a claim enters the Atlas — the sourcing rules, data tiers, freshness policy, and what Reviewed on a page actually means. Methodology is the trust signal, not star counts.
Every factual claim in the Atlas must satisfy four rules before it is published. No exceptions.
data.json field "source".src comments in the generated HTML.Not all data refreshes the same way. The Atlas uses three tiers, visible on every instrument card and the data API page.
| Tier | How it works | Instruments | Cadence |
|---|---|---|---|
| Tier 1 — Live | A refresh.sh script pulls data from a public API or dataset at 06:00 UTC daily. No credentials. The generated timestamp in data.json is the actual pull time. |
EC2 Observatory, Azure VM Atlas, OCI Compute Observatory, OVH Instance Catalogue | Daily |
| Tier 2 — Curated | Hand-maintained data that is re-serialized to data.json by refresh.sh, but the underlying data only changes when a human edits the source HTML and re-stamps the verified date. The script proves the JSON faithfully mirrors the HTML; a CI region-drift guard pins the set to a vendor-verified reference. |
Region Map, GCP Compute Index | On change (human-triggered) |
| Tier 3 — Dated snapshot | Fully hand-curated reference content. Each page carries a Verified · YYYY-MM-DD stamp. CI freshness guards alert when a snapshot is older than its threshold (45 days for the fast-moving Generative AI Atlas; 180 days for the compliance and cross-cloud matrices; 365 days for Decisions). | All APIM instruments, Kubernetes, IAM, Networking, Compliance, Confidential Computing, Observability, Sovereignty, AI Atlas, Egress, Equivalent-SKU, Decisions | On re-verification |
The hero "Updated daily · 06:00 UTC" refers specifically to the Tier 1 instruments. The manifesto and colophon on the homepage state this correctly. Do not generalise it to the rest of the Atlas.
On a Tier 3 instrument, the Verified · YYYY-MM-DD stamp is an assertion that, on that date, a maintainer:
◐ (partial), or annotated with a caveat note.src field (visible in the detail drawer). A verified page has no unsourced claim. If a source cannot be found for a claim, the claim is removed or downgraded to ◐ with a note.The following checks run on every push and fail loudly if data drifts. They are the mechanical layer of the sourcing commitment.
check_region_drift.py pins the Region Map's provider sets to a dated, vendor-verified region-reference.json. Any region added or removed without updating the reference fails CI.verify-data.yml re-extracts data.json from the curated index.html for Tier 2 instruments and diffs it against the committed file. Proves the JSON faithfully mirrors the HTML source.check_landing_stats.py derives instrument counts from data.json and fails if a homepage chip shows a stale number. sync_landing_stats.py self-heals the chips after each daily refresh.check_instrument_count.py counts the live instrument cards in the HTML and fails if any static count literal (hero stat, section header, filter chip) diverges. Prevents the "21 vs 22 instruments" self-report drift.check_og_card.py fails if the social share card's instrument count diverges from the live cards. build_og.py self-heals it.verify-freshness.yml fails when a Tier 3 instrument's Verified stamp is older than its threshold, forcing a re-verification pass. Thresholds: AI Atlas 45 days; most matrices 180 days; Decisions 365 days; European Sovereignty 183 days.The Atlas deliberately uses three levels in its comparison matrices:
Per-region family membership in the EC2, Azure and OCI compute instruments is sourced from public datasets and is best-effort, not independently credential-verified. The exact wording is in docs/data-policy.md §3. GCP compute has no credential-free price source; the instrument carries no pricing column rather than invent one.
One engineer, working in public. The Atlas is open-source on GitHub. All data sources are public, credential-free, and documented. Corrections are welcome — the fastest path is an issue with the contradicting upstream source.
There is no company, no investor, no ad network, no vendor relationship. The Atlas makes no money from the data it shows. The funding model is reader-supported donations; the independence model is "the methodology is public and auditable".