Provider data landscape · One cell per state × specialty

The federal provider directory, decomposed

A free, public audit substrate for the CMS National Provider Directory and the REAL Health Providers Act. Each tile is one state × specialty cell. Area scales with the count of active practitioners. Color is the metric in the layer you select — switch layers without losing your place. Click any cell to verify the methodology against primary federal sources. Looking for the state-by-state federally-excluded view? Open the map →

Active practitioners
1.1M
across 548 cells
Current metric
80.6%
national, practitioner-weighted
NDH release
2026-05-08
CMS public use file
Methodology
v0.7.1
seed payload
Layer
Tile by

Each box is one specialty. Cells inside it are states.

Completeness. Share of provider records with every § 6220-required field populated (name, specialty, contact, address, new-patient acceptance, ADA, language, telehealth).

Loading visualization…

How to read this

  • Spatial layout does not change when you flip layers — only color animates. The same cell sits in the same place, so you can learn the geography once and watch each metric move across it.
  • Area = scale. A large California allopathic-physician cell carries more practitioners than the entire Vermont workforce; the treemap encodes that directly.
  • Cells with fewer than 25 practitioners are suppressed to protect against PHI risk on small populations and to keep the visual readable.
  • Color is normalized per layer to a constant diverging scale (red = worse, green = better). Higher completeness, agreement, reachability, integrity, and specialty validity are better; lower median update days are better.

Methodology & data lineage

Each metric is computed by pre-aggregation in BigQuery (analysis/landscape.py) and emitted as a typed JSON file: /api/v1/landscape.json. External consumers, regulators, and researchers can pull the same file as the visualization. Methodology version: 0.7.1-draft-seed · Release: 2026-05-08 · Generated: 2026-06-02T00:00:00+00:00.

Per-dimension methodology references: completeness · cross-source agreement · currency · reachability · integrity.

Seed data notice: the current payload is a deterministic synthetic seed used for UI development. Run python analysis/landscape.py against BigQuery to replace with measured values before relying on cell-level numbers for any external use.