Data quality · Diagnostics

Coverage & QA — xwalk, orphans, price coverage

Pipeline-trust diagnostics for the public-data layer. Before we ship the opportunity set externally, these are the numbers that tell us how much to trust the plant universe, how cleanly we match ONS events to SIGA assets, and where price gaps distort R$ estimates. Anything tracked here is a leading indicator of external credibility — a single large-MW orphan in the wrong place embarrasses us on the first call.

Xwalk rows
Event rows scanned
SIGA-op orphans ( MW) · EOL / UFV
Mean price coverage
Built

Headline QA

Xwalk — exact CEG match
of plants · gold-tier join
Event orphan rate
of event rows without tartana_asset_id
Price coverage — mean
min month: · CMO ½-hours matched
SIGA-op orphans (≥3 MW)
MW · EOL ( MW) · UFV ( MW)

Xwalk tier distribution

ONS constrained-off events join the SIGA asset master through a plant xwalk. Tiers from best to worst: exact ceg match, canonical-name match, fuzzy, manual override. Everything outside the top tier is a candidate for manual review.

Plants by xwalk tier

Tier breakdown

Price coverage by month

Fraction of curtailed half-hours with a CMO price attached. Dips indicate bronze CMO gaps — typically month-boundary upload lag at ONS. A dashed line at 95% marks our operational bar.

SIGA-operating orphans — ≥ 3 MW EOL, no event match

Wind plants SIGA lists as operating and utility-scale but which never appear in the constrained-off feed, matched or otherwise. These are either (a) xwalk misses we should chase, (b) plants that genuinely never curtailed in the window, or (c) very recent commissioning dates. Review by plant size — large-MW orphans in curtailed submercados are the highest-value thing to investigate.

Operational priority. An orphaned CEG with high nameplate in a curtailment-heavy submercado (NE wind, for instance) is the single highest-leverage investigation in QA. If the plant is live and being constrained-off but we're not joining its events, we're leaving backlog on the table. Review the top rows manually and seed the xwalk_manual override CSV.
What this page does NOT cover. It validates the ONS→SIGA spine and the CMO price spine. It does not validate legal eligibility (per-event compensable flags are private) or holdco consolidation (curated dim_grupo_economico is a separate map). Both are expected — the point here is to know the public-data floor is clean.