Confidence and evidence¶

Venturi attaches an honest measure of certainty to every attribution and shows the evidence behind it. This is what makes the numbers safe to act on: you can tell at a glance whether a chargeback line is directly resolved, strongly inferred, or genuinely uncertain — and decide accordingly.

This page explains the confidence score, the output states, the evidence card, and how Venturi handles the honest unknown.

Operational confidence: `coper`¶

Every attributed edge carries one customer-facing confidence number, coper (operational confidence), on a scale from 0 to 1.

coper is capped at 0.95

coper is capped at 0.95 — a deliberate, conservative policy ceiling. Venturi never reports an attribution as fully certain, even when its internal evidence is a direct, exact match. This is a transparency decision, not a statement about model error: it keeps every score auditable and prevents overconfident chargebacks. There is one customer-facing confidence object, and 0.95 is its ceiling.

You will see coper on dashboards, in exports, and in API responses, both as a number and as a band.

The chargeback floor: 0.80¶

For an attribution to be chargeback-eligible — billed back to a team or used as the base for savings-share billing — it must clear a single, canonical confidence floor:

Chargeback floor: coper ≥ 0.80

An attribution is eligible for chargeback only when its operational confidence is 0.80 or higher. This is the same floor used everywhere Venturi makes a financial commitment, so what you can charge back and what you can bill on are always governed by one consistent rule.

Attributions below 0.80 remain fully visible and useful for analysis and investigation — they are simply not promoted into a chargeback line until they clear the floor. Allocation-derived and network-inferred splits are capped below 0.80 by design, so shared-cost estimates are never silently charged back as if they were direct ownership.

Confidence bands¶

To make the score easy to act on, coper is summarized into bands:

Band	Range	What it means
Chargeback-ready	`coper ≥ 0.80`	Confident enough to bill back. Cleared the canonical floor.
Provisional	`0.50 ≤ coper < 0.80`	A usable estimate — good for analysis, not yet for chargeback.
Estimated	`coper < 0.50`	A weak signal. Surface it, investigate it, do not bill on it.

Fractionally allocated and network-derived edges can never reach Chargeback-ready; their highest attainable band is Provisional, because they represent shared or indirect cost rather than direct ownership.

Output states¶

Confidence answers how sure; the output state answers how the answer was reached. Venturi emits exactly six output states per resolved edge, and they never blur together:

Output state	Meaning	Comes from
`deterministically_resolved`	Resolved from facts with certainty.	Stage A — exact keys and rules.
`strongly_inferred`	Inferred with high confidence by the model.	Stage B — RAIL.
`bounded`	Uncertain enough that Venturi reports a constrained range, not a single owner.	Stage B.
`ambiguous`	Multiple candidates remain plausible and no single answer is justified.	Stage A conflict or Stage B.
`unknown`	The evidence does not support an attribution at all.	Any stage.
`not_identifiable`	The signal cannot be tied to a known node in your environment.	Any stage.

These states let attribution degrade gracefully — from precise, to provisional, to a transparent unknown — without ever collapsing distinct kinds of truth into one falsely precise number. The last two states are how Venturi tells you the truth instead of guessing.

Evidence cards¶

Every customer-visible result carries the metadata you need to interpret and trust it — an evidence card. Wherever you can see an attribution, you can open its card and see exactly why Venturi reached that conclusion:

Field	What it tells you
Stage origin	Which stage produced the result: `stage_a`, `stage_b`, or `stage_c`.
Output state	One of the six states above.
Confidence	The `coper` score and its band.
Evidence basis	The signals that supported the attribution — e.g. direct account binding, HRIS membership, deployment metadata, telemetry correlation.
Model version	When inference was involved, the trained model version that produced it, linked to its tracked record.
Result source	Whether the answer was produced by a trained model or by a heuristic baseline — carried as a distinct, queryable marker so a fallback result is never mistaken for a model-backed one.
Degradation state	Whether the result was produced normally or by a fail-open fallback.
Demotion reason	If a previously chargeback-ready edge has been capped below the floor, the reason it was demoted — calibration drift or model age.
Freshness	The timestamp the underlying data was current as of.

Because every result carries its basis, a disputed chargeback is never a black box. You can trace any number back to the evidence — including conflicts, where Venturi preserves all the disagreeing sources rather than hiding them.

The honest unknown¶

The most important thing the engine does is refuse to guess when the evidence does not justify an answer.

Unknown is a first-class answer

When attribution genuinely cannot be resolved, Venturi marks the result unknown or not_identifiable and surfaces it explicitly. Unattributed and shared spend is visible and exportable — never coalesced into an aggregate, never silently assigned to the nearest team.

This is why the dashboard shows you an unattributed-spend figure rather than hiding it. When you investigate one of these, the evidence card tells you why it is unknown — stale org data, a shared API key, conflicting CI/CD metadata, weak telemetry, or an unmapped identity — so you know whether the fix is a data-quality issue, a coverage gap, or a genuine limit of what is knowable.

Confidence you can trust over time¶

A confidence score is only worth as much as it stays true. An edge that was chargeback-ready last quarter is not automatically chargeback-ready today: the model behind it can drift, and the data behind it can age. Venturi watches for both and moves an edge out of chargeback-grade the moment it can no longer stand behind the number — it never silently keeps billing on decayed precision.

A continuous calibration and freshness check¶

For every serving model, Venturi continuously measures whether its confidence still matches reality — comparing the scores it emits against accumulating ground-truth outcomes — and watches how long that model has been in service. When either signal says the edge is no longer trustworthy at chargeback grade, the edge is demoted:

What Venturi detects	What happens to the edge
Calibration has drifted — the model's confidence no longer tracks real outcomes over a sustained window.	The edge is capped to at most Provisional (`coper` held below the 0.80 floor) until the model requalifies.
The model has aged past its serving limit.	The edge is capped below the floor and flagged for refresh, so a stale model stops feeding chargeback lines.

A stale edge cannot stay chargeback-ready

These checks only ever demote confidence — they never raise it. Restoring an edge to Chargeback-ready always requires a fresh, human-approved model, never an automatic promotion. When an edge is demoted, its evidence card records why — calibration drift or model age — so a previously chargeback-grade line that drops is explained, not mysterious.

The practical effect: you are never billed on false precision. If the engine cannot still prove an attribution is accurate at chargeback grade, that attribution leaves the chargeback base on its own — visibly, with a reason you can read — rather than quietly continuing to bill at a confidence it no longer earns.

Knowing a model stood behind the result¶

On the gateway hot path, Venturi is built to fail open: if the trained model cannot answer in time, a fast heuristic baseline answers instead so your requests are never blocked. That is the right behavior for availability — but a fallback answer must never be mistaken for a model-backed one, even when the heuristic is confident.

Every result says whether a model produced it

Each result carries an explicit, queryable result source marker — trained model versus heuristic baseline — alongside a degraded flag, and this marker travels with the result all the way to the API, unchanged even when the heuristic's coper is 0.80 or higher. A confident fallback is still labeled a fallback.

This is the signal Venturi itself uses to keep degraded results honest: fallback attributions are excluded from the savings counterfactual and from adoption cohort rollups, so a number you are billed on is never built on a result the model did not actually produce. You can filter on the same marker — in the dashboard, in exports, and in API responses — whenever you need to confirm that a given chargeback or savings figure rests on trained-model attribution.

How to act on confidence¶

You see…	Do this
`deterministically_resolved`, Chargeback-ready	Bill it back with confidence.
`strongly_inferred`, Chargeback-ready	Bill it back; review the evidence card if disputed.
`bounded` or Provisional	Use for analysis and budgeting; investigate before charging back.
Allocated split	Treat as a fair fractional estimate of shared cost, not direct ownership.
A demoted edge (Chargeback-ready → Provisional)	Read the demotion reason on the evidence card — calibration drift or model age — and expect it to requalify once the model is refreshed; do not bill on it while demoted.
Result source = heuristic baseline / degraded	Treat as a fallback estimate; exclude it from chargeback and savings figures until the same edge is re-observed under trained-model attribution.
`unknown` / `not_identifiable`	Investigate the cause shown on the evidence card; improve the underlying data source.

Where to go next¶

The HRE pipeline — the three stages that produce these scores and states.
The attribution graph — the edges each result attaches to.
Energy and carbon — how Venturi applies the same honest-unknown discipline to energy estimates.