The control plane for AI compute spend

Put AI compute on a budget.

AI has become a runaway, unpredictable line item. Outlay attributes every dollar of LLM and coding-agent spend to the work you already plan — tickets, epics, roadmap — forecasts cost by scope, and estimates what planned work will cost before you build it. Finally, the fastest-growing line item is as predictable as the work that drives it.

Read-only to start · prompts never leave your environment · no app rewrite · built for security review →
Spend by scope · last 30 daysexample
Epic — Q3 Stability platform$6,430
Epic — Billing v2 growth$4,210
Epic — Auth / SSO platform$2,980
Unmapped reconciled to invoice$1,940
Budget burndown · this quarterover pace
$45kof $50k budget · 71% of period elapsed
On pace to land ~$61k — 22% over budget · flagged with 6 weeks left to act
Budget$50k
Projected at current pace$61k
Maps your trackers to your AI tools JiraLinearGitHub Issues Claude CodeCursorAnthropic API
One control plane, two owners

The question leadership keeps asking — finally answerable.

“What is this body of planned work costing us in AI, and are we on track?” Today's tools meter infrastructure — API keys, models, dashboards. Outlay meters the work.

For engineering leaders

Predictable delivery, by scope.

See exactly what each ticket, epic, and sprint costs in AI — and which work is about to blow its estimate — the way you already plan effort.

  • Cost per ticket, epic, sprint, and engineer, reconciled to the real invoice.
  • Estimate planned work from its requirements + design docs — budget an epic before you build it.
  • Forecast a quarter from its open scope, not from last month's bill.
  • Anomaly flags on the one ticket burning 10× its class — before it's a surprise.
For finance & FinOps

Govern the fastest-growing line item.

Allocate AI spend to teams and cost centers, set budgets by scope, and get alerts before overspend — not a surprise invoice at month-end.

  • Allocation of every dollar to a team, cost center, and roadmap item.
  • Budget vs. actual with pace-based projection to end-of-period.
  • Pace-based guardrails that flag a scope trending over before the invoice lands.
How it works

A closed loop: attribute, forecast, guardrail.

Connect the tracker and AI tools you already use. Read-only to start — no app rewrite, no prompts leaving your environment.

1

Attribute

Every dollar of agent & LLM spend mapped to the ticket, epic, and roadmap it belongs to.

# tag / branch / PR → ticket fix/PROJ-123 → PROJ-123 → Q3-stability
2

Forecast

Predict a quarter's cost from its open scope, the way you estimate engineering effort.

# open roadmap × per-class cost Q3 forecast: $48k ± $9k
3

Guardrail

Pace-based alerts flag a team trending over before it blows through — not after.

# projected, not a hard cap epic Q3-stability on pace $22k · over
Attribution nobody else has

Spend mapped to the work — projected on pace.

Every gateway, observability, and FinOps tool stops at an infrastructure tag. Outlay resolves the git context of each AI call back to the ticket — and budgets it like real work.

Budget burndown · epic Q3 Stabilityover
Budget (this quarter)$5,000
Spent so far · 29% of period$6,430
Projected at current pace$22,000

Illustrative. Flagged OVER early — driven by one outlier ticket at 11× its class median.

By ticket
Every dollar attributed to the work it belongs to — ticket, epic, team, sprint, roadmap — and reconciled to the provider invoice.
On pace
Guardrails project the end-of-period total at the current burn rate, so a lead acts before overspend — not after.
Measured, not promised
Every forecast is back-tested on your own closed tickets (leave-one-out), and we show the error and the sample size — never a vendor benchmark.
Metadata only
Attribution runs on categories, token counts, and ticket IDs. Prompt content, model outputs, and keys never leave your environment.
Plan with numbers, not guesses

Estimate a project's AI cost before you build it.

Roadmap and sprint planning shouldn't guess at compute. Hand Outlay the planned work — epics and tickets with their requirements and design docs — and it prices each one against the cost model it learned from your own delivered work, with a confidence range you can budget against.

Planned Q3 backlog · estimatedexample
SSO — SAML + SCIM requirements + design doc$3,800
Billing v2 migration large · thin scope$9,400
Flaky-test cleanup well-specified$1,200
Total estimate~$14,400
Likely range$9,900 – $19,900

Illustrative. Each item priced from your own per-work-type history; thin-scope items get a wider band, not a false-precise number.

From your own history
Each item is priced against the cost-per-work-type model learned from your delivered tickets — not a generic benchmark.
Requirements + design docs
Feed the scope you already wrote. The more context — requirements, design docs, story points — the tighter the range.
Confidence, not false precision
Every estimate carries a low–high band and a confidence tier; under-specified work is flagged to tighten, never silently guessed.
Budget before you commit
See what an epic or a sprint will cost in AI before it's built — so you can rescope, resource, or defer with eyes open.
The platform

From line item to control plane.

Spend mapped to your roadmap

LLM and coding-agent spend attributed to the ticket, epic, sprint, and team it belongs to — the allocation finance has never had for AI.

Forecast by scope of work

Predict a quarter's AI cost from its open scope, then watch budget-vs-actual burn down in real time.

Pace-based guardrails

Projections, not hard caps. A team trending over is flagged ok → warn → over with time to act, and outlier tickets are caught automatically.

Accuracy you can check

Every forecast is back-tested on your own closed tickets, leave-one-out — we show the measured error and the sample size, by work type. No vendor benchmark to take on faith.

Works with your stack

Jira, Linear, GitHub Issues; Claude Code, Cursor, the API. Reliable attribution even for remote / CI agents via explicit task-tagging.

Privacy by architecture

Attribution runs on metadata — connected with read-only tokens. Prompt content, model outputs, and your API key never leave your environment.

Privacy by architecture

Governance without shipping us your prompts.

Most spend tools see everything you send. Outlay is built so the sensitive data physically can't reach us — purpose-built for teams that can't let prompts leave their environment: healthcare, legal, and financial services.

Your toolstracker + AI usage Read-only syncusage counts + ticket ids Outlayspend mapped to work, forecast, budgets
Connected with read-only tokens — Outlay reads usage metadata (models, token counts) and ticket references, never prompt text or model outputs.
Your API keys and prompts never pass through Outlay — we read your provider's usage data, not your traffic.
Attribution and budgets are computed from categories, dollars, and ticket IDs — not content.
Per-deployment isolation, and you can leave anytime — your data stays yours.
How we compare

A different category, on purpose.

FinOps suites allocate cloud bills. Observability tools trace API calls. Native consoles cap per seat. None attribute AI spend to the work you plan, or budget it by scope — because they live at the infrastructure layer and can't see your roadmap.

  Outlay FinOps suites LLM observability Native consoles
Attributes spend to tickets, epics & roadmap cost-center tags only spans / agent-runs seat / team
Forecasts cost from planned scope ~cloud trend forecasts
Pace-based budget guardrails by scope ~account budgets/alerts ~per-seat caps
Forecast accuracy back-tested on your own work leave-one-out, with sample size
Prompts never leave your environment reads billing, not content traces prompts/outputs

Where they fit: keep a FinOps suite for cloud, and observability for debugging quality. Outlay is the layer they can't be — AI spend attributed to the work you plan, forecast by scope, and held to budget. See the full comparison →

Pricing

Start as a design partner.

The budgeting & governance platform is in early access — we're onboarding a first cohort now, and setting pricing with them. Pilots run free.

Design-partner pilot

A focused engagement: connect one tracker + your AI tools, get your real spend mapped to tickets and a budget forecast in weeks — read-only, no app rewrite.

  • Spend attributed to your tickets, epics & roadmap
  • Budget-vs-actual with pace-based guardrails
  • A real ticket-coverage & forecast read on your data
  • Influence over the roadmap and pricing
Book a pilot →
What a pilot looks likeRead-only, no app rewrite. You see your real spend mapped to work in weeks — not a slide deck.
  • Connect one tracker + your AI tools with read-only tokens
  • Spend mapped to tickets, a forecast, and a measured-accuracy read
  • Prompts & keys never leave your environment

Platform pricing for budgeting & governance is set with design partners — talk to us.

FAQ

Questions, answered.

How does Outlay attribute spend to a ticket?

It resolves the work context of each AI call from the most reliable signal available — an explicit task tag from your agent launcher or CI, the git branch, the PR's closing-issue link, or a commit trailer — and maps it to the ticket, epic, and roadmap in Jira/Linear/GitHub. Where a team already links work to tickets it's automatic; where they don't, a one-line explicit tag makes it reliable — even for remote/CI agents where the branch is detached.

Do you see my prompts or my customers' data?

No. Outlay connects with read-only tokens and reads metadata — task category, token counts, ticket IDs, and your provider's usage data. Prompt text, model outputs, and your API key never reach our servers.

Are the guardrails hard caps that block work?

No — they're pace-based projections, not caps. A scope on track to exceed its budget is flagged warn (and over once it crosses), with the projected end-of-period total at the current burn rate, so a lead acts while there's time. A hard cap that binds mid-task just stops work; we flag the trend and outlier tickets instead.

How accurate is the forecast?

We don't ask you to trust a vendor benchmark. Outlay back-tests its forecast on your own closed tickets, leave-one-out — hide a ticket, predict it from the rest, compare to what it actually cost — and shows the measured error and the sample size, by work type. As more work closes, the number sharpens.

Can it estimate work that hasn't been built yet?

Yes — that's the forward estimator. Hand it a planned backlog (epics/tickets with their requirements, design docs, and story points) and it prices each item against the cost-per-work-type model learned from your own delivered work, returning a per-item and total estimate with a low–high confidence band. Under-specified items are flagged to tighten rather than guessed, so you can budget an epic or a sprint before committing to it.

What does it work with?

Trackers: Jira, Linear, GitHub Issues. AI tools: Claude Code, Cursor, and the Anthropic API (per-call usage + the org admin API for reconciliation). Read-only to start — no app rewrite.

How does pricing work?

The budgeting & governance platform is in early access; pilots run free and pricing is set with our first design partners. Talk to us to join.

Put your AI spend back on a budget.

See what your AI compute costs per ticket, epic, and team — and where it's about to go over. We're onboarding a first cohort of design partners now.

Read-only to start · prompts never leave your environment · no app rewrite