
Product
AI Billing is (mostly) token plumbing
Raffi Sarkissian • 5 min read
Jun 23
/5 min read
The subscription billing management market is projected to reach $17.95 billion by 2030, growing at a compound annual rate of 16.9% from 2025 to 2030.[1] The companies that get there first will not be the ones with the most features. They will be the ones that got billing right before it became a bottleneck.
This guide covers the full subscription management lifecycle for B2B SaaS: what makes it hard, what good looks like, and where most companies lose revenue without noticing.
Subscription management is the full lifecycle: onboarding, plan changes, renewals, cancellations, and everything in between. For B2B SaaS, that lifecycle is messier than it looks.
Enterprise clients negotiate custom contracts. Pricing changes mid-cycle. Some customers want seats, others want usage, others want both. Your billing system needs to handle all of it, accurately, at scale, without requiring an engineer every time a customer wants to upgrade.
The operational complexity compounds fast. Proration for mid-cycle changes. Multi-entity invoicing across currencies. Tax compliance across jurisdictions. Revenue recognition under ASC 606. Most companies underestimate this until they're already in it.
A robust subscription management system automates these workflows. The goal isn't just fewer manual errors, it's a single source of truth that finance, sales, and customer success can all trust.
Key Benefits:
Most billing problems are lifecycle problems. They happen at specific moments: when a customer signs, when they change plans, when a payment fails, when they're up for renewal. Here's what each stage requires.
The first billing problem is pricing flexibility. B2B customers don't fit standard tiers. They negotiate discounts, custom commitments, usage thresholds, add-ons. A platform that can't represent your actual deal structure forces workarounds. Manual invoices, offline credits, spreadsheet overrides, that create debt you'll pay later.
Three pricing patterns come up constantly in B2B SaaS: pure subscription (seat- or feature-based, fixed billing cycles), usage-based (customers pay for what they consume: API calls, tokens, compute hours), and hybrid (a subscription floor with usage overages on top). AI companies almost always need the third.
Billing is where most of the operational complexity lives. Proration when plans change mid-cycle. Consolidated invoices for multi-product companies. Credit notes when something was overbilled. Dunning when a payment fails.
Real-time metering matters here. Lago ingests up to 1,000,000 billing events per second, relevant for AI companies billing on tokens or GPUs, where usage spikes fast and late billing means lost revenue.
Example:
An AI SaaS provider bills customers a monthly subscription for platform access, plus a variable fee based on the number of tokens processed. Lago's event-driven architecture ingests token usage instantly, applies overage rules, and generates invoices without delay.
Most involuntary churn is preventable. A payment fails. Nobody retries it. The customer churns without wanting to. Dunning, the sequence of retries, reminders, and escalations after a failed payment, is one of the highest-ROI things you can automate.
Beyond failed payments, B2B renewal dynamics are different from B2C. Enterprise clients need approval from multiple stakeholders before renewing. That means sending renewal reminders earlier, not on the same schedule as a consumer subscription. Account managers need visibility into upcoming renewals before they become last-minute scrambles.
The analytics that support this, MRR, churn rate, expansion revenue, net revenue retention, only work if your billing data is clean and centralized. Fragmented data across billing, CRM, and finance is where renewal visibility breaks down.
The best expansion motion doesn't require a sales call. Customers that can see their own usage, understand their tier limits, and upgrade self-serve convert faster and complain less.
A customer-facing portal that shows real-time usage, current plan, and available upgrades closes a loop most SaaS billing setups leave open. It also reduces support load: fewer "what am I being charged for" tickets.
1. Pick a platform you can price-test without migrations. Pricing experimentation is a competitive advantage. A platform that requires an engineering sprint every time you want to change a tier or add a usage dimension kills that advantage. Look for no-code plan editors and sandbox environments that let you test pricing changes before they go live.
2. Automate dunning before you need it. Most teams set up dunning reactively, after they've already lost revenue to failed payments. Set it up on day one: automated retries on a sensible schedule, email sequences that escalate without being aggressive, and clear escalation logic for enterprise accounts that need a human touch.
3. Treat MRR as a lagging indicator, not the signal. MRR tells you where you are. Usage data tells you where you're going. Companies that only watch MRR miss expansion signals (customers approaching tier limits) and churn signals (customers whose usage is declining) until it's too late to act on them.
4. Don't let billing become an engineering bottleneck. Finance wants to change pricing. Product wants to add a new tier. Engineering is blocked three sprints out. This is the most common billing failure mode in growth-stage SaaS. An API-first, developer-friendly billing platform, one your team can configure without a professional services engagement, is the difference between pricing as a growth lever and pricing as a quarterly negotiation.
5. Solve tax compliance once, not per-market. EU VAT, US sales tax, GST — each jurisdiction has different rules, and they change. Building compliance logic per-market is a treadmill. Platforms with built-in tax automation (or clean integrations with Anrok/Avalara) let you expand globally without rebuilding your billing stack each time.

Unmetered overages. A customer uses more than their plan allows. Nobody bills for it. The fix is real-time usage tracking with automated overage rules, not a quarterly reconciliation.
Unbilled mid-cycle changes. A customer upgrades in the middle of a billing cycle. Proration is calculated wrong or skipped. Multiply that across hundreds of customers and you have a revenue leakage problem. Automated proration logic, not manual adjustments, is the only sustainable fix.
Stale payment methods. Cards expire. Bank accounts change. Without proactive payment method reminders and smart retry logic, failed payments turn into involuntary churn. Most dunning sequences retry too late and too infrequently.
Global expansion adds a fourth layer: multi-currency invoicing, local tax compliance, and multi-entity support for customers with subsidiaries. Start with your highest-revenue markets and build the compliance architecture once — not per country.
Usage-based monetization is now mainstream. Roughly 39% of SaaS companies price primarily on usage — a sharp increase from a decade ago.⁶ The AI wave is accelerating this further.
AI products introduced something genuinely new to SaaS billing: real COGS. Token costs, GPU-seconds, inference compute — these are variable costs that map directly to usage. Billing systems designed for flat-rate subscriptions weren't built for millisecond-level metering of elastic workloads. Most of them can't do it accurately at scale.
The shift has two implications. First, billing infrastructure matters more than it used to — metering accuracy is revenue accuracy. Second, prepaid credits have become a standard pricing pattern for AI companies because they give customers predictability without forcing vendors to eat margin variability.
Lago is open-source billing infrastructure built for teams that can't afford a black box. Every billing rule is inspectable. The platform runs on your infrastructure if you need it. No revenue share. Payment-processor agnostic.
For teams building on AI or hybrid pricing models, Lago handles real-time event metering, hybrid plan logic, usage-based billing, and global compliance in one system — without the professional services overhead of legacy vendors.