All reports
Product Strategy by deep-research

Service-as-Software — Pricing Models for AI Agents That Replace SaaS

OctantOSAgentScope

Service-as-Software — Pricing Models for AI Agents That Replace SaaS

Research date: 2026-03-19 | Agent: Deep Research | Confidence: High

Executive Summary

  • The “SaaSpocalypse” is real but nuanced — software stocks lost ~$2 trillion in early 2026, but global SaaS spending is still projected to reach $512B by 2028; the model is transforming, not dying
  • Foundation Capital’s $4.6T “service-as-software” thesis is now the default B2B AI investing frame — AI delivers outcomes, not tools, targeting the services market (4x larger than software)
  • Outcome-based pricing is the fastest-growing model — Intercom Fin ($0.99/resolution), Zendesk ($1.50–$2.00/resolution), and Sierra (custom) prove it works; Gartner projects 40% of SaaS contracts will include outcome-based components by 2026
  • AI agent startups are hitting unprecedented growth — Cursor $1B ARR in 24 months, Sierra $100M in 21 months, Devin from $1M to $73M in 9 months
  • For Moklabs, the hybrid model is optimal — base platform fee + usage/outcome tiers, aligning spend with value while maintaining revenue predictability

Market Size & Growth

The “Service-as-Software” Opportunity

MetricValueSource
Global services market (target)$4.6 trillionFoundation Capital
Global SaaS spending (2025)$318 billionIntellectia
Global SaaS spending (2028 est.)$512 billionIntellectia
Global SaaS spending (2029 est.)$576 billionIntellectia
Software market cap lost (Feb 2026)~$2 trillionTechCrunch
AI agents market (2025)$7.6–7.8 billionAI Funding Tracker
AI agents market (2026 est.)$10.9 billionAI Funding Tracker
AI agents market (2030 est.)$52.6 billionAI Funding Tracker
AI agents CAGR46.3%Salesmate
Enterprise apps embedding agents by 202640% (up from <5% in 2025)Gartner via Salesmate
SaaS contracts with outcome-based components by 202640%Gartner via Chargebee

Fastest-Growing AI Agent Companies

CompanyDomainARRGrowth RateValuationFunding
Cursor (Anysphere)Coding$1B+ (Nov 2025)$100M → $1B in 10 months
HarveyLegal$195M (end 2025)3.9x YoY$8B → $11B (Feb 2026)$960M+ total
SierraCustomer Service$100M (21 months)~$10B
GleanEnterprise Search$200M+Doubled in 9 months$7.2B$150M Series F
Cognition (Devin)Coding$73M (Jun 2025)$1M → $73M in 9 months
ElevenLabsVoice AI$330M+ (end 2025)3x valuation in months$11B$500M Series D

Key Players & Pricing Models

The Eight Pricing Structures

ModelHow It WorksBest ForExample
Per-SeatLicense per user (human or AI “seat”)Stable teams, predictable usageTraditional SaaS, some AI wrappers
Per-AgentLicense per deployed AI agentMulti-agent orchestrationOctantOS potential model
Usage-BasedPay per token, API call, or computeDeveloper tools, APIsOpenAI, Anthropic
Per-WorkflowPay per automated workflow executionProcess automationMake, Zapier
Per-OutputPay per generated artifactContent/code generationImage generators
Outcome-BasedPay per successful resultCustomer service, salesIntercom Fin, Sierra, Zendesk
Subscription (Flat)Fixed monthly/annual feeAll-you-can-eat usageChatGPT Plus
HybridBase subscription + usage/outcome tiersMost AI-native companies (recommended)Devin, Relevance AI

Real-World Pricing Deep Dives

Intercom Fin — The Outcome-Based Pioneer

  • Price: $0.99 per resolution (successful outcome)
  • How it works: You pay only when Fin fully resolves a customer issue without human intervention
  • Minimum: 50 resolutions/month
  • Standalone: Fin can be purchased without the Intercom platform at $29/mo
  • Key insight: Charged once per conversation, even if Fin resolves multiple questions
  • Source: Intercom

Zendesk AI — Dynamic Budget Remix

  • Price: $1.50/resolution (committed) or $2.00/resolution (pay-as-you-go)
  • Innovation: Dynamic Pricing Plan lets you shift budget between human seats and AI resolutions during your contract — no renegotiation needed
  • Example: Start at 20% automation → ramp to 60% → shift budget accordingly
  • Key insight: Bridges the transition from per-seat to per-resolution without forcing customers to choose one model
  • Source: Zendesk

Sierra AI — Custom Outcome-Based

  • Price: Custom per-resolution pricing (quote-based, ~$150K+ annually)
  • Model: Gets paid only when completing a task for the customer
  • Flexibility: Blended pricing for different interaction types — outcome-based for resolutions, consumption-based for routing/greeting
  • Growth: $100M ARR in 21 months proves the model works
  • Source: Sierra

Devin (Cognition) — Agent Compute Units

  • Tiers: Core ($20/mo), Teams ($500/mo), Enterprise (custom)
  • Unit: Agent Compute Unit (ACU) — ~15 min of active work
  • Price per ACU: $2.25 (Core) / $2.00 (Teams, 250 included)
  • Key insight: Normalized unit that abstracts away token costs, VM time, and inference — customers pay for “work time” not infrastructure
  • Growth: $1M → $73M ARR in 9 months
  • Source: Devin
  • Price: $1,000–$1,200/lawyer/month, 25–50+ seat minimums
  • Annual contracts: $30K–$300K+
  • Growth: Median seat count doubles within 12 months of deployment
  • Key insight: Seat-based works in regulated, high-value verticals where per-outcome measurement is complex
  • Source: Sacra
  • Price: ~$45–50+/user/month base, +$15/user/month for AI add-ons
  • Minimum: $50K–$60K annual contract (100+ seats)
  • PoC cost: Up to $70K
  • Key insight: Enterprise-first, quote-based — high ASP justified by “find anything in your company” value prop
  • Source: GoSearch

Technology Landscape

The “Service-as-Software” Thesis Explained

Traditional model (Software-as-a-Service):

Customer buys a tool → Customer is responsible for using it → Customer achieves outcome (or doesn’t)

New model (Service-as-Software):

Customer describes desired outcome → AI agent delivers the outcome → Customer pays for result

The key shift: responsibility for achieving the outcome moves from the customer to the vendor. Instead of QuickBooks (tool), you get an AI accountant (service). Instead of Zendesk (tool), you get Sierra (service).

Why This Matters for Pricing

FactorSaaS PricingService-as-Software Pricing
Value unitSeat / user accessOutcome / completed work
Revenue scales withHeadcount growthWork volume / automation rate
Customer riskHigh (pay even if unused)Low (pay only for results)
Vendor riskLow (predictable revenue)Higher (must deliver outcomes)
Margin model70–85% gross marginVariable (depends on cost per outcome)
Expansion motionAdd seatsIncrease automation scope
Churn riskHigh if underutilizedLow (tied to business value)

The Three Value Axes for Pricing Decisions

Per Chargebee:

  1. Value Attribution — How easily can customers tie agent outputs to outcomes? (High = outcome-based pricing viable)
  2. Execution Autonomy — How well does the agent solve problems without human-in-loop? (High = per-resolution pricing viable)
  3. Workload Predictability — How spiky is the effort per instance? (Low = usage-based better than flat-rate)

The Bessemer Hybrid Recommendation

Bessemer Venture Partners recommends hybrid models for most AI companies:

  • Base subscription ensures revenue predictability for both vendor and customer
  • Usage/outcome tiers capture upside as customer value grows
  • Best for: Uncertain or early markets, quantifiable verticals (legal, support, sales)
  • Real example: EvenUp and Legora (legal AI) use hybrid pricing in quantifiable verticals where AI output ties to specific outcomes

Pain Points & Gaps

For AI Agent Vendors

  1. Margin uncertainty with outcome-based pricing — Any work consumed by the agent in attempting a solution goes under-monetized when only successful outcomes are charged. Failed attempts still cost tokens.

  2. Definition of “resolution” is contentious — What counts as a successful resolution? Intercom charges when the customer doesn’t request more help; Zendesk requires full resolution without human intervention. Different definitions → different economics.

  3. Cost-of-goods-sold unpredictability — Complex queries might cost 10x more tokens than simple ones, but the charge is the same per resolution. Without robust cost attribution (see Agent Economics report), margins are opaque.

  4. Usage-based pricing surprises — 78% of IT leaders reported unexpected charges on SaaS due to consumption-based or AI pricing models (per Zylo). Trust erosion is a real risk.

For Enterprise Buyers

  1. Budget unpredictability — Hard to forecast costs when pricing varies with automation rate and resolution volume
  2. Vendor lock-in concerns — Outcome-based models often require deep integration, making switching costly
  3. Measuring true ROI — If one AI seat replaces 5 human seats but costs 3x a human seat, is that good? Depends on quality, reliability, and edge cases.
  4. “AI-washing” in pricing — Some vendors just add a chatbot overlay and call it “AI-powered” while charging premium prices

Underserved Segments

  • SMBs with <50 employees — Most AI agent platforms require $50K+ annual contracts. The $20–50/mo tier is thin.
  • Platform operators (like Moklabs) — Need pricing infrastructure (metering, billing) built into the orchestration layer, not bolted on
  • Vertical-specific pricing — Generic models don’t capture domain-specific value (legal resolution vs. support resolution vs. code task)

Opportunities for Moklabs

1. Built-in Usage Metering & Billing in Paperclip (High Impact / High Strategic Value)

Current state: Paperclip tracks agent budgets and costs but has no customer-facing billing/metering.

Opportunity: If OctantOS customers deploy agents for their end users, they need to meter and bill for agent work. Building metering primitives into Paperclip means:

  • Track ACUs (Agent Compute Units) per task/issue
  • Generate usage reports per customer/project
  • Support configurable pricing models (per-task, per-outcome, hybrid)
  • Export to billing systems (Stripe, Orb, Flexprice)

Why it matters: This is the “Stripe for AI agents” play — enabling OctantOS customers to monetize their agent deployments with built-in pricing infrastructure.

Estimated effort: 4–6 weeks for core metering, 2–3 months for full billing integration

2. Hybrid Pricing for OctantOS (High Impact / Medium Effort)

Recommended pricing structure (building on MOKA-57 proposal):

TierBase FeeIncludedOverage
Free$03 agents, 100 tasks/moN/A
Pro$39/user/mo10 agents, 1,000 tasks/mo$0.10/task
Enterprise$999/mo + $25/userUnlimited agents, 10,000 tasks/mo$0.05/task + volume discounts

Key design decisions:

  • Base fee anchors revenue predictability (Bessemer recommendation)
  • Per-task overage captures expansion without surprising customers
  • Task-based (not token-based) pricing shields customers from model cost volatility
  • Free tier drives adoption; conversion from free → pro at 100 tasks/mo limit

3. Outcome-Based Pricing Option for Vertical Deployments (Medium Impact / High Effort)

Opportunity: For specific verticals where outcomes are measurable (customer support, code generation, document processing), offer outcome-based pricing as an option:

  • Per-resolution for support agents
  • Per-completed-task for coding agents
  • Per-processed-document for document agents

Why it matters: Zendesk’s “dynamic pricing plan” proves that customers want to shift between seat-based and outcome-based during their contract. Building this flexibility into OctantOS would be a differentiator.

4. Pricing Analytics Dashboard (Medium Impact / Low Effort)

Opportunity: Help OctantOS customers understand their cost-per-outcome and optimize:

  • Cost per resolution/task/outcome breakdown
  • Agent efficiency metrics (cost vs. value delivered)
  • Pricing model comparison tool (“would outcome-based save you money?”)
  • Margin analysis for service-as-software businesses

5. “Devin-Style” ACU Abstraction for Agent Work (High Impact / Medium Effort)

Opportunity: Define an “Agent Work Unit” (AWU) for Paperclip that normalizes:

  • LLM token costs across providers
  • Compute time for tool execution
  • Storage and context costs

Customers think in “work done” not “tokens consumed.” This abstraction enables:

  • Predictable per-task pricing regardless of which models are used
  • Model routing optimizations invisible to the customer
  • Clean billing that non-technical buyers can understand

Risk Assessment

Market Risks

RiskLikelihoodImpactMitigation
Outcome-based pricing fails to scale (too many failed attempts eat margins)MediumHighHybrid model with base fee ensures minimum revenue
Race to bottom on per-resolution pricing ($0.99 → $0.49 → $0.10)MediumMediumCompete on platform capabilities, not price per resolution
Enterprise buyers resist variable pricing (prefer predictable budgets)MediumMediumOffer committed volume discounts (Zendesk model)
SaaS incumbents successfully add AI agents without changing pricingHighMediumMove fast — incumbents are slow to cannibalize per-seat revenue

Technical Risks

RiskLikelihoodImpactMitigation
Defining “outcome” is domain-specific and complexHighMediumStart with clear definitions (task completed, issue resolved)
Usage metering at scale adds latency and complexityMediumLowAsync metering with eventual consistency
Multi-model cost normalization is hardMediumMediumStart simple: cost = tokens × model_price, iterate

Business Risks

RiskLikelihoodImpactMitigation
Moklabs too early for monetization (need users first)HighHighFree tier first, pricing infrastructure as foundation for later
Building billing competes with core product workMediumHighStart with metering only — billing integrates with Stripe/Orb later
Pricing complexity confuses early customersMediumMediumDefault to simple hybrid, advanced options for enterprise

Data Points & Numbers

Pricing Benchmarks by Category

CategoryCompanyModelPrice Point
Customer ServiceIntercom FinPer resolution$0.99/resolution
Customer ServiceZendesk AIPer resolution$1.50–$2.00/resolution
Customer ServiceSierraPer resolution (custom)~$150K+/year
Customer ServiceAdaPer resolutionQuote-based
LegalHarveyPer seat (premium)$1,000–$1,200/lawyer/mo
LegalEvenUpHybridBase + outcome
CodingDevinPer ACU$2.00–$2.25/ACU (~15 min work)
CodingCursorSubscription + usage$20/mo (Pro), $200/mo (Ultra)
Enterprise SearchGleanPer seat (premium)$45–50+/user/mo
Platform/APIOpenAIPer token$2.50–$15/M input tokens
Platform/APIAnthropicPer token$3–$15/M input tokens

Key Financial Metrics

MetricValueSource
IT leaders with unexpected AI charges78%Zylo
Klarna AI assistant impactReplaced ~700 agentsMultimodal
AI agents ROI within year 174% of enterprisesAgility at Scale
Operational cost reduction with AI75% averageAgility at Scale
Customer willingness to pay for AI outcomesGrowing — 40% contracts outcome-based by 2026Gartner

The SaaSpocalypse Numbers

MetricValueSource
Software market cap lost (Feb 2026)~$2 trillionTechCrunch
Market cap erased in 7 days$1 trillion+Digital Applied
AI-native startups launched (18 months)ThousandsFoundation Capital
Global services market (AI target)$4.6 trillionFoundation Capital

Sources

Related Reports