What Your AI Vendor Isn't Telling You About Ongoing Costs
Riverstone Team
Riverstone Labs

Riverstone Team
Riverstone Labs

The implementation quote is the easy number. It fits on a purchase order, it has a start date, and it feels like closure. The harder numbers arrive later: monthly model and API bills that scale with usage, the hour here and there when an integration changes, and the quiet decay of accuracy as your business changes shape.
If you are evaluating AI automation for an Australian SME, you should treat these as normal operating costs, not surprises. Vendors who avoid the topic are not saving you money—they are deferring an argument.
Most production systems that read or write text at volume will incur per-token or per-request charges from a model provider, sometimes plus hosting or orchestration fees. The range can be modest for light workflows and surprisingly steep for document-heavy pipelines or always-on classification.
The mistake is to price the happy path demo. Price your path: peak Monday volume, longest attachments, retries on errors, and any logging you keep for auditability. Ask for a monthly band at stated volumes, not “we’ll optimise later.” Optimisation helps—it does not erase physics.
Integrations break when upstream vendors ship API changes. Webhooks fail when certificates expire. A workflow that “passed UAT” can still halt because a supplier changed a PDF layout.
Someone must own monitoring: alerts when queues stall, daily or weekly summaries of throughput and failure reasons, and a path to fix regressions. That owner might be internal, or it might be a support retainer—but if it is nobody, the system rots in place.
Training data is never static. Product lines change. Promotions change. Customer language changes. The classifier you tuned in one quarter will be wrong more often six months later if nobody retrains thresholds, updates examples, or adjusts prompts.
This is not a moral failure of AI. It is operations. Budget time or money for periodic review the same way you would for any business-critical workflow—because that is what it is.
Maintenance is notoriously variable, but boards like a line item. A practical planning band used in many software contexts is 15–25% of the initial build cost per year for monitoring, small fixes, and incremental improvements—more if you are expanding scope, less if the workflow is narrow and stable. Treat it as a budget placeholder to stress-test affordability, not a promise.
If the project only “pays back” when maintenance is assumed at zero, the project is not funded honestly.
Ask plainly:
Answers should be specific enough to write into a responsibility matrix. Handwaving here tends to match handwaving in delivery.
Also clarify intellectual property and portability. If you part ways with a vendor, do you keep prompts, tests, and integration code in a repo you control? Exit costs are part of TCO, even if they never appear on a monthly invoice.
Finally, align on incident response. When the system misroutes a payment request or mislabels a complaint, who is paged, what is rolled back, and how is the customer corrected? Runbooks are not glamorous; they are what separate a bad afternoon from a bad quarter.
Cash flow discipline is unforgiving. A system that saves labour but introduces unpredictable run costs can still be worth it—but only if the total cost of ownership was visible when you decided.
Finally, tie costs to risk. Low-stakes internal classification can absorb more automation variance than customer-facing replies or payment-adjacent workflows. Your monitoring budget should scale with downside, not with excitement about the model.
When comparing two quotes, normalise to cost per successfully completed item in production—after human review—not cost per line of integration code. Two implementations can price similarly while one burns ten times the review hours because thresholds were set wrong.
Ask for a ninety-day operating plan in plain language: expected volumes, expected API spend band, review queue expectations, and what triggers a scope change. If the vendor cannot write that paragraph, they are not ready to run your production workflow.
Riverstone Labs builds with handover, monitoring, and optimisation in scope so production systems stay production systems. If you want help pressure-testing a vendor quote—or comparing apples with apples—book a free assessment.
Service capability:
Want this implemented in your business? Book a Diagnose call — free 30-minute consultation, no pitch.
Book a free 15-minute assessment. We'll look at your operations and identify the highest-ROI automation opportunities.
Book your free assessment