Syntherra.ai · Pricing · Managed Agents, Private LLMs, Consulting

Pricing

$0 to build.
Pay when it runs.

We build your AI agent for free. Pick the plan that fits how it runs: a shared LLM for speed, or a private LLM for full data control. Prefer to build in-house? Ask about consulting.

Monthly

Annual SAVE 20%

✦ Managed Agents: we build and host your AI agent on our cloud, connected to a shared LLM (Claude, GPT-4, Gemini). Usage-based, no upfront cost. Best for teams that want to ship fast without running their own infrastructure.

Starter

Light AI usage. Single-purpose agents with occasional LLM calls.

Starting at$99/mo

$0 upfront · shared LLM included

1 custom agent, fully built
Up to 500K LLM tokens/mo
5 GB storage · shared compute
Usage dashboard + spend caps
Evals + monitoring
Bug fixes and minor tuning
Multi-agent workflows
Uptime SLA

Get Started

Most Popular

Deploy + Agent

A dedicated single-tenant model plus a custom AI agent built on top. The full stack.

Starting at$999/mo

$0 upfront · dedicated model + agent

Mid-size open-weights model (e.g. Llama 3 70B, Mixtral, Qwen 32B)
1 custom agent built & deployed
RAG + vector store included
Light fine-tuning on your data
Priority build · 99.5% SLA
Dedicated Slack support
Deployment in your own cloud account

Get Started

Enterprise

Dedicated deployments for regulated industries with strict compliance and data-residency needs.

Custom pricing

Tailored to your compliance and scale needs

Any model: open-weights or closed-source via private endpoint (Claude, GPT-4, Gemini)
Hosted in our cloud or yours
Choice of region for data residency
Multiple agents + multi-agent workflows
Full fine-tuning + custom evals
SOC 2 / HIPAA / GDPR support
Dedicated account manager · 99.9% SLA

Talk to Sales

Deploy

Small model · Isolated endpoint · 1 deployment

Deploy + Agent

Mid model · Custom agent · RAG · 99.5% SLA

Enterprise

Any model · Region of choice · 99.9% SLA

Private LLM add-ons

Extend any deployment à la carte. Billed monthly.

Fine-tuning

Fine-tune your private model on proprietary data for better accuracy.

Extra throughput

Additional GPU capacity for higher concurrent request volume.

Extra agents

Additional custom agents built on top of your private model.

Priority support

1-hour SLA with a dedicated Slack channel and on-call engineer.

Prefer to build in-house? We consult too.

Time & materials, billed by the hour, day, or week for actual work delivered. No tiers, no fixed packages, no retainers required.

How it works

Hands-on AI agent guidance, billed for the time we actually work.

We scope the engagement together and bill only for hours delivered. Short reviews go hourly, focused sprints go daily, longer collaborations go weekly. Rates discount as the commitment grows. Stop or pause anytime.

What we typically help with

Agent architecture & production-readiness reviews
Managed-cloud-vs-private-LLM evaluation
Model & framework selection
Prompt & eval design
Embedded engineering on your codebase
Team workshops & training

Book a Scoping Call

Common pricing questions

Everything you need to know before committing.

How do you make money if the build is free? +

We make money on running the agent, not on building it. Both Managed Agents and Private LLM agents are billed monthly based on actual usage, covering inference, compute, and storage in one line item. Consulting is billed time & materials, hourly, daily, or weekly. Over the lifetime of the engagement, we earn back our build investment, and we're incentivized to keep your agent running efficiently for years.

What's the difference between Managed Agents and Private LLM? +

It's about the deployment, not the model. Managed Agents use a shared, multi-tenant LLM API: fastest to ship, billed by usage. Private LLM runs your model of choice on dedicated, single-tenant infrastructure we provision just for you, whether open-weights (Llama, Mistral, Qwen) or closed-source via private endpoint (Claude through Anthropic enterprise, GPT-4 through Azure OpenAI, Gemini through Vertex AI, etc.). Same cloud convenience, with full data isolation. Essential for regulated industries or sensitive IP.

Can I start on Managed Agents and switch to Private LLM later? +

Yes. You can upgrade any time. We'll provision the private model, port the agent logic over, and run evals so behavior stays consistent. Typical migration takes 1–2 weeks.

Can you just consult instead of building? +

Yes. Consulting is billed time & materials, by the hour, day, or week, so you only pay for actual work delivered. Common engagements include architecture reviews, model-selection guidance, eval design, and embedded engineering. A good fit for organizations with established engineering teams who want expert review or additional implementation capacity without handing over the entire build.

What if I want to move off Syntherra? +

You own your agent from day one: prompts, code, evals, fine-tuned weights, and data. We'll export everything at any time. There's no lock-in; we keep clients by being the best option, not by holding code hostage.

Is there a minimum contract length? +

Managed Agents and Private LLM plans require a 3-month minimum to cover our build investment. Annual plans get 20% off. Consulting is time & materials with no required minimum; stop or pause any time. After the plan minimum, you can cancel with 30 days' notice. No penalties.

$0 to build.Pay when it runs.

Describe your idea.We'll scope it today.

$0 to build.
Pay when it runs.

Describe your idea.
We'll scope it today.