$0 to build.
Pay when it runs.

We build your AI agent for free. Pick the plan that fits how it runs: a shared LLM for speed, or a private LLM for full data control. Prefer to build in-house? Ask about consulting.

Billing: Monthly or Annual (save 20%)
Managed Agents: we build and host your AI agent on our cloud, connected to a shared LLM (Claude, GPT-4, Gemini). Usage-based, no upfront cost. Best for teams that want to ship fast without running their own infrastructure.
Starter
Light AI usage. Single-purpose agents with occasional LLM calls.
Starting at $99/mo
$0 upfront · shared LLM included
  • 1 custom agent, fully built
  • Up to 500K LLM tokens/mo
  • 5 GB storage · shared compute
  • Usage dashboard + spend caps
  • Evals + monitoring
  • Bug fixes and minor tuning
  • Multi-agent workflows
  • Uptime SLA
Get Started
Scale
Teams running multiple agents with enterprise-grade requirements.
Custom pricing
Tailored to your usage and scale
  • Unlimited agents
  • Custom token allocation
  • Isolated infra · custom RAM
  • Custom LLM provider routing
  • 99.9% uptime SLA
  • Dedicated account manager
  • SOC 2 / compliance support
  • White-label option
Talk to Sales
Starter: 500K tokens · Shared · 1 agent
Growth: 5M tokens · Dedicated · Up to 3 agents · 99.5% SLA
Scale: Custom tokens · Isolated · Unlimited · 99.9% SLA
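As a rough way to read the comparison above, the sketch below picks the smallest Managed Agents plan whose token and agent limits cover an expected workload. The limits come from the plan summary; the `Plan` class and `smallest_plan` function are hypothetical names used only for illustration, and the sketch ignores add-ons, infrastructure type, and SLA requirements.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    name: str
    max_tokens: Optional[int]  # monthly LLM tokens; None means a custom allocation
    max_agents: Optional[int]  # None means unlimited agents


# Limits taken from the plan comparison, ordered smallest to largest.
PLANS = [
    Plan("Starter", 500_000, 1),
    Plan("Growth", 5_000_000, 3),
    Plan("Scale", None, None),
]


def smallest_plan(tokens_per_month: int, agents: int) -> str:
    """Return the first plan whose limits cover the workload (assumption: limits are hard caps)."""
    for plan in PLANS:
        fits_tokens = plan.max_tokens is None or tokens_per_month <= plan.max_tokens
        fits_agents = plan.max_agents is None or agents <= plan.max_agents
        if fits_tokens and fits_agents:
            return plan.name
    raise ValueError("no plan fits this workload")
```

For example, `smallest_plan(300_000, 2)` returns `"Growth"` because Starter covers only one agent. In practice the Extra tokens add-on may make a smaller plan workable, so treat the result as a starting point for the scoping call.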
Managed Agent add-ons
Extend any plan à la carte. Billed monthly, cancel anytime.
Extra tokens
Additional LLM tokens beyond your plan's monthly allocation.
Extra storage
Persistent storage for files, vectors, or database overages.
Extra compute
Dedicated worker node for high-concurrency agent workloads.
Priority support
1-hour response SLA with a dedicated Slack channel.
🔒 Private LLM: a dedicated, single-tenant model deployment we provision and host for you. Open-weights (Llama, Mistral, Qwen) or closed-source via private endpoint (Claude, GPT-4, Gemini through Azure OpenAI, AWS Bedrock, or Vertex AI). Not a public API, not shared with any other customer. Flat monthly fee based on model and throughput. Ideal for regulated industries, sensitive IP, and strict data-residency needs.
Deploy
A dedicated, single-tenant model in isolated cloud infrastructure. Ready to use.
Starting at $499/mo
$0 upfront · dedicated inference endpoint
  • 1 dedicated model deployment
  • Small open-weights model (e.g. Llama 3 8B, Mistral 7B, Qwen 7B)
  • Isolated cloud inference endpoint
  • Single-tenant; never shared with other customers
  • Monitoring + usage logs
  • Custom agent built on top
  • Fine-tuning on your data
Get Started
Enterprise
Dedicated deployments for regulated industries with strict compliance and data-residency needs.
Custom pricing
Tailored to your compliance and scale needs
  • Any model: open-weights or closed-source via private endpoint (Claude, GPT-4, Gemini)
  • Hosted in our cloud or yours
  • Choice of region for data residency
  • Multiple agents + multi-agent workflows
  • Full fine-tuning + custom evals
  • SOC 2 / HIPAA / GDPR support
  • Dedicated account manager · 99.9% SLA
Talk to Sales
Deploy: Small model · Isolated endpoint · 1 deployment
Deploy + Agent: Mid model · Custom agent · RAG · 99.5% SLA
Enterprise: Any model · Region of choice · 99.9% SLA
Private LLM add-ons
Extend any deployment à la carte. Billed monthly.
Fine-tuning
Fine-tune your private model on proprietary data for better accuracy.
Extra throughput
Additional GPU capacity for higher concurrent request volume.
Extra agents
Additional custom agents built on top of your private model.
Priority support
1-hour SLA with a dedicated Slack channel and on-call engineer.
Consulting: time & materials, billed by the hour, day, or week for actual work delivered. No tiers, no fixed packages, no retainers required.
How it works
Hands-on AI agent guidance, billed for the time we actually work.
We scope the engagement together and bill only for hours delivered. Short reviews go hourly, focused sprints go daily, longer collaborations go weekly. Rates discount as the commitment grows. Stop or pause anytime.
What we typically help with
  • Agent architecture & production-readiness reviews
  • Managed-cloud-vs-private-LLM evaluation
  • Model & framework selection
  • Prompt & eval design
  • Embedded engineering on your codebase
  • Team workshops & training
Book a Scoping Call
Everything you need to know before committing.
How do you make money if the build is free?
We make money on running the agent, not on building it. Managed Agents are billed monthly based on usage; Private LLM is a flat monthly fee based on model and throughput. Either way, inference, compute, and storage are covered in one line item. Consulting is billed time & materials: hourly, daily, or weekly. Over the lifetime of the engagement, we earn back our build investment, and we're incentivized to keep your agent running efficiently for years.
What's the difference between Managed Agents and Private LLM?
It's about the deployment, not the model. Managed Agents use a shared, multi-tenant LLM API: fastest to ship, billed by usage. Private LLM runs your model of choice on dedicated, single-tenant infrastructure we provision just for you, whether open-weights (Llama, Mistral, Qwen) or closed-source via private endpoint (Claude through Anthropic enterprise, GPT-4 through Azure OpenAI, Gemini through Vertex AI, etc.). Same cloud convenience, with full data isolation. Essential for regulated industries or sensitive IP.
Can I start on Managed Agents and switch to Private LLM later?
Yes. You can upgrade at any time. We'll provision the private model, port the agent logic over, and run evals so behavior stays consistent. A typical migration takes 1–2 weeks.
Can you just consult instead of building?
Yes. Consulting is billed time & materials, by the hour, day, or week, so you only pay for actual work delivered. Common engagements include architecture reviews, model-selection guidance, eval design, and embedded engineering. A good fit for organizations with established engineering teams who want expert review or additional implementation capacity without handing over the entire build.
What if I want to move off Syntherra?
You own your agent from day one: prompts, code, evals, fine-tuned weights, and data. We'll export everything at any time. There's no lock-in; we keep clients by being the best option, not by holding code hostage.
Is there a minimum contract length?
Managed Agents and Private LLM plans require a 3-month minimum to cover our build investment. After the minimum, you can cancel with 30 days' notice and no penalties. Annual plans get 20% off. Consulting is time & materials with no required minimum; stop or pause anytime.
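To make the commitment math concrete, here is a minimal sketch of the arithmetic. It assumes the 3-month minimum and the 20% annual discount both apply to the listed "starting at" monthly price, which is a floor and may be higher for your usage; the function names are hypothetical.

```python
from decimal import Decimal

MIN_MONTHS = 3                     # plan minimum stated in the answer above
ANNUAL_DISCOUNT = Decimal("0.20")  # "Annual plans get 20% off"


def minimum_commitment(monthly_price: Decimal) -> Decimal:
    """Smallest total spend on monthly billing: price times the 3-month minimum."""
    return monthly_price * MIN_MONTHS


def annual_total(monthly_price: Decimal) -> Decimal:
    """Twelve months at the monthly price, less the annual discount (assumed to apply uniformly)."""
    return (monthly_price * 12 * (1 - ANNUAL_DISCOUNT)).quantize(Decimal("0.01"))


starter = Decimal("99")   # Managed Agents Starter, starting price
deploy = Decimal("499")   # Private LLM Deploy, starting price

print(minimum_commitment(starter))  # 297
print(annual_total(starter))        # 950.40
print(minimum_commitment(deploy))   # 1497
print(annual_total(deploy))         # 4790.40
```

On these assumptions, Starter billed annually comes to $950.40 versus $1,188 for twelve months billed monthly. Add-ons and usage above the plan allocation are not included.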
Ready to build?

Describe your idea.
We'll scope it today.

No cost. No commitment. Just a 30-minute call and a written brief within 24 hours.