Skip to content
Back to Blog

Building in Public: An Open Letter to Agency Owners Sitting on the AI Sidelines

AgencyBoxx Team
Building in Public: An Open Letter to Agency Owners Sitting on the AI Sidelines

We are going to tell you almost everything. This is what building in public as an AI agency looks like when you decide that transparency is the strategy, not the risk.

How we built an AI operations system that runs 20 agents across 75+ agency clients. How it processes 700+ email actions a day. How it monitors every shared inbox every 60 seconds. How it enforces time tracking compliance, enriches prospects, drafts replies in six different voice modes, catches SLA breaches before they happen, and heals itself when services crash. How the whole thing runs on a $2,000 machine for about a dollar a day in AI costs.

We are going to share the architecture, the agent roster, the trust model, the cost structure, the mistakes we made, and the lessons we learned the hard way. Not in vague terms. In specifics. With real numbers from a real production system serving real clients.

This is not an accident. It is a strategy. And this post explains why.

Key takeaway: Sharing the full system architecture openly is a strategic advantage, not a competitive risk, because the moat is 200+ hours of implementation, 50,000+ lines of production code, and 12 months of refinement that no blog post can replicate.

The Secret Sauce Is Not a Secret

Anyone can download OpenClaw today. It is open source. The framework is free. The documentation is public. Claude Code can handle most of the setup. A Mac Studio with decent specs is the only hardware investment. The technical barrier to getting a basic agent running has never been lower.

Anyone can sign up for Make.com and build automations. Anyone can read the HubSpot API documentation. Anyone can set up Ollama and run local models. Anyone can spin up a ChromaDB instance and start indexing documents. None of the individual components we use are proprietary, secret, or hard to find.

So why share all of this openly?

Because the components are not the moat. The implementation is.

We have spent 200+ hours building, breaking, fixing, and refining this system. 50,000+ lines of production code across 200+ custom Python files. 50+ always on services that have been stress tested on real client data for over 12 months. Nine agents that have been through multiple code audits, security remediations, and architectural overhauls. A knowledge base with 33,700+ indexed chunks that took months to curate, clean, and organize. A correction learning loop with 457+ learned email domains and per client voice calibration data that accumulates with every human edit.

None of that can be replicated by reading a blog post. The ideas are not rare. People who have actually implemented them at production scale on real agency clients are.

That is why we can talk about all of it. The moat is not secrecy. The moat is having already done the work. As David Ward, Meticulosity's founder, puts it: "We're the HubSpot agency for agencies. We help agencies say yes to every deal."

5 reasons building in public creates competitive advantage:

  1. Category definition. The first team to talk openly and specifically about a new capability gets to define the category. Everyone who follows is reacting to your framing.
  2. Trust at scale. Prospects, partners, and platforms trust teams that show their work. Vague claims about "AI-powered operations" carry no weight compared to published architecture and real numbers.
  3. Talent attraction. Engineers and operators want to work on interesting problems. Publishing those problems publicly is the most effective recruiting signal in the market.
  4. Feedback velocity. Every published detail invites scrutiny, questions, and suggestions from practitioners who have solved adjacent problems. The feedback loop accelerates refinement.
  5. Implementation moat. The more openly you share, the more obvious it becomes that knowing the architecture is not the same as having built it. Transparency makes the execution gap visible, which strengthens your position rather than weakening it.

The Window Is Open Right Now

We have been in agency operations for 17 years. Twelve of those as a HubSpot partner. We have seen technology shifts come and go: the COS becoming CMS Hub becoming Content Hub, the rise of inbound, the automation wave, the MarTech consolidation cycle. In every one of those shifts, there was a window where early movers built lasting advantages and everyone else spent years trying to catch up.

This is one of those windows.

AI agents for agency operations are not speculative. They are not a bet on future technology. They are running in production today, on real clients, delivering measurable results. The agencies that figure this out in 2026 will operate at a fundamentally different cost structure and capacity level than the ones that wait until 2028 to start experimenting.

"The agentic AI market will reach $196.6 billion by 2034 at 43.8% CAGR. The window for early movers is closing." -- Fortune Business Insights

The trajectory is not a question. The question is which agencies will be positioned to ride it.

The math is simple. Our system recovers 47 to 82 hours per week of operational capacity. At a $75/hour blended agency rate, that is $183,000 to $319,000 in annual value. Equivalent to 1.5 to 2.5 full time operations hires that never take sick days, never need onboarding, and get better at their jobs every week through the correction learning loop.

An agency that has this capacity advantage today compounds it over 12, 24, 36 months. Their delivery costs go down. Their response times go up. Their team focuses on strategy and relationships instead of email triage and time tracking compliance. Meanwhile, their competitors are still assigning a senior PM to manually check who logged time yesterday.

The window will not stay open forever. OpenClaw is gaining traction. Other frameworks are emerging. The agency owners who are currently saying "we are not touching that with a ten foot pole" will eventually come around. When they do, the agencies that started 18 months earlier will have the production data, the refined workflows, the calibrated voice models, and the institutional knowledge base that newcomers will spend months building from scratch.

First movers do not win because they have better technology. They win because they have more time under load.

Why Other Agency Owners Are Scared

We brought up AI agents at an agency owner mastermind group recently. Eighty to a hundred agency owners, all self identified as AI forward. When we asked who was experimenting with OpenClaw or local AI agents, the room went quiet. Not one hand went up. The prevailing sentiment was: too risky, too complex, too much unknown.

We get it. The fear is not irrational. It comes from real concerns:

"What if the AI sends something wrong to a client?" Valid concern. That is why nothing sends without human approval. The entire architecture is designed around the principle that AI agents draft and humans approve. The fear of a rogue email is eliminated by design, not by hope. We wrote an entire breakdown of why we never let agents send emails.

"What if client data leaks between accounts?" Valid concern. That is why the system enforces data isolation at the database level with namespace separation, blocklist scanning, and identity injection. Cross client contamination is not prevented by being careful. It is prevented by architecture.

"What if it breaks and nobody notices?" Valid concern. That is why a watchdog service monitors every running service every 60 seconds, auto restarts failures, and escalates to a critical alert channel when something cannot self heal. The system is designed to assume things will break and contain the damage automatically.

"I do not have 200 hours to build this." Valid concern. You probably do not. And you should not have to. That is exactly the problem AgencyBoxx exists to solve. But even if you never use our product, the point stands: the risks that scare people are engineering problems with engineering solutions, not existential threats to the business.

"I tried the AI built into HubSpot/ClickUp/whatever and it was not that impressive." That is because baked in AI is a general purpose feature designed for millions of users. It cannot be optimized for your specific workflows, your knowledge base, your client communication style, or your operational processes. Purpose built agents with controlled context and model routing produce dramatically better results. The gap between "I tried Breeze and it was meh" and "I have 20 custom agents running my operations" is the gap between a feature and a system.

Every one of these concerns has a concrete, production tested answer. The agencies that move past the fear are the ones that realize the risk of inaction is larger than the risk of experimentation.

The Risk of Doing Nothing

The conversation about AI in agencies is usually framed as: what is the risk of adopting AI? What could go wrong?

Nobody asks the opposite question: what is the risk of not adopting AI?

Here is what that risk looks like over the next 24 months:

Your competitors will serve more clients with fewer people. An agency running AI operations can handle 75+ clients with a lean team. An agency doing everything manually needs to hire proportionally. When a prospect asks for a quote and one agency comes in 30% lower with faster turnaround, they are not undercutting on quality. They are operating at a different cost structure.

Your best people will burn out on work they should not be doing. Senior PMs checking time tracking logs. Founders spending 90 minutes a day on email triage. Client experience managers manually scanning 30 inboxes every morning. These are high value people doing low value work because nobody built the system to handle it. AI agents do not eliminate jobs. They eliminate the parts of jobs that make talented people want to quit. We wrote about that distinction in detail: we save 400+ hours a month and have not fired a single person.

Your institutional knowledge will remain trapped. Every time a senior employee leaves, they take years of client history, process knowledge, and relationship context with them. A knowledge base captures that information before it walks out the door. An agency without one rebuilds institutional knowledge every time someone leaves.

Your response times will fall behind. An agency with AI monitoring every inbox every 60 seconds and drafting SLA responses at the 4 hour mark will always respond faster than one relying on humans to notice that an email has been sitting unanswered. Clients will not know why one agency feels more responsive. They will just notice the difference.

Your prospecting pipeline will stay thin. The agencies that automate BDR work will have a steady stream of enriched, validated prospects flowing into their pipeline while everyone else is toggling between LinkedIn and Hunter.io when they can find the time, which is never.

"The cost of inaction compounds silently. By the time you notice the gap, the agencies that moved first have 18 months of production data you cannot buy." -- David Ward, CEO of Meticulosity

The risk of AI adoption is a bad email that gets caught by the approval framework. The risk of not adopting AI is a slow, compounding erosion of competitive position that is invisible until it is too late to recover.

What "Building in Public" Means for Us

We are sharing this not because we are generous. We are sharing this because we believe the first people who talk openly about what they have built are the ones who build equity in the market.

The ideas behind AI agency operations are not proprietary. Two agencies that have never communicated independently built nearly identical systems. The problems are universal. The solutions are converging. Keeping it all quiet does not protect an advantage. It just means someone else gets to be the one who talks about it first.

AgencyBoxx serves 60+ agency clients representing approximately 7% of English-speaking tiered HubSpot partners globally. That is not a projection. That is production data from a system that has been running for over a year. The scale validates the approach: this is not a prototype or a proof of concept. It is an operating platform.

So here is what we are going to do:

Building in public as an AI agency means we are going to keep publishing detailed breakdowns of how this system works. The agent architectures. The cost optimization strategies. The trust model. The security layers. The knowledge base design. The lessons we learned from a year of production operation.

We are going to share real numbers. Not projections or estimates. Production metrics from a system running on 75+ clients today.

We are going to be honest about what does not work yet, what is still on the roadmap, and where we got it wrong. Building in public means showing the dents, not just the polish.

And we are going to do this knowing that some readers will take what they learn and build their own systems. That is fine. That is the point. The more agencies operating with AI powered operations, the better the entire ecosystem gets. Competition on execution quality is healthy. Competition on who can hide information the longest is not.

The Invitation

If you are an agency owner who has been circling the AI agent conversation for months, this is the moment to stop circling.

You do not need to build what we built. You do not need to spend 200 hours on implementation. You do not need a computer science degree or a machine learning background. You need to understand that the agencies around you are starting to operate differently, and the gap between "AI assisted" and "manual everything" is going to widen faster than most people expect.

Start small. Get a time tracking agent running. Automate your email triage. Index your HubSpot documentation into a local knowledge base. Pick one operational pain point and solve it. See what 15 minutes a day of recovered time feels like. Then pick another. To understand why 95% of agency owners will not build this, and why that creates the opportunity, the reasoning is worth reading.

Or skip the build phase entirely and talk to us. We already did the work. Twelve months of production refinement. 75+ clients. 50,000+ lines of code. Battle tested, self healing, human approved. That is what AgencyBoxx is. Learn more about us and see the agent roster behind the platform.

Either way, stop sitting on the sidelines. The window is open. The tools are available. The playbook is right here.

The only thing you cannot do is wait and still be first.

Frequently Asked Questions

Why build in public as an agency?

Building in public creates market equity. The first people who talk openly and specifically about what they have built are the ones who define the category. In agency AI operations, the components are all open source and publicly available. The moat is not the technology; it is having already implemented it at production scale. Sharing the details does not erode the advantage because the advantage is 200+ hours of refinement, 50,000+ lines of code, and 12+ months of production data that cannot be replicated from a blog post. The agencies that talk about their work attract clients, partners, and talent. The ones that stay quiet miss the window.

Is sharing your AI operations strategy a competitive risk?

No. Two agencies that have never communicated independently built nearly identical systems, which proves the ideas are not proprietary. The problems agencies face are universal, and the solutions are converging. Keeping the approach secret does not protect an advantage because competitors will arrive at the same architecture through their own experimentation. What cannot be replicated quickly is the production data, the per-client voice calibration, the 457+ learned email domains, and the institutional knowledge base with 33,700+ chunks. Those assets compound over time and create a widening gap regardless of who knows the architecture.

What does it take to build AI operations from scratch?

Our system required 200+ hours of development, 50,000+ lines of custom Python code, and 12+ months of production refinement across 75+ agency clients. The hardware investment was approximately $2,000 CAD for a Mac Studio. The ongoing AI operating cost is about $2.50 per day. The non-obvious costs are the debugging, the security remediations, the architectural overhauls, and the correction learning data that only accumulates through months of real client operations. This is why AgencyBoxx exists: to eliminate the build phase for agencies that want the capability without the 200-hour investment.

How do you create a new category in agency technology?

You solve a problem that nobody else is talking about yet, and you do it with enough specificity that the market cannot ignore it. For us, that problem was AI-powered agency operations at production scale. The category did not exist 18 months ago. We created it by building a working system, running it on real clients, measuring the results, and publishing the details. Category creation requires being early enough that the space is open, specific enough that the value proposition is clear, and transparent enough that the market trusts the claims. Building in public is not just a marketing strategy. It is how new categories get established.

AgencyBoxx is an AI operations platform built inside a real agency over 12+ months on 75+ real clients. We are building in public because the moat is the work, not the secrecy. Book a Walkthrough to see what a year of production refinement looks like.