BUILD PLAYBOOKS & PRODUCTION LESSONS

The AI Agent Blog

Practical writing on shipping AI agents to production. From a team that has shipped 80+ agents across fintech, SaaS, and enterprise tech.

Build vs Buy | 8 min read

AI Agent vs RPA vs Workflow Automation: Which Should You Use?

RPA replays mouse clicks. Workflow automation runs predefined branches. AI agents reason about novel situations. Each fits a different problem — and choosing wrong wastes budget and burns trust with your team.

James Perkins

Build Playbook | 7 min read

Why 95% of AI POCs Never Reach Production

Most AI proofs-of-concept die for predictable reasons that have nothing to do with the underlying model. Here are the five killers we see most often, and how to design around each one before you start.

James Perkins

Cost & ROI | 9 min read

The Real Cost of an AI Agent: Beyond OpenAI Tokens

The token bill is the smallest line item. Here is the full cost model for a production AI agent — what to budget, what surprises teams, and where the real money goes.

James Perkins

Build vs Buy | 7 min read

Build vs Buy: When to Build Your Own AI Agent (and When Not To)

Build when the workflow is your moat, the data is sensitive, or no vendor sells what you need. Buy everywhere else. Here is the decision framework that has saved our clients millions in over-built bespoke agents.

James Perkins

Case Study | 8 min read

5 AI Agent Failures We've Seen in Production (and How to Avoid Them)

Five concrete production failures we have personally encountered or rescued — what went wrong, what shipped to fix it, and what to put in place from day one to avoid each.

James Perkins

Build Playbook | 6 min read

How to Run a 1-Week AI Agent Pilot That Actually Ships to Production

Most AI pilots stall in evaluation purgatory. The teams that actually ship use a tight 5-day playbook with a non-negotiable scope, an eval set written before the prompt, and a production-readiness gate baked in from day one.

James Perkins

Build Playbook | 6 min read

AI Agent Frameworks Compared: LangChain vs CrewAI vs Custom

We've built production agents on all three. Here is when to use LangChain, when CrewAI is the better call, and when both are wrong and you should write the agent loop yourself in 200 lines of Python.

James Perkins

Build Playbook | 6 min read

What Founders Get Wrong About AI in Their First Year

Common patterns we see when founders first try to use AI in their company. Most of the mistakes are predictable — and most are correctable in a single conversation.

James Perkins

Case Study | 7 min read

The AI Customer Support Agent Playbook (What Sierra, Decagon, and Klarna Got Right)

Klarna reported that its AI assistant handles the workload of about 700 full-time support agents. Sierra and Decagon are powering some of the largest support orgs in tech. Here is the pattern they share, and how to apply it without the public stumbles others have had.

James Perkins

Build Playbook | 6 min read

Vibe-Coding vs Building Production AI Agents: When Each Makes Sense

Vibe-coding is real, useful, and producing actual revenue. So is rigorous production AI agent engineering. The two patterns serve different goals — and conflating them is how teams waste budgets.

James Perkins