Insights — Practical AI essays for Canadian teams

Q: How do I know if my team is ready to adopt AI?

Three signals: (1) leaders can describe the workflow they want to change in plain English, (2) one named person on the team is willing to own the system after launch, and (3) you can produce twenty real examples of the work the AI will do. If any of these is missing, start with a literacy workshop before a build.

▎ FEATURED DEEP READ

// project lifecycle month 1 ─── build ● month 2 ─── pilot ● month 3 ─── live ● month 4 ─── usage ● month 5 ─── usage ● month 6 ─── usage ◐ month 7 ─── usage ○ ← fails here month 8 ─── usage ○ month 9 ─── shelved month 12 ─── replaced // 73% of AI rollouts fail by month 7 // the predictor is at month 0, not month 6

Fig. 1 · A typical post-launch decay curve. The fix happens before the build, not after the failure.

Field essay · 9 min read · Updated May 2026 · By the Applied AI North team

The Six-Month Rule.

Most AI projects fail at month seven. Not because the model regressed, the API moved, or the team lost interest — but because the question of who pays the cost of the system working was never named at month zero. A short diagnostic for AI projects in production.

Adoption Project scoping Change management

Read the essay ↗

§ 01Comparison Guides

Side-by-side, no fence-sitting.

Long, structured comparisons of the tools, models, and patterns we use daily. Each one ends with a recommendation, not a shrug. Updated whenever a new model release moves the verdict.

Comparison · 14 min · Updated Apr 2026

Claude Sonnet 4.5 vs GPT-5 vs Gemini 2.5 Pro

Which model wins which job, measured against four production workloads: structured extraction, multi-step agentic tool use, long-document Q&A, and cost-per-task at scale.

TaskSonnet 4.5GPT-5Gemini 2.5

Extraction●●●●●○●●○

Agentic tools●●●●●●●●○

Long-doc Q&A●●○●●○●●●

Cost / 1M tok$3.00$5.00$1.25

BenchmarksCost analysisProduction

Read the comparison ↗

Comparison · 11 min · Updated Mar 2026

n8n vs Make vs Zapier for AI workflows

When to reach for which workflow tool when LLM steps are part of the equation. Self-hosting, branching logic, vendor lock-in, and the breakpoint where a workflow tool stops being the right answer.

Best forn8nMakeZapier

Self-host✓——

Branching logic●●●●●●●●○

App library●●○●●○●●●

Cost @ scale$$$$$$

Workflow toolsAutomation

Read the comparison ↗

Taxonomy · 9 min · Updated Feb 2026

Concierge, reference, operator: the three agent shapes

Most AI “agents” are one of three things in a trench coat. Naming them correctly is the first decision that affects scope, price, and evaluation strategy. A field taxonomy with examples.

CONCIERGECustomer-facing chat. Optimized for time-to-resolution.
REFERENCEInternal Q&A over docs. Optimized for citation accuracy.
OPERATORMulti-step task agent. Optimized for tool-call reliability.

TaxonomyAgents

Read the taxonomy ↗

Comparison · 10 min · Updated Jan 2026

LangGraph vs Vercel AI SDK vs raw OpenAI Agents

Three approaches to building a production agent loop. Each one optimizes for something different: control, ergonomics, or velocity. The one we reach for most often, and why.

Optimized forLangGraphVercelOpenAI

Control●●●●●○●●○

DX●●○●●●●●○

Time to first run●○○●●○●●●

FrameworksEngineering

Read the comparison ↗

§ 02How-tos & Playbooks

Steps you can follow, not theory.

Written as numbered steps with the assumption that you will actually try them. Pulled from real engagements, with the parts that don't work crossed out.

PLAYBOOK

§ 03Concepts & Definitions

A working glossary.

Short definitions of the terms we actually use in client conversations, written so a non-technical operator can use them in a meeting on the same day. Each entry links to a longer explainer.

DEFINITION

What is applied AI?

Applied AI is the use of language models, retrieval, and agentic workflows to change how specific work actually gets done inside a business — as a production system that measurably saves time or adds throughput, not as a demo or a chatbot. The emphasis is on adoption by the people doing the work.

Longer explainer ↗

DEFINITION

What is context engineering?

Context engineering is the deliberate design of what information a model sees at the moment of a request — system prompt, retrieved documents, tool descriptions, history, examples. It accounts for more of output quality than prompt wording does. Most production AI failures are context failures, not prompt failures.

Longer explainer ↗

DEFINITION

What is an eval harness?

An eval harness is the code and data that lets you measure an AI system's quality against a labeled test set, automatically, on every change. It is the closest thing AI engineering has to a unit-test suite. Without one, every change is a guess and every regression is a surprise.

Longer explainer ↗

DEFINITION

What is human-in-the-loop?

Human-in-the-loop (HITL) is a system design where an AI produces a draft and a human reviews, edits, or approves it before the outcome is committed. The trigger is usually a confidence threshold: above the floor, ship; below, route to a person. HITL is how production AI stays both fast and trusted.

Longer explainer ↗

DEFINITION

What is anchor prompting?

Anchor prompting is a pattern where the full instruction set lives in an uploaded reference document, and chat prompts invoke it by name. This stabilizes long agent sessions, reduces context rot, and makes instructions easier to version and update than inline prompts.

Longer explainer ↗

DEFINITION

What is context rot?

Context rot is the degradation of model output quality over a long conversation as the context window fills with irrelevant or contradictory information. Symptoms include rule-following drift, repeated answers, and ignored constraints. The fix is structured truncation, not a bigger window.

Longer explainer ↗

DEFINITION

What is a confidence floor?

A confidence floor is the model-reported probability threshold below which a response is routed to a human reviewer instead of being committed automatically. Typical values are 0.80–0.95. The right floor depends on the cost of a wrong answer relative to the cost of a human review.

Longer explainer ↗

DEFINITION

What is AI literacy?

AI literacy is a working understanding of what current AI tools can and cannot do, how to use them effectively for a specific job, and how to recognize when their output is wrong. It is closer to a research skill than a technical one, and it is the highest-impact training a non-technical team can do this year.

Longer explainer ↗

§ 04Field Essays

Longer, opinionated, from inside the work.

The essays we write when something repeats often enough across engagements to be worth naming. Roughly two a month. No newsletter funnel, no listicle quotas.

May 2026

The Six-Month Rule

Most AI projects fail at month seven because the question of who pays the cost of the system working was never named at month zero. A short diagnostic for the project brief.

Essay · 9 min

Apr 2026

Evals are the deliverable

The reason we write the test set before the agent. A field guide to building eval harnesses that actually steer the project, plus the three failure modes of eval sets that look correct on paper.

Essay · 12 min

Mar 2026

Adoption is the project

A taxonomy of three failure modes that kill AI rollouts after launch: the pilot purgatory, the leadership-line gap, and the trust deficit. What to instrument to catch each one early.

Essay · 15 min

Mar 2026

The case against discovery phases

Why every fourteen-week discovery we've seen could have been a one-week assessment and a courageous “no.” What clients actually need at the start of a project, and what they're being sold instead.

Essay · 7 min

Feb 2026

Stop calling it a copilot

The word has lost any specific meaning. A more useful three-way taxonomy — concierge, reference, operator — that changes how scope, evals, and pricing land in the same conversation.

Essay · 6 min

Dec 2025

What a real handoff looks like

The five artifacts every AI engagement should produce, the test for whether the handoff actually transferred ownership, and the one question that exposes a fake one in thirty seconds.

Essay · 8 min

Nov 2025

Context is the moat, not the prompt

Why the durable advantage in applied AI is the quality of context you can assemble at the moment of a request — not the cleverness of the wording. Three exercises to find where your context is leaking quality.

Essay · 10 min

§ 05Canada & Compliance

Sovereign, funded, federally aligned.

A growing pillar of this index, because most of our clients are here. Plain-English writing on Canada's AI policy, the Voluntary Code of Conduct, the federal Guide on Agentic AI, data sovereignty, and the funding programs that can offset the cost of adoption. We write this section to translate the “AI for All” strategy into systems that get used, owned, and funded — with conditional framing and program terms you should verify before you act.

On the growth & funding track ↗ On the risk & compliance track ↗

Feb 2026

PIPEDA, AIDA, and the Canadian SMB in 2026

What actually governs SMB AI projects today — PIPEDA, Quebec Law 25, and the Voluntary Code of Conduct — now that the proposed AIDA lapsed with Bill C-27 (Jan 2025) and new federal rules are being shaped under the “AI for All” strategy. The three steps to take this quarter, regardless of how the legislation lands.

PIPEDAAIDACompliance

Essay · 11 min · Updated Feb 2026

Jan 2026

Data residency for Canadian AI: a vendor checklist

Which model providers offer Canadian or US-only data-residency, what to ask for in a data-processing agreement, and the small set of patterns that lets sensitive client data stay inside your own infrastructure.

ResidencyProcurement

Reference · 7 min · Updated Jan 2026

Forthcoming

A plain-English guide to Canada's Voluntary Code of Conduct

The six elements of the Voluntary Code of Conduct on the responsible development of advanced generative AI — Accountability, Safety, Fairness, Transparency, Human Oversight, and Validity & Robustness — read as a checklist you can align a real system to, not a press release.

Voluntary CodeGovernance

Guide · In progress

Forthcoming

Bounded Autonomy: the federal Guide on Agentic AI, explained

The four autonomy levels in the federal Guide on the Use of Agentic AI, why we build Level 1–2 assistive systems with read-only access by default, and how recoverability — a literal kill switch and immutable logs — defends against automation drift and prompt injection.

Agentic AIBounded autonomy

Guide · In progress

Forthcoming

Which federal programs actually fund AI adoption?

A working map of the programs an adoption project may be eligible for — BDC LIFT for financing, the Canada-Ontario Job Grant (COJG) for training cost, and the Productivity Super-Deduction for written-off automation assets — each subject to current program terms and eligibility.

BDC LIFTCOJGSuper-Deduction

Reference · In progress

Forthcoming

Data sovereignty: keeping AI workflows on Canadian soil

How we architect for Canadian data residency — deployable on Canadian cloud regions (e.g. AWS Canada Central) and Canadian foundation models such as Cohere — so proprietary data and IP stay onshore, aligned with PIPEDA, Quebec Law 25, and AIDA readiness.

SovereigntyData residencyPIPEDA

Reference · In progress

§ 06FAQ

Questions we keep answering.

Short, direct answers to the questions that come up on the first call with new clients. Each answer is forty to sixty words because that is what survives extraction by an answer engine. Same questions, longer answers, live across the rest of this page.

What is applied AI?

Applied AI is the use of large language models, retrieval systems, and agentic workflows to change how specific work actually gets done inside a business — not as a demo or a chatbot, but as a system in production that measurably saves time, reduces errors, or adds new throughput.

How long does a typical AI project take?

Most production-grade applied-AI engagements run four to eight weeks end-to-end. A one-week assessment scopes the work; a two-to-four-week build delivers a working pass; a final pilot and adoption phase hardens the system and trains the team. Projects that run longer usually have an organizational problem, not a technical one.

How much does an AI engagement cost?

Published price bands at Applied AI North range from CA$5,500 for a one-week assessment to CA$72,000 for a full agent build with monitoring and adoption support. Most clients combine two or three engagements over six to nine months, averaging CA$25,000 to CA$85,000 in total. Workshops start at CA$3,500.

What is the difference between an agent and a chatbot?

A chatbot responds to messages. An agent decides what to do, executes tool calls, observes the results, and loops until a task is complete. Agents have memory, planning steps, and side-effects: they send emails, write to databases, or call APIs. Chatbots produce text; agents produce outcomes.

How do you measure AI return on investment?

Measure the workflow the AI replaces, not the AI itself. Track time-per-task before and after, error rates against a labeled ground-truth set, human-review percentage, and adoption rate (daily active users among the intended audience). A 92% accurate system used by the whole team beats a 99% accurate system nobody trusts.

What is context engineering?

Context engineering is the deliberate design of what information an AI model sees at the moment of a request — the system prompt, retrieved documents, tool descriptions, conversation history, and structured examples. It accounts for more of the output quality than prompt wording does. Most production AI failures are context failures, not prompt failures.

What is an eval set?

An eval set is a labeled collection of input-output examples used to measure how well an AI system performs against your real data, before and after every change. For a document-extraction agent, that means 200 to 2,000 historical documents with the correct answers attached. The eval set is the deliverable; the agent is the side-effect of building it.

Are AI projects different for Canadian businesses?

Yes. We build sovereign, pro-worker systems engineered to align with the frameworks actually in force today — PIPEDA, Quebec Law 25, and Canada's Voluntary Code of Conduct on generative AI — and built for readiness for the federal AI regulation signalled under the 2026 “AI for All” strategy. (The proposed Artificial Intelligence and Data Act, AIDA, was part of Bill C-27, which died at prorogation in January 2025; there is no federal AI statute in force right now.) Practically: we architect for Canadian data residency, keep agentic systems to bounded, human-in-the-loop autonomy, and structure work so it may qualify for federal productivity funding — subject to current program terms.

What is human-in-the-loop?

Human-in-the-loop (HITL) is a system design where an AI generates a draft or decision but a human reviews, approves, or edits it before the outcome is committed. The trigger is usually a confidence threshold: if the model is above 0.86, ship; below, route to a person. HITL is how production AI stays accurate and trusted.

How do I know if my team is ready to adopt AI?

Three signals: leaders can describe the workflow they want to change in plain English; one named person on the team is willing to own the system after launch; and you can produce twenty real examples of the work the AI will do. If any of these is missing, start with a literacy workshop before a build.

Subscribe

Two pieces a month. No tracking, no funnel.

RSS or email, your call. Plain text. We will never sell your address, and we will not send anything that isn't writing.

or grab the RSS: /feed.xml ↗

Reading with a project in mind? Pick a track: Growth & Funding ↗ · Risk & Compliance ↗

Practical AI writing for people who have to use it.

The Six-Month Rule.

Side-by-side, no fence-sitting.

Claude Sonnet 4.5 vs GPT-5 vs Gemini 2.5 Pro

n8n vs Make vs Zapier for AI workflows

Concierge, reference, operator: the three agent shapes

LangGraph vs Vercel AI SDK vs raw OpenAI Agents

Steps you can follow, not theory.

How to write your first eval set

A six-hour AI literacy curriculum for non-technical teams

Twelve cost-control levers for production AI agents

Confidence floors: the math, the bug, and the fix

A two-week build playbook for AI-aware websites

A working glossary.

What is applied AI?

What is context engineering?

What is an eval harness?

What is human-in-the-loop?

What is anchor prompting?

What is context rot?

What is a confidence floor?

What is AI literacy?

Longer, opinionated, from inside the work.

The Six-Month Rule

Evals are the deliverable

Adoption is the project

The case against discovery phases

Stop calling it a copilot

What a real handoff looks like

Context is the moat, not the prompt

Sovereign, funded, federally aligned.

PIPEDA, AIDA, and the Canadian SMB in 2026

Data residency for Canadian AI: a vendor checklist

A plain-English guide to Canada's Voluntary Code of Conduct

Bounded Autonomy: the federal Guide on Agentic AI, explained

Which federal programs actually fund AI adoption?

Data sovereignty: keeping AI workflows on Canadian soil

Questions we keep answering.

What is applied AI?

How long does a typical AI project take?

How much does an AI engagement cost?

What is the difference between an agent and a chatbot?

How do you measure AI return on investment?

What is context engineering?

What is an eval set?

Are AI projects different for Canadian businesses?

What is human-in-the-loop?

How do I know if my team is ready to adopt AI?

Two pieces a month. No tracking, no funnel.