The most common mistake people make when configuring coding agents isn’t giving them too little instruction. It’s giving them too much of the wrong kind — in the wrong places.
I’ve watched teams spend weeks crafting elaborate agent instructions — specifying exact tools, prescribing file-by-file workflows, mandating specific libraries for every decision. And I’ve watched those same agents produce brittle, mediocre output that falls apart the moment something unexpected happens.
Then I’ve seen a different approach — where the instructions read more like an engineering handbook than a script — and the results are dramatically better. The difference isn’t volume. It’s knowing which parts of agent behavior deserve rigid specificity and which ones deserve principles. Most people get this backward.
The Specificity Trap (Where It Hurts)
Here’s what it looks like when specificity is applied to the wrong layer of agent instructions:
“Use the `fs` module to read the config file at `./config/app.json`. Parse the JSON. Check if the `database.host` field exists. If it doesn’t, throw an error with the message ‘Missing database host’. Use `pg` to create a connection pool with max 10 connections…”
This feels thorough. It feels safe. You’re telling the agent exactly what to do, so what could go wrong?
Everything. Because you’ve turned a reasoning engine into a script runner.
When the config file moves, the agent breaks. When the project uses YAML instead of JSON, the agent breaks. When the database is MongoDB instead of Postgres, the agent breaks. It can’t adapt because you never told it what you were actually trying to accomplish — you only told it which levers to pull.
Worse, you’ve consumed the agent’s context window with mechanical instructions, leaving less room for the agent to reason about what actually matters: the intent behind the code.
What Principle-Based Instructions Look Like
Compare the above with this:
“Configuration must be validated at startup. The application should fail fast with clear error messages if required values are missing. Database connections should be pooled and bounded. Never silently swallow configuration errors.”
Notice what changed. There are no tool names. No file paths. No library choices. Instead, there are architectural principles: fail fast, validate early, bound resources, surface errors.
A capable coding agent receiving these instructions will:
- Find the config file wherever it lives
- Use whatever parsing library matches the format
- Choose the right database driver for the stack
- Implement connection pooling appropriate to the ORM in use
- Write meaningful error messages
And critically, it will do all of this in a way that’s consistent with the existing codebase, because you’ve freed it to observe and adapt rather than follow a rigid script.
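As a concrete illustration, here is a sketch of the kind of code a capable agent might produce from those four sentences alone. The names and file layout are assumptions for the example, not part of the instructions — the point is that fail-fast validation, clear errors, and a bounded pool all emerge from the principles:

```typescript
import * as fs from "node:fs";

interface AppConfig {
  databaseHost: string;
  poolMax: number;
}

// Validate configuration at startup; a malformed or missing file
// stops the application immediately rather than failing later.
function loadConfig(path: string): AppConfig {
  const raw = JSON.parse(fs.readFileSync(path, "utf8"));
  const host = raw?.database?.host;
  if (typeof host !== "string" || host.length === 0) {
    // Fail fast with a clear message — never silently swallow this.
    throw new Error(`Missing database host in ${path}`);
  }
  // Bound the connection pool instead of letting it grow unchecked.
  const poolMax = Number(raw?.database?.poolMax ?? 10);
  return { databaseHost: host, poolMax };
}
```

If the project used YAML, a different config path, or a different database, the same principles would still apply — only the mechanical details would change, and the agent supplies those from the codebase.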
Why This Works: Agents Are Reasoners, Not Runners
The reason principle-based instructions outperform specific ones — for the judgment layer — comes down to how modern coding agents actually work.
A coding agent is, at its core, a reasoning system that operates over a context window. It reads your codebase, understands patterns, and generates code that fits. It’s not executing your instructions like a shell script — it’s interpreting them within the context of everything else it knows.
When you give it specific instructions (“use library X, call function Y”), you’re overriding its ability to reason. You’re replacing its judgment — which is the thing you’re paying for — with your own prescriptive choices, which may or may not match the reality of the codebase.
When you give it principles (“errors should be explicit, not silent”), you’re informing its judgment. You’re giving it a framework for making decisions, not making the decisions for it.
This mirrors how effective engineering organizations work. The best teams don’t have 200-page procedure manuals telling developers which functions to call. They have architectural decision records, design principles, and coding standards that guide judgment. A senior engineer who understands “we prefer composition over inheritance” will make better decisions across a thousand situations than one who was handed a flowchart for ten.
The same is true for agents. An agent that understands your architectural principles will make better decisions across an entire codebase than one that was given step-by-step instructions for a handful of tasks.
The Three Layers That Actually Matter
When restructuring agent instructions, think in three layers:
1. Architectural Principles
These are the non-negotiable beliefs your codebase is built on. They don’t change between tasks.
- “Prefer pure functions. Side effects should be explicit and contained.”
- “Every public API endpoint must validate its input before processing.”
- “State changes flow in one direction. Components never mutate shared state directly.”
- “Test behavior, not implementation. Tests should survive refactors.”
These create a worldview that shapes every decision the agent makes.
2. Quality Criteria (State, Not Action)
Instead of saying “run ESLint” or “add error handling,” describe the state the code should be in when the work is done:
- “All exported functions have explicit return types.”
- “Error states are represented in the type system, not as thrown exceptions.”
- “No function exceeds 40 lines.”
- “Database queries are parameterized. No string concatenation in SQL.”
These are binary, testable conditions. The agent can verify them itself. And because they describe outcomes rather than steps, the agent is free to reach those outcomes however the current codebase demands.
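The “error states in the type system” criterion, for instance, can be satisfied with a tagged-union result type. This is a minimal sketch — the `Result` shape and `parsePort` function are illustrative assumptions, not a prescribed implementation:

```typescript
// A discriminated union: the compiler forces callers to check `ok`
// before touching `value` or `error`, so failures can't be ignored.
type Result<T, E> =
  | { ok: true; value: T }
  | { ok: false; error: E };

// Returns a Result instead of throwing, putting the failure state
// in the type signature where every caller must confront it.
function parsePort(input: string): Result<number, string> {
  const n = Number(input);
  if (!Number.isInteger(n) || n < 1 || n > 65535) {
    return { ok: false, error: `invalid port: ${input}` };
  }
  return { ok: true, value: n };
}
```

Whether the agent reaches this state with a hand-rolled union, a library, or some other mechanism is its call — the criterion only describes the destination.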
3. Anti-Patterns (What Must Not Happen)
The most underused category. Telling an agent what to avoid is often more powerful than telling it what to do:
- “Never store secrets in code or configuration files committed to version control.”
- “No synchronous I/O on the request path.”
- “Never catch an exception and do nothing with it.”
- “No circular dependencies between modules.”
Anti-patterns create guardrails without constraining creativity. The agent can build whatever solution it wants — it just can’t violate these boundaries.
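The “never catch an exception and do nothing with it” guardrail still leaves the agent free to choose how errors are handled — it only forbids making them vanish. A hypothetical sketch (`parseManifest` is an invented name for illustration):

```typescript
// Guardrail-compliant error handling: if code must catch, it either
// handles the error or rethrows it with added context.
function parseManifest(raw: string): Record<string, unknown> {
  try {
    return JSON.parse(raw);
  } catch (e) {
    // The anti-pattern would be `catch (e) {}` — the failure vanishes.
    // Rethrowing with context keeps the error visible to the caller.
    throw new Error(`manifest is not valid JSON: ${(e as Error).message}`);
  }
}
```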
The Compound Effect
Here’s the insight most people miss: these three layers don’t just help with individual tasks. They compound across your entire workflow.
An agent that has internalized your architectural principles will write new code that fits the existing codebase without being told to. It will choose the same patterns, the same error handling style, the same naming conventions — not because you specified them for this task, but because the principles naturally guide it there.
Specific instructions, on the other hand, are disposable. “Use the fs module to read config” helps with exactly one task. “Configuration must be validated at startup and fail fast on missing values” helps with every task that touches configuration, forever.
This is the difference between training an agent and scripting one. But it’s only half the picture.
When Specifics Are Exactly Right
There’s a category of agent behavior where principles fail and specificity is essential. Understanding where that line falls is what separates a good agent configuration from a great one.
Consider these two types of instruction:
“Errors should be surfaced clearly to the user.”
“Every response must begin with a status header. Phase announcements must fire in order. Output format must include sections A, B, and C with these exact delimiters.”
The first is a principle. The second is a protocol. And the second should be specific, because protocol isn’t a place where you want the agent exercising judgment. You don’t want it creatively reinterpreting your output format. You don’t want it deciding that today, status headers aren’t important.
This distinction — judgment vs. protocol — is the real line that separates where principles belong from where specifics belong.
What Counts as Protocol
Protocol is anything where consistency matters more than adaptability:
- Output format. If downstream systems, dashboards, or human workflows depend on structured output, specify the exact structure. The agent’s job is to produce it reliably, not reinvent it.
- Phase progression. If your agent follows a multi-step workflow (gather context → plan → execute → verify), the sequence should be explicit. The agent shouldn’t skip steps because it feels confident.
- Integration contracts. API calls to external services, webhook payloads, deployment commands — these have no room for creative interpretation. The endpoint is what it is.
- Safety gates. “Always ask before deleting files” isn’t a principle to internalize — it’s a rule to follow mechanically, every time, without exception.
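Because protocol is mechanical, it can also be checked mechanically. A hypothetical sketch of an output-format gate — the section names are invented for the example, not a real agent’s spec:

```typescript
// The exact headers a response must contain, in this exact order.
const REQUIRED_SECTIONS = ["## Status", "## Changes", "## Verification"];

// Protocol check: every required section present, in sequence.
// No judgment involved — the output either conforms or it doesn't.
function followsProtocol(output: string): boolean {
  let cursor = 0;
  for (const header of REQUIRED_SECTIONS) {
    const idx = output.indexOf(header, cursor);
    if (idx === -1) return false;
    cursor = idx + header.length;
  }
  return true;
}
```

A check like this can run on every agent response, which is exactly why protocol instructions should be specific enough to verify.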
What Counts as Judgment
Judgment is anything where the right answer depends on context the agent discovers at runtime:
- Which design pattern fits this module
- How to handle an error in this specific function
- Whether to refactor or patch
- Which testing strategy covers this case
- How to name things consistently with the existing codebase
These are the decisions where principles outperform scripts — because the agent has information you didn’t have when you wrote the instructions.
The Refined Rule
Script the protocol. Principle-ize the judgment.
Most agent configurations get this backward. They script the judgment (“use library X for task Y”) while leaving protocol vague (“format the output nicely”). The result: an agent that makes rigid decisions about things that should be flexible, and flexible decisions about things that should be rigid.
The fix is to audit your instructions and sort each one into two buckets:
| | Protocol | Judgment |
|---|---|---|
| Should be | Specific, exact, non-negotiable | Principle-based, adaptive, contextual |
| Because | Consistency matters more than creativity | Context matters more than consistency |
| Example | “Output must include a verification section with pass/fail for each criterion” | “Verification must produce evidence, not just assertions” |
| Failure mode when wrong | Agent reinvents your workflow every run | Agent ignores codebase context to follow your stale script |
The protocol column keeps your agent predictable where predictability matters. The judgment column keeps it intelligent where intelligence matters. Together, they produce an agent that’s both reliable and adaptive — which is what you actually want.
Making the Shift
If you’re currently running an agent with heavily prescriptive instructions, here’s how to migrate:
Step 1: Extract principles from your specifics. Look at your existing instructions and ask: “What belief about good software does this specific instruction encode?” The instruction “use try/catch around every database call” encodes the principle “database operations must have explicit error handling.” Write down the principle. You’ll decide what to delete in Step 4.
Step 2: Describe states, not steps. Rewrite procedural instructions as conditions. “Run the linter after making changes” becomes “All code conforms to the project’s lint configuration.” The agent will figure out how to make the state true.
Step 3: Add anti-patterns from your code review history. Look at the last 20 pull request comments from your team. The recurring feedback — “don’t do X,” “we never Y,” “this should be Z” — those are your anti-patterns. Codify them.
Step 4: Sort specifics into protocol vs. judgment. Don’t blindly delete all specific instructions. Instead, ask: “Is this governing a judgment call or a protocol?” If it’s judgment — which library to use, how to structure code, what pattern to follow — replace it with a principle. If it’s protocol — output format, phase sequence, integration contracts, safety gates — keep it specific. Make it more specific.
Step 5: Remove tool references from the judgment bucket. For everything you’ve identified as judgment, delete mentions of specific tools, libraries, and file paths. The agent will discover these from the codebase. If it can’t, your codebase has a discoverability problem worth fixing independently.
The Bigger Picture
The shift from specific instructions to architectural principles isn’t a universal rule — it’s a sorting rule. The real skill is knowing which category each instruction belongs to.
Protocol needs precision. Judgment needs principles. Most agent configurations over-script judgment (making the agent rigid where it should be adaptive) and under-script protocol (making the agent inconsistent where it should be reliable). Flip that, and everything improves.
The best agent configurations I’ve seen read like two documents stitched together: a tight operational spec for how the agent should behave — its workflow, its output format, its safety constraints — and a loose engineering handbook for how it should think — its design principles, its quality standards, its anti-patterns.
Give your agent a worldview for judgment and a rulebook for protocol. You’ll get an agent that’s both reliable and intelligent — which is the combination that actually matters.