·7 min read

AI for Work: Automate Without Breaking What Works

AI for Work: Automate Without Breaking What Works
Photo by Surja Sen Das Raj on Unsplash
Authors

AI for Work: Automate Without Breaking What Works

88% of organizations now use AI in at least one business function, according to McKinsey's 2025 State of AI report. Only about a third have started scaling it. That gap is not explained by resistance or skills alone. The real pattern hiding inside that data is this: most teams automate the wrong thing first, and the workflows they automate were already workarounds for problems nobody officially owns. This article is about how to identify that trap before you walk into it, and what to do instead.

Workflow Debt Is the Failure Mode Nobody Names

"Invisible workflow debt" doesn't appear in most AI adoption frameworks, but it describes the actual failure condition more accurately than anything else. Workflow debt is what accumulates when teams build manual processes around system limitations that were never fixed. A CSV exported from the CRM every Monday morning because the CRM can't talk to the ESP. A Slack message with order counts because the dashboard doesn't refresh fast enough. A copy-paste step between two platforms that were supposed to integrate three years ago.

These workarounds function. They become habits. They get documented as SOPs. Then someone proposes automating them with AI, the automation ships, and the team celebrates the efficiency gain. What actually happened is that the underlying system gap is now load-bearing inside an automated pipeline. When that gap breaks or changes, the failure isn't a person forgetting a manual step. It's a silent data error propagating through an agent that no one is watching.

McKinsey's research on agentic marketing deployments identifies system interoperability, not model design, as the more common limiting factor. The bottleneck was never the model. It was the data pipes that the model would have to connect. Bolt AI on top of broken pipes and you get faster broken output.

Before any automation decision, the useful question is why the workflow exists in its current form, not what it does. Why it does it that way. The answer often reveals a system constraint that automation will now depend on rather than resolve.

On-Brand But Off-Strategy: Where AI Writing Tools Actually Break Down

AI writing tools illustrate workflow debt in a specific and measurable way. Teams using tools like Jasper or Copy.ai frequently report output that reads correctly but misses the point strategically. Brand voice was documented and encoded. Decision logic was not.

Brand voice is a style parameter. It answers "how do we sound." Decision logic answers "who is this for, why are we saying it now, what do we want them to do and why would they." That second layer rarely lives anywhere official. It exists in the heads of the two or three people who have been doing the work long enough to have absorbed it. When you encode brand voice into an AI tool without encoding decision logic, the tool produces content that sounds like you but reasons like no one in particular.

Claude's Projects feature and ChatGPT's Custom Instructions exist precisely to close this gap, but only if you put the right content in them. A system prompt that says "we speak conversationally and avoid jargon" solves a style problem. One that says "our primary audience is operations managers in mid-size retail who already know the basics and are trying to justify a budget decision upward" solves a strategy problem. The second version requires someone to have actually written down the decision logic that normally lives in someone's head. For practical guidance on building prompts at that level, system prompts that actually work covers the structural choices that matter most.

A relevant data point here: Deloitte's "Global State of Generative AI" report (Q1 2025) found that 66% of organizations report productivity gains from AI, but only 20% report revenue growth, while 74% say revenue growth is what they were hoping for. The gap between productivity and revenue tracks almost exactly with the gap between style automation and decision automation.

Audit the Process Before You Touch the Tool

Starting an AI deployment with a process audit rather than a tool selection is what produces durable results. The sequence most teams follow is backwards: pick the tool, find the use case, train the team. Running that sequence in reverse changes the outcome.

A minimal audit asks three questions about any process you are considering automating. First, what system limitation is this process working around? If the honest answer is "none, this is genuinely the right way to do this," proceed. If the answer involves a platform that doesn't integrate, a report that doesn't exist, or a step someone added because of a one-time incident five years ago, stop and address that first.

Second, where does the decision logic for this process live? If it lives exclusively in one person's judgment and has never been written down, automating the process will surface that gap rather than fill it. You will get fast output with no quality floor.

Third, what breaks if this automated process produces wrong output silently? The risk profile of a human doing a task manually is different from an automated pipeline doing it at scale. A person making a bad judgment call on one email is recoverable. An agent applying that same bad judgment to 50,000 sends is not.

McKinsey's characterization of high-performing organizations as companies that "redesign workflows" rather than just deploy tools is not just strategic language. It describes a concrete operational practice. They ask these questions. Most companies skip them and wonder later why their pilots didn't scale.

From My Experience

In e-commerce operations, this pattern is genuinely ubiquitous. Workflows that seem most automatable are often the ones that exist because Magento and the marketplace feed don't agree on stock counts, or because the ESP doesn't pull from the catalog in real time so someone built a manual refresh step. Teams routinely propose AI agents for tasks that turn out to be compensating for a broken product feed synchronization. Automating the compensation loop makes the underlying problem less visible, not less real. Before any automation conversation, the question worth asking is whether the process solves a business problem or patches a system gap, because that answer changes everything about whether automation is the right next move.

FAQ

Why do AI pilots succeed in demos but fail to scale? Demo environments usually involve clean data, a single use case, and someone who understands both the AI tool and the underlying process. At scale, the data is messier, the use cases multiply, and the people running the process may not understand the assumptions baked into the original setup. McKinsey's 2025 State of AI report notes that only about a third of companies are scaling their AI programs despite near-universal adoption, which suggests the demo-to-production gap is the norm, not the exception.

Should I document SOPs before deploying AI tools? Documenting what the process does is useful but not sufficient. More valuable documentation captures why the process takes its current form, what decisions get made inside it, and what system constraints it is working around. Style and step-by-step documentation can be fed into Claude's Projects or similar features. Decision logic requires a different exercise: talking to the people who actually do the work and extracting the judgment calls that never got written down.

What is the right first AI use case for a non-technical team? Anything where the output is immediately reviewable by someone who knows what good looks like, the process logic is already explicit and documented, and the cost of a wrong output is low and reversible. First drafts of internal communications, summarization of meeting notes, and structured data extraction from unstructured inputs all meet these criteria. For non-technical teams looking to build the underlying fluency, AI literacy for non-technical teams is worth reading before picking a tool.

AI does not make bad processes good. It makes them faster and harder to see clearly. Audit what you are automating before you automate it, because the workarounds that have survived the longest are usually the ones that will cause the most damage when they finally break inside an automated pipeline.

By João Schuller — E-commerce Analyst & Product Owner. Draft generated with AI from the author's notes and experience; facts verified and text reviewed by João Schuller.
João Schuller
João Schuller

E-commerce Analyst & AI Builder

E-commerce Analyst & Product Owner at the largest flooring and tile retailer in Southern Brazil. 5 years in online retail working with Magento, VTEX, GA4, and Claude. Writes about practical AI for professionals who build things.

Read more about João →

0/1000