AI Business TransformationBusiness Ops
Our ApproachInsightsStart a Conversation
AI Strategy

From Design to Deployment: Why the Level 3 to Level 4 Transition Breaks Every Playbook Your Organization Has Used Before

By Shawn Plaster, Founder & CEO, Plaster Group

Article 17 of 27 — Plaster Group's AI Business Transformation Methodology

Domain OwnersCIOsCAIOCEOCSOVPsDirectorsLevel 3 to Level 4Transition
14 min read

This article is part of a 27-article series on the AI Business Transformation Methodology. This piece describes why the transition from Level 3 design to Level 4 deployment is structurally different from any design-to-build transition the organization has experienced, and what that means for how leadership approaches it.

Plaster Group Five-Level AI Business Transformation Methodology — Strategy, Transformation Imperatives, Workflow Transformation, AI Enablement, Continuous Transformation, with feedback loop from Level 5 back to Level 1.

The readiness gate from Article 16 has been passed. The domain owner's Level 3 deliverables are complete, quality-gated, and confirmed by the CIO's team as sufficient to build on. The Level 1 triad has committed the resources. For the first time in the transformation, the CIO's organization is about to become the primary execution engine. Technology budgets have been approved. Implementation teams are being mobilized.

Every executive in the room has been here before. They have managed ERP implementations, digital transformation programs, and large-scale systems deployments. They have navigated the transition from business requirements to technical build dozens of times across their careers. They know what a design-to-build handoff looks like, how systems integrators work, what a fit-gap analysis produces, and how a cutover plan unfolds.

That experience is about to become their most dangerous liability.

The research on AI scaling failures does not point to technology as the primary cause. McKinsey's Rewired research, based on more than 200 at-scale transformations, found that 72% of companies stall at the scaling stage, and the root cause is overwhelmingly organizational rather than technical.1 Gartner's data shows 42% of AI initiatives failed in 2025, more than double the failure rate from just one year earlier.2 BCG found that 74% of companies have not achieved tangible value from their AI investments, with only 5% generating substantial value at scale.3 These are not organizations that lacked AI technology. They are organizations that applied familiar transformation playbooks to an unfamiliar kind of transformation and discovered too late that the playbook did not fit.

This article explains why.

What Leaders Know from Previous Transformations — And Why It Does Not Transfer

The design-to-build transition in traditional transformation programs follows a pattern that Fortune 500 leaders know well. In an ERP implementation, the technology is selected first. The organization chooses SAP or Oracle, contracts a systems integrator, and then enters a design phase where business requirements are documented and evaluated through fit-gap analysis: which requirements does the selected technology already handle, and which require customization? The design is adapted to the technology's capabilities. The systems integrator builds what the blueprint specifies. The system behaves deterministically. Testing validates that the configuration matches the design. Training happens on the actual configured system during user acceptance testing. Go-live is a planned cutover event.

This playbook works for ERP because ERP systems are deterministic. Configure an approval workflow with a $10,000 threshold, and every transaction above that threshold triggers the approval. Every time. Predictably. The design-to-build handoff is fundamentally a document transfer: the business team produces the blueprint, the technical team builds to specification, and deviations from the blueprint are defects to be corrected.

Leaders who have succeeded with this playbook naturally default to it. McKinsey's research on ERP transformations documents the pattern: stage-gate reviews, blueprint governance, milestone-based delivery, and systems integrator accountability tied to deliverable completion.1 It is an effective framework for deterministic technology. The problem is that AI is not deterministic technology, and the transition from Level 3 to Level 4 is structurally different in ways that undermine every assumption the traditional playbook depends on.

Six Differences That Change the Nature of This Transition

The differences between AI business transformation and traditional transformation at this junction are not differences of degree. They are differences of kind. Each one has practical implications for how leadership approaches Level 4.

Technology selection follows design, rather than preceding it. In traditional transformations, the technology constrains the design. In this methodology, Level 3 produced workflow designs unconstrained by any specific technology's limitations, because the designs describe what the human-AI collaboration should look like rather than what a particular system can do. The CIO's team now selects technology to serve those designs. This inversion means that technology evaluation at Level 4 must be conducted against the specific workflow specifications Level 3 produced, not against generic capability demonstrations from vendors. The CIO cannot evaluate AI platforms in the abstract. The evaluation criteria come from the workflow designs.

The system produces probabilistic outputs rather than deterministic results. An ERP system does exactly what its configuration specifies. An AI system produces outputs that vary based on inputs, context, and the model's learned patterns. When an AI system reviews a customer's full service history to recommend a retention strategy, it synthesizes patterns across hundreds of interactions and may recommend proactive outreach for one customer and a pricing adjustment for another with similar account profiles, because the AI identified contextual differences that a rules-based system would treat identically. The traditional fit-gap analysis does not translate because you cannot test every possible scenario against a system whose behavior is not predetermined. Evaluating whether an AI technology is sufficient for a workflow design requires judgment about the technology's capabilities in the specific context the workflow demands, which is a fundamentally different kind of assessment than checking configuration against requirements.

Graduated human oversight has no precedent in previous technology deployments. The governance framework established in Article 7 and operationalized into the workflow designs at Level 3 specified different levels of human oversight for different risk classifications. The CIO's implementation team must translate these into technical controls: automated monitoring, escalation triggers, override mechanisms, and audit trails calibrated to the specific risk level of each AI-enabled workflow step. No ERP implementation has ever required this. The governance specifications from Level 3 are direct technical requirements for the implementation team, not aspirational principles to be considered later.

Integration connects AI to systems it must interpret, not merely exchange data with. Traditional system integration connects applications that pass structured data through defined interfaces. AI integration connects to systems where the AI must interpret unstructured or semi-structured information, reason about it, and take actions that may vary based on context. The research identifies integration with legacy systems as the single most cited technical barrier to AI scaling: production environments require connecting to enterprise systems that were never designed to support the kind of interaction AI requires.4 The workflow designs from Level 3 specified what data the AI needs and what actions it takes. The CIO's team must now determine how to provide that access within the constraints of the existing technology estate.

The first deployment will require iteration, and that iteration is expected. In traditional transformations, a gap between the design and the deployed system is a defect. In AI business transformation, the first deployment will reveal differences between how the AI behaves in the workflow design and how it behaves in production with real data, real edge cases, and real user interactions. The Level 3 designs were the best possible designs given what was known at the time. Production will surface what could not be known until the AI system encountered the actual operating environment. This means the domain owner cannot hand off the deliverables and walk away. They must remain engaged through the iteration cycle, because only they can determine whether a proposed adjustment to the original design is an acceptable technical accommodation that preserves the transformation intent or a compromise that undermines it.

Workforce competence means exercising judgment, not following procedures. Operating within an AI-enabled workflow requires a fundamentally different kind of competence than operating within an ERP system. ERP training teaches procedures: enter this data, click this approval, follow this sequence. AI training teaches judgment: evaluate whether this AI output is reliable, decide when to override, understand the boundaries of the AI's capabilities in your specific domain. Every parallel track entering Level 4 must account for this difference. The training that develops system-specific competence, the governance controls that enforce oversight requirements, the security architecture that protects production systems, and the data infrastructure that feeds the AI all must be designed for a workforce that exercises judgment over probabilistic systems rather than follows procedures within deterministic ones.

How the Organization Reconfigures at This Transition

Level 3 was led by the domain owner with the CAIO's department as the primary advisory partner. Level 4 shifts the center of gravity to the CIO's organization, which owns technology selection, integration architecture, deployment, and technical operations. This is a partnership reconfiguration, not a handoff where one side walks away.

The CIO's organization becomes the implementation engine. Articles 18 and 19 cover their Level 4 mandate in detail. What matters at the transition is that the CIO's team approaches Level 4 with the understanding that they are implementing technology to serve workflow designs that were produced through months of careful business transformation work, not building systems from a traditional requirements document.

The domain owner shifts from leading the design work to partnering on implementation. Their ongoing accountability is the iteration decisions that will arise when the deployed AI behaves differently than the design anticipated. The domain owner is the only person who can evaluate whether the transformation's business intent is being preserved through those adjustments. This accountability does not diminish at Level 4. It changes form.

The CAIO's department shifts from leading education and workflow advisory to ensuring cross-domain consistency as implementation proceeds. Their embedded translators, who participated in the domain's Level 3 work, serve as the bridge between the domain owner's business understanding and the CIO's technical execution. During technology selection specifically, the CAIO's team works hand-in-hand with the CIO's organization in an advisory role, helping evaluate whether what AI vendors are demonstrating or claiming is realistic for the specific workflow designs the domain produced at Level 3.

Every parallel track shifts gears simultaneously. Governance moves from framework to technical controls. Data moves from readiness assessment to architecture. Security activates for production. Training shifts from fluency building to system-specific competence development. The transition is not a single handoff between two parties. It is a coordinated shift across the entire transformation infrastructure, and it requires the same deliberate orchestration that Level 3 demanded.

The Iteration Principle

The single most important mindset shift for leaders entering Level 4 is accepting that iteration is built into the process rather than being a sign that something went wrong.

In traditional transformations, the goal is to deploy the system as designed. Deviations trigger change requests, and change requests carry cost and schedule implications that the organization treats as failures to be minimized.

The iteration gap in AI business transformation does not arise because the workflow designs were poorly done. It arises because the workflow redesign teams designed the best possible workflows based on their understanding of what AI capabilities could deliver, and the technology selection process identified solutions whose vendors represented they could meet those requirements. In practice, the technology may not perform the specified capability in the manner the workflow design requires. This is common in the current AI landscape, where vendor claims frequently outpace what the technology can reliably deliver in production at enterprise scale. A capability that performs well in a demonstration environment may struggle with the volume, complexity, or edge cases that the actual workflow encounters. When this gap surfaces, the domain owner and CIO's team must work together to determine whether the workflow can be adjusted to accommodate the technology's actual capabilities without compromising the transformation intent, or whether a different technology solution is needed that can deliver what the workflow requires.

The methodology's approach is different. The Level 3 designs represent the organization's best understanding of what the human-AI collaboration should look like. The first deployment tests that understanding against reality. Where reality confirms the design, the implementation proceeds. Where reality reveals that the design assumed AI capabilities or behaviors that do not hold in production, the domain owner and CIO's team adjust together, preserving the transformation intent while accommodating the technical reality. BCG's research on organizations that successfully scale AI reinforces this approach: leaders who succeed "fundamentally redesign workflows" and then iterate based on what deployment reveals, rather than treating the initial design as immutable.3

This iteration partnership between the domain owner and the CIO's team is what prevents the organization from falling into either trap. It is also what makes Level 4 fundamentally different from the traditional build phase, where the systems integrator builds to specification and the business team waits for delivery. In AI business transformation, the business and technology teams work together through deployment because the nature of the technology demands it.

The transition from Level 3 to Level 4 is the moment when the 70% of the work that determines AI success meets the 30% that makes it operational. The articles that follow describe how to navigate each dimension of Level 4: technology selection, integration architecture, deployment, data architecture, training, and measurement. The research is clear that most organizations fail at this junction. The methodology is designed to ensure yours does not, but only if leadership approaches Level 4 with the understanding that this is a different kind of deployment than anything the organization has done before.

Sources

  1. 1.McKinsey, Rewired: The McKinsey Guide to Outcompeting in the Age of Digital and AI, 2023 (based on 200+ at-scale AI transformations). 72% stall at scaling; stage gate review process; blueprint governance; design-first organizations outperform https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/rewired-to-outcompete
  2. 2.Gartner, 2025. 42% of AI initiatives failed in 2025, up from 17% in 2024; at least 50% of GenAI projects abandoned after proof of concept; 72% of CIOs report breaking even or losing money on AI investments https://www.gartner.com/en/articles/genai-project-failure
  3. 3.BCG, "The Widening AI Value Gap," September 2025 (1,250+ firms). 74% have not achieved tangible value; only 5% generate substantial value at scale; future-built companies deploy in 9-12 months vs 12-18 months; leaders "fundamentally redesign workflows" then iterate https://www.bcg.com/publications/2025/are-you-generating-value-from-ai-the-widening-gap
  4. 4.McKinsey, "The State of AI in 2025," November 2025 (1,993 participants). Three critical readiness requirements for scaling: standardized executable workflows, modern API interfaces around legacy systems, governance frameworks that make autonomous action observable and interruptible; only 21% have fundamentally redesigned workflows https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai

Frequently Asked Questions

We have been through multiple ERP implementations. What specifically should we unlearn?

Three things. First, unlearn the assumption that technology selection comes first. Your ERP experience tells you to choose the platform, then design around its capabilities. In AI business transformation, the design is done. Technology serves the design, not the other way around. Second, unlearn the assumption that gaps between design and deployed system are defects. In AI deployment, iteration is expected because AI behavior in production is inherently different from behavior in testing. Treating every deviation as a change request to be minimized will either constrain the AI's value or delay the deployment indefinitely. Third, unlearn the assumption that the business team hands off and waits for delivery. The domain owner must remain engaged throughout Level 4 because the iteration decisions require business judgment that the CIO's technical team cannot make alone.

How do we prevent the iteration cycle from becoming scope creep that delays the transformation indefinitely?

The iteration principle does not mean open-ended redesign. It means targeted adjustments where AI behavior in production does not match the assumptions in the workflow design. The boundaries are clear: the transformation intent defined at Level 2 and the workflow designs produced at Level 3 are the anchors. Iteration adjusts how the AI serves those designs, not whether the designs themselves were right. If the iteration reveals that a fundamental assumption in the workflow design was wrong, that is a Tier 2 quality gap that Article 16 should have caught. The domain owner's role in iteration is to hold the line on transformation intent while allowing technical accommodation where the AI's actual capabilities require it.

Can multiple domains transition to Level 4 simultaneously?

Yes, if the CIO's organization has the capacity and the cross-department interfaces from Article 14 are confirmed. The CAIO's department plays a critical coordination role here, ensuring that domains transitioning simultaneously maintain the consistency that Level 3 established. The risk of simultaneous transition is not conceptual but practical: the CIO's implementation team has finite capacity, and spreading that capacity too thin across multiple domains produces shallow implementation in all of them rather than deep implementation in any. The Level 1 triad's portfolio view should inform the sequencing decision.

What happens when the CIO's team says a Level 3 design cannot be implemented as specified?

This is exactly what the iteration partnership exists to resolve. The CIO's team proposes an alternative that is technically achievable. The domain owner evaluates whether that alternative preserves the transformation intent. The CAIO's translator helps both parties understand whether the proposed change affects cross-domain consistency. If the alternative preserves the intent, it proceeds. If it compromises the intent, the domain owner and CIO's team work together to find a different technical approach. If no technical approach can serve the design's intent, the domain owner escalates to the Level 1 triad with a clear description of the tradeoff, because that is a strategic decision about whether to accept reduced transformation value or invest in a different technical solution.

How should the CIO evaluate AI technologies against workflow designs when AI behavior is probabilistic?

The CIO's team cannot evaluate AI platforms in the abstract or based solely on vendor demonstrations. The evaluation must be conducted against the specific workflow designs Level 3 produced: can this technology perform the specific tasks the workflow requires, at the quality level the governance framework demands, with the data that is actually available? This evaluation requires the domain owner's participation because the business context determines what "sufficient" means. A 95% accuracy rate may be excellent for internal document classification but inadequate for customer-facing financial advice. The workflow designs and governance classifications from Level 3 provide the evaluation criteria. Article 18 covers the CIO's technology evaluation process in detail.

This series addresses “what” to do, not “how” to do it. If you are a business executive and would like help thinking through the “how,” please feel comfortable reaching out.

Previous: Article 16: Measuring What Matters at Level 3 · Next: Article 18: Technology Selection for AI Business Transformation

© 2026 Plaster Group, LLC. All rights reserved. This article may not be reproduced, distributed, or transmitted in any form without prior written permission from Plaster Group. Brief excerpts may be quoted for review or commentary purposes with attribution to the author and a link to the original article.

Ready to move forward?

Let's discuss how your organization can build with AI — securely, strategically, and starting from where you are today.

Start a Conversation