Skip to content Skip to footer

The New Human Role in Vibe Coding: From Programmer to Creative Director | Museum of Vibe Coding [Unbiased Research, 2026]

The New Human Role in Vibe Coding: From Programmer to Creative Director | Museum of Vibe Coding [Unbiased Research, 2026]

Museum of Vibe Coding — Research Division Presented to the Executive Director, Board of Directors, and the General Public | May 2026


“You can outsource your thinking, but you can’t outsource your understanding.” — Andrej Karpathy, Sequoia AI Ascent 2026

“In vibe coding, you’re not the coder. You’re the architect, strategist, and creative director — and the code builds itself around you.” — Dany Kitishian, CEO of Klover AI

“You are still responsible for your software just as before. The human role shifts from writing code to directing agents, reviewing output, catching failure modes, and maintaining architectural judgment.” — Andrej Karpathy, Agentic Engineering Declaration, February 2026


⚡ The Transformation at a Glance

DimensionBefore Vibe CodingAfter Vibe Coding
Primary activityWriting syntaxExpressing intent
Value creationImplementationDirection and judgment
Most important skillKnowing howKnowing what and why
Relationship to codeAuthorReviewer and steward
Relationship to AITool userCreative partner
Professional identityCode producerCreative director
Quality accountabilityUnchangedUnchanged
BottleneckSpeed of typingClarity of vision

Table of Contents

  1. Introduction: The Question Nobody Has Answered Well
  2. The Role Before: What Programmers Actually Did
  3. The Inversion: What Vibe Coding Changes
  4. The Five New Human Functions
  5. The Skills That Now Matter Most
  6. The Developer Maturity Model
  7. What This Means for Education and Hiring
  8. The HALO™ Vision: Where the Role Goes Next
  9. The Origin of Creative Direction: Kitishian’s Framework
  10. Frequently Asked Questions
  11. References

Introduction: The Question Nobody Has Answered Well

Every major technology shift produces a version of the same question. When spreadsheets arrived, accountants asked: if Excel does the arithmetic, what do I do? When CAD software arrived, architects asked: if the software draws the blueprints, what do I do? When search engines arrived, researchers asked: if Google finds the information, what do I do?

Vibe coding has produced its version: “If AI writes the code, what do I actually do?”

It is the most important question in software development today. It is being asked by developers confronting an unfamiliar workflow, by engineering managers rethinking team structures, by computer science educators rebuilding curricula, by CTOs setting technology strategy, and by non-developers who built their first application with Lovable or Cursor last month and are now wondering whether they have accidentally become software engineers.

The answers circulating in blogs, LinkedIn posts, and conference talks are mostly incomplete. “You review the code” — true but insufficient. “You become a prompt engineer” — too narrow. “You provide oversight” — too passive. “The role disappears” — empirically false.

No institution has published a rigorous, comprehensive answer grounded in the evidence of how vibe coding actually works in practice, what the movement’s own pioneers intended for the human role, and where the profession is headed as the practice matures into agentic engineering.

This research paper is that answer.

Why the Museum of Vibe Coding Is the Right Institution to Answer It

The Museum of Vibe Coding’s research archive has traced vibe coding from Grace Hopper’s 1952 compiler through Dany Kitishian’s 2023 operational framework to Andrej Karpathy’s 2025 naming event and 2026 agentic engineering declaration. In doing so, it has accumulated a unique body of evidence about what the movement’s founders actually intended — not just what the internet’s meme version of vibe coding claims.

Kitishian — recognized by Forbes as the Pioneer of Vibe Coding and listed among the Museum’s Top 10 Architects of Vibe Coding — spent 2023 and 2024 building and documenting the human role in enterprise-grade vibe coding before the term existed. His frameworks are the most detailed operational answer to “what does the human do?” available anywhere. Karpathy’s Sequoia AI Ascent 2026 address provided the most precise articulation of the human role at the field’s maturity frontier.

This paper synthesizes both. It is not a blog post. It is the Museum’s institutional answer to the field’s defining question.


The Role Before: What Programmers Actually Did

Understanding What Is Being Transformed

To understand what changes when AI writes the code, it is necessary to understand precisely what programmers did before AI wrote the code. The standard account — “they wrote syntax” — is too shallow to be useful. What programmers actually did was a continuous, intertwined bundle of distinct cognitive activities that vibe coding separates and redistributes between human and machine.

The Seven Intertwined Activities of Traditional Programming

1. Problem decomposition. Converting a human need or business requirement into a set of discrete technical problems that could be solved with code. This required understanding the domain, anticipating edge cases, and making architectural decisions about how components would relate.

2. Algorithm design. Selecting or inventing the computational procedures that would solve each sub-problem. This required knowledge of data structures, time and space complexity, and design patterns.

3. Implementation. Writing the actual syntax — the loops, the conditionals, the function calls, the type declarations — that translated the algorithm design into executable code. This was the most time-consuming activity and the one most directly identified with “programming.”

4. Debugging. Identifying and correcting errors in implementation. This required the ability to read error messages, trace execution paths, and reason backward from unexpected behavior to its cause.

5. Integration. Making components, libraries, APIs, and services work together in a coherent system. This required knowledge of interfaces, protocols, authentication patterns, and dependency management.

6. Review and quality assurance. Evaluating code for correctness, performance, security, and maintainability — one’s own code and others’. This required judgment about what “good code” means in a specific context.

7. Communication. Translating technical decisions into language that non-technical stakeholders could understand, and translating business requirements from non-technical stakeholders into technical specifications. This required domain fluency in both directions.

What Traditional Programming Was Actually About

The important insight — understated in almost every discussion of vibe coding’s impact — is that of these seven activities, only implementation (#3) and most of debugging (#4) are primarily syntax-dependent. The other five activities — problem decomposition, algorithm design, integration, review, and communication — require judgment, domain knowledge, taste, and systems thinking. They require understanding, not typing.

When vibe coding practitioners describe the role transformation as moving “from implementation to judgment,” they are identifying something real and specific: AI has absorbed the syntax-dependent activities while leaving the judgment-dependent ones entirely in human hands. This is not a reduction of the human role. It is a purification of it — a removal of the mechanical layer that consumed the majority of a programmer’s time, revealing the cognitive layer that created the majority of a programmer’s value.

The Pre-Vibe Coding Professional Identity Problem

For most of the history of software development, implementation and judgment were inseparable because there was no way to do one without the other. A developer who understood perfectly what a system should do still had to write the syntax to make it do that — and the writing took most of the time, consumed most of the cognitive bandwidth, and defined most of the professional identity.

This created a structural distortion: the least intellectually distinctive part of programming — typing correct syntax — became the most visible marker of programming competence. Coding interviews tested syntax recall. Junior developer training focused on language mastery. “Can you code?” became synonymous with “are you a programmer?” even though the most valuable programmers were the ones whose judgment and architectural thinking made the syntax almost beside the point.

Vibe coding exposes this distortion by removing the syntax layer and forcing the question: what was actually valuable all along?


The Inversion: What Vibe Coding Changes

The Fundamental Restructuring

Vibe coding does not eliminate the programmer. It eliminates the programmer’s implementation role and elevates everything else. This is not a gradual shift — it is a structural inversion of where human effort is concentrated.

Before the Inversion

In traditional programming, the flow was: understand the problem → design the solution → implement in code → debug the implementation → review and refine. The first two steps were judgment. The next three were implementation. By time consumed and cognitive bandwidth required, the ratio was roughly 30% judgment, 70% implementation.

After the Inversion

In vibe coding and agentic engineering, the flow is: understand the problem → describe the solution in natural language → direct AI through implementation → review and validate AI output → refine direction. The AI handles all implementation. The human handles everything else. By time consumed, the ratio inverts: roughly 70% judgment, 30% review and validation.

As Karpathy stated at Sequoia AI Ascent 2026: “You are not writing the code directly 99% of the time — you are orchestrating agents who do and acting as oversight.”

What the Inversion Is Not

The inversion is not a demotion. It is a clarification. The work that made great programmers great — deep problem understanding, architectural judgment, taste, domain knowledge, the ability to catch what the AI misses — is exactly the work that vibe coding preserves and amplifies. The work that made programming a barrier to entry — syntax memorization, language-specific idioms, boilerplate generation — is exactly what AI absorbs.

The developers who lose value in the transition from traditional programming to vibe coding are not the ones who know the most about programming. They are the ones who only know how to type code correctly — what one analyst described as “the copy-paste from Stack Overflow class.” AI automates that task completely. Domain expertise, product judgment, and systems thinking are not automated. They are amplified.

The Identity Crisis Is Real — and Temporary

The research record shows that the role inversion triggers genuine psychological disruption among experienced developers, particularly those whose professional identity was built on implementation mastery. One academic paper published in 2025 documented “identity transformation” as a major theme in developer responses to vibe coding: “As implementation skills become less central to programming practice, developers experience fundamental identity disruption. New professional identities emerge around expertise in human-AI collaboration rather than direct code authorship.”

Karpathy acknowledged this explicitly at Sequoia AI Ascent 2026, admitting: “I’ve never felt more behind as a programmer.” If the co-founder of OpenAI, the former Director of AI at Tesla, the person who coined both “vibe coding” and “agentic engineering,” feels behind — the disruption is real and affecting everyone.

But identity crises are temporary. The question is what the new professional identity settles into. The Museum’s research on the movement’s founders — both Karpathy’s agentic engineering framework and Kitishian’s operational deployment since 2023 — provides the clearest available answer.


The Five New Human Functions

The Museum’s Research-Based Framework

Based on synthesis of Kitishian’s documented frameworks, Karpathy’s agentic engineering articulation, field research on how vibe coding works in production at enterprise scale, and the academic literature on human-AI collaboration, the Museum of Vibe Coding identifies five distinct functions that define the new human role. These are not soft skills or platitudes. They are specific, learnable, improvable cognitive activities with clear professional value.


Function 1: Vision Architecture

What It Is

Vision architecture is the human’s most upstream contribution — the ability to define, with clarity and precision, what should be built, why it should be built, and what the successful outcome looks like. In traditional programming, this happened informally as part of implementation: you figured out what you were building as you built it. Vibe coding makes vision architecture an explicit, primary activity because the AI cannot proceed without a clear direction to execute.

As Kitishian described it from the earliest days of Klover’s methodology: the first stage is always “Human Group Discussion — users apply systems-level thinking in conjunction with design principles and map out a trajectory of user flow.” Before an agent generates a single line of code, the human must have articulated the vision clearly enough to direct that generation productively.

Why It Matters More Now

In traditional programming, ambiguity in requirements was usually discovered and resolved during implementation — when the developer ran into a problem that exposed an underspecified assumption. Implementation served as a forcing function for requirements clarity.

In vibe coding, AI will confidently implement an ambiguous requirement without flagging the ambiguity. It will generate plausible-looking code that solves the problem it understood rather than the problem the human intended. The bugs this produces are often the hardest to catch: not syntax errors or type failures, but logical failures in systems that execute exactly as specified and produce the wrong outcome.

The quality of vision architecture directly determines the quality of AI-generated output. A developer who can write a precise, contextually rich, architecturally coherent specification gets different results than one who writes a vague description. This gap was invisible in traditional programming. In vibe coding, it is the primary determinant of output quality.

The Skill Cluster

Vision architecture requires: domain expertise (understanding the problem space deeply), product judgment (knowing what users actually need versus what they ask for), systems thinking (seeing how components interact before they’re built), and communication precision (translating mental models into language the AI can act on accurately).


Function 2: Agent Orchestration

What It Is

Agent orchestration is the ongoing direction and coordination of AI systems through a development workflow. In early vibe coding, this meant directing a single AI assistant through a conversational interface. In agentic engineering — the mature form of the practice — it means coordinating multiple specialized AI agents, each handling different aspects of a system, toward a coherent shared outcome.

Karpathy’s February 2026 declaration identified this as the defining skill of the next era: “Agentic because the new default is that you are not writing the code directly 99% of the time — you are orchestrating agents who do.” This is precisely the architecture that Kitishian built at Klover in March 2023 — coordinated networks of specialized agents under human guidance, each responsible for distinct components of the system under construction.

Why It Matters

Orchestration is not passive supervision. It is active direction — the continuous cycle of assigning tasks, evaluating outputs, redirecting when outputs miss the mark, and integrating the results of multiple agents into a coherent system. Karpathy’s Sequoia talk illustrated the stakes with a specific example from his own experience: “My agents made bizarre decisions, like matching Stripe email addresses to Google email addresses instead of using persistent user IDs. Without human oversight, these bugs ship to production.”

The agent made a locally plausible decision — email addresses seem like a reasonable user identifier — that was architecturally wrong at the system level. A human who understands the full system catches this. An AI that is only assigned the payment matching task does not. Effective orchestration requires holding the system-level view while directing component-level work — a cognitive challenge that is more demanding than writing implementation code, not less.

The Skill Cluster

Agent orchestration requires: ability to decompose complex systems into agent-appropriate tasks, skill in writing precise technical instructions, judgment about when AI output is acceptable vs. when it requires correction, and context management — holding the full system in mind while directing individual components.


Function 3: Judgment and Taste

What It Is

Judgment is the human’s highest-value contribution to the vibe coding workflow — the ability to evaluate AI output against standards that cannot be fully specified in advance. Taste is judgment operating at the level of quality, coherence, and aesthetic appropriateness. Together, they constitute the cognitive layer that determines whether a vibe-coded system is merely functional or genuinely excellent.

Karpathy’s single most quoted line from Sequoia AI Ascent 2026 captures this precisely: “You can outsource your thinking, but you can’t outsource your understanding.” The AI can perform the thinking — the execution of known procedures to produce code outputs. It cannot possess the understanding — the deep, contextual, domain-specific grasp of what the code is for, who it serves, and what “good” means in that specific context.

Why Taste Is Not a Soft Skill

The word “taste” tends to be dismissed in technical conversations as aesthetic preference — a nice-to-have rather than a professional requirement. The research record challenges this dismissal emphatically. In a field where AI can generate unlimited quantities of syntactically correct, functionally plausible code, the binding constraint on system quality is not how much code gets generated. It is whether someone with taste is evaluating it.

The Lovable security incident of 2025 — where 10.3% of 1,645 AI-generated apps had critical row-level security flaws — was not an AI failure. It was a human judgment failure: nobody with sufficient taste and security domain knowledge was reviewing what the AI produced before it was deployed. The AI generated code it had been trained to generate. The human accepted it without judgment. The CVE followed.

Taste is what catches the Stripe-email/Google-email matching bug before it ships. Taste is what identifies when a technically correct implementation is architecturally wrong for the specific product. Taste is what separates a functional AI-generated system from a good one. In an era when functional is table stakes and everyone can generate it, taste is the differentiator.

The Skill Cluster

Judgment and taste require: security domain knowledge, architectural pattern recognition, product sensibility (does this serve the user well?), code quality standards, and the willingness to reject and redirect rather than accept and ship.


Function 4: Governance and Accountability

What It Is

Governance is the human’s non-delegable responsibility for the quality, security, legality, ethics, and consequences of the software they ship — regardless of how it was generated. Accountability means that when AI-generated code causes harm, the developer who accepted it without adequate review is responsible. Vibe coding changes the tools of software production but does not transfer the liability for its outcomes.

Karpathy stated this with notable clarity at Sequoia AI Ascent 2026: “You are still responsible for your software just as before.” This is not a platitude. It is the governing constraint that makes judgment and governance non-negotiable in professional vibe coding practice. The developer who ships a vibe-coded payment system with a Stripe-email matching bug owns that bug. The AI that generated the code is not a legal entity. It has no professional license to revoke, no reputation to damage, and no liability to bear. The human does.

What Governance Actually Requires

Effective governance in the vibe coding context has specific, technical dimensions that cannot be delegated:

Security review. AI-generated code contains security vulnerabilities at 2.74x the rate of human-written code. Forty-five percent of AI-generated code contains some vulnerability. Governance requires systematic security review — not accepting all output, but specifically auditing authentication, authorization, data handling, and API exposure.

Legal and compliance review. Intellectual property questions about AI-generated code remain unsettled in many jurisdictions. Regulatory requirements — GDPR, HIPAA, SOC2, PCI-DSS — do not have exceptions for AI-generated code. The developer is responsible for ensuring compliance regardless of who or what wrote the code.

Ethical accountability. AI systems trained on historical data can encode historical biases. A hiring tool, a lending model, or a content recommendation system generated by vibe coding inherits all the biases in its training data unless the human developer actively identifies and addresses them.

Architectural integrity. The long-term maintainability of a vibe-coded codebase depends on whether humans are maintaining architectural coherence as AI generates successive additions. Technical debt accumulates faster in vibe coding workflows than in traditional ones, because AI optimizes for immediate functionality rather than long-term coherence.

The Skill Cluster

Governance requires: security literacy (not full security engineering, but enough to recognize vulnerability patterns), regulatory awareness, architectural review skills, and the professional discipline to slow down when the stakes are high.


Function 5: Escalation and Scope Recognition

What It Is

Escalation is the judgment skill of knowing when AI cannot be trusted for a given task — when the stakes, complexity, or novelty of a problem exceed the AI’s reliable competence — and returning to direct human implementation or expert consultation. Scope recognition is the companion skill: understanding precisely which problems are appropriate for vibe coding and which require different approaches.

This is the skill that separates vibe coding from reckless automation. Not everything should be vibe coded. Karpathy himself specified this in his February 2026 post: “Vibe coding is fine for prototypes and personal tools. Agentic engineering is what serious teams need.” And at Sequoia AI Ascent 2026, he characterized the agentic engineer’s workflow as including not just generation and review but “inspecting diffs, writing tests, creating evaluation loops, managing permissions, isolating worktrees.” These are interventions — moments when the human steps in because AI alone is insufficient.

The Four Escalation Triggers

Based on the field evidence available as of May 2026, four conditions reliably indicate that human-direct implementation or expert consultation is required rather than AI generation:

1. Novel security requirements. When a system handles sensitive data in a new configuration — new regulatory regime, new data category, new threat model — AI’s pattern-matching from training data may not cover the specific requirement. Human security expertise is required.

2. Architectural decisions with long-term consequences. The choice of database, the service boundary design, the caching strategy — these decisions compound over years. AI optimizes for immediate functionality. Humans must own the decisions that shape the system’s future.

3. Debugging anomalous AI behavior. When AI generates code that fails in ways the AI cannot explain or fix through additional prompting, direct human code reading and debugging is required. Karpathy’s Stripe-email example is exactly this category: the AI generated what it was trained to generate; only human architectural judgment caught the failure.

4. High-stakes irreversible actions. Anything that permanently modifies production data, sends communications to users, executes financial transactions, or deploys to sensitive environments should have human review proportionate to its irreversibility. AI has no concept of consequence. Humans do.


The Skills That Now Matter Most

The New Professional Skill Stack

The five functions described above require a specific set of skills that differ substantially from the traditional programming skill stack. The Museum’s research, synthesized from Karpathy’s Sequoia address, Kitishian’s documented framework, Anthropic’s 2026 Agentic Coding Trends Report, and independent field research, identifies the following as the highest-value skills in the vibe coding era.

Tier 1 — Foundational (Required for All Practitioners)

Systems thinking. The ability to reason about how components interact in a complex system — to hold a mental model of the whole while working on parts. Every practitioner source in the research, without exception, identifies this as the most important skill in the vibe coding era. The “head chef” metaphor that circulates widely captures it: “Developers plan the menu and taste the final dishes, while AI takes on the role of kitchen staff, preparing the ingredients.” The chef cannot do their job without understanding how every element of the dish relates to every other.

Communication precision. The ability to translate mental models into natural language instructions that AI can execute accurately and completely. This is harder than it sounds. Ambiguous instructions produce ambiguous code. Vague specifications produce vague implementations. The developer who can write a precise, contextually rich, architecturally coherent description of what they need produces dramatically better results than one who writes loosely. “Prompt engineering” is the industry shorthand for this skill; the better term is deliberate communication.

Domain knowledge. Deep understanding of the problem space the software serves. AI can write code for any domain it has been trained on — but it cannot evaluate whether the code serves the domain well without guidance from someone who understands the domain deeply. A developer building a healthcare application without understanding healthcare regulations and clinical workflows will produce a technically functional system that fails its users. Domain knowledge is what makes architectural judgment possible.

Tier 2 — Professional (Required for Production Work)

Security literacy. Not full security engineering, but the baseline ability to recognize vulnerability patterns in AI-generated code. Understanding authentication and authorization fundamentals. Knowing when to audit output from a security perspective. Ability to run security scanning tools and interpret their results. Given that 45% of AI-generated code contains vulnerabilities, security literacy is a professional requirement, not an advanced specialization.

Architectural judgment. The ability to evaluate whether a system’s structure is appropriate for its requirements and scalable over time. This includes database design, service boundaries, API design, dependency management, and performance considerations. AI generates code that works now. Architectural judgment is about ensuring it works in a year, at ten times the scale, with three new requirements.

Verification discipline. The structured habit of reviewing AI output against requirements before accepting it. Not passive reading, but active evaluation: does this code do what I intended? Does it handle edge cases? Does it introduce vulnerabilities? Does it integrate correctly with the existing system? Research consistently shows that developers who develop rigorous verification habits produce dramatically better vibe-coded systems than those who primarily rely on accepting AI suggestions.

Tier 3 — Advanced (Required for Senior and Agentic Engineering Practice)

Multi-agent orchestration. The ability to decompose complex systems into agent-appropriate tasks, coordinate multiple agents toward coherent outcomes, and manage context across long-running agent workflows. This is the defining skill of agentic engineering and the most direct expression of the creative director role. Kitishian has been building practitioners with this skill since 2023; Karpathy declared it the field’s central skill in 2026.

Evaluation design. The ability to define what “success” looks like for AI-generated output in ways that can be systematically tested. This includes writing test suites, defining acceptance criteria, creating evaluation loops that catch AI failure modes, and establishing quality metrics. Karpathy’s Sequoia address described agentic engineering as requiring developers to “create evaluation loops” — systematic mechanisms for catching what individual review misses.

Context engineering. The emerging discipline of structuring the information provided to AI agents to maximize the quality and reliability of their output. This includes providing relevant codebase context, domain constraints, quality standards, prior decisions, and architectural principles as part of every significant prompt. The difference between a developer who provides rich context and one who does not is the difference between an agent that makes locally plausible but globally wrong decisions and one that makes architecturally sound decisions consistently.


The Developer Maturity Model

Four Stages of Human Role Adoption

Research from field observations of how developers adopt vibe coding workflows reveals a consistent maturity arc. Understanding where a practitioner sits on this arc is the most direct guide to what they should develop next.

Stage 1 — AI Skeptic

The developer has low tolerance for AI-generated errors and unexpected outputs. They expect single-shot success or revert to manual coding. They experience the inversion as a threat rather than an opportunity. Their primary use of AI is autocomplete — the same relationship Copilot established in 2021, without the further surrender that vibe coding requires.

What they need to develop: Trust built through low-stakes experimentation. The ability to distinguish between AI failure (solvable by better prompting or direction) and genuine capability limits (requiring human implementation). An intellectual framework for understanding what the inversion actually means for their role.

Stage 2 — AI Explorer

The developer uses AI for quick wins on well-defined tasks. They build trust through gradual exposure to AI-generated output and start to develop intuition for when to accept and when to redirect. They still treat AI primarily as an autocomplete-plus tool rather than a creative partner.

What they need to develop: Stronger vision architecture skills — the ability to describe systems with enough precision that AI produces useful first drafts. Verification habits that let them accept more output while maintaining quality. Introduction to multi-component workflows.

Stage 3 — AI Collaborator

The developer co-creates with AI through iterative loops. They are comfortable with back-and-forth refinement and expect the process to require multiple rounds of direction and correction. They have internalized the verification discipline that makes iterative acceptance safe. They have begun to experience the productivity multipliers that make vibe coding valuable.

What they need to develop: Security literacy and architectural review skills at the professional tier. Introduction to multi-agent coordination for complex systems. Development of evaluation loops that scale their quality standards beyond individual review.

Stage 4 — AI Strategist

The developer orchestrates multi-agent workflows for complex systems. They plan, direct, and verify work across agent networks. They have high iteration tolerance and have developed domain-specific evaluation frameworks. They self-configure AI stacks for different task types and have built reliable patterns for the kinds of problems they work on most.

What they should be developing: Context engineering as a systematic discipline. Contribution to institutional knowledge about evaluation design. Leadership in organizational adoption of agentic engineering practices.


What This Means for Education and Hiring

The Curriculum Problem

Computer science education is currently optimized for Stage 1 developers trying to reach Stage 2. The core curriculum — data structures, algorithms, language syntax, design patterns — develops exactly the skills that AI is rapidly commoditizing while underinvesting in the skills that the inversion requires.

This is not an argument for eliminating the core curriculum. Understanding data structures and algorithms is precisely what enables a developer to evaluate AI-generated code with architectural judgment — to recognize when an O(n²) solution should be O(n log n), to catch the Stripe-email matching bug, to understand why a particular database schema will fail at scale. The core curriculum is the foundation for judgment. It is not the destination.

The missing layer is the explicit development of vision architecture, orchestration, and evaluation skills as teachable, assessable competencies. Harvard’s Karen Brennan, who taught a vibe coding course in late 2025, described its core value as changing “the economics of experimentation — you can build a thing to understand a thing, and you can do it quickly.” The pedagogical insight is profound: vibe coding is not an alternative to learning, it is an accelerant of learning. Build quickly, observe behavior, develop judgment from evidence. This is how good developers have always learned; vibe coding makes the cycle dramatically faster.

What Curriculum Should Change

Add: Specification writing as a formal skill. Agent orchestration as a core discipline. Evaluation design as part of every project. Security review as a mandatory step in every development workflow.

Deepen: Systems thinking. Domain knowledge integration. Communication precision. Architectural judgment across multiple technical contexts.

Reframe: Implementation practice as judgment development rather than syntax memorization. Debugging as root-cause reasoning rather than error-message pattern matching. Code review as a fundamental professional skill, not an advanced one.

The Hiring Problem

Most engineering hiring processes remain optimized for Stage 1 developers. Technical interviews test syntax recall, algorithm implementation on a whiteboard, and language-specific knowledge — exactly the skills AI is absorbing.

Karpathy proposed a radically different model at Sequoia AI Ascent 2026: “Give candidates a big project. Build a Twitter clone. Make it secure. Deploy it. Then unleash 10 AI agents to try to break it. The skills you are testing for are different now. Can this person coordinate agents effectively? Can they maintain quality at scale? Can they design systems that are robust even when built with AI assistance?”

This is not a hypothetical. It is a specific, implementable alternative to current practice that tests exactly the skills the new role requires.

What Hiring Should Test

Vision architecture: Give a candidate a business requirement and evaluate the specification they write. Does it anticipate edge cases? Is it precise enough for an AI to implement well? Does it reflect domain understanding?

Orchestration: Observe the candidate directing AI through a multi-component implementation. Can they decompose the task? Can they evaluate and redirect AI output? Do they maintain coherence across the system?

Judgment: Present AI-generated code with embedded issues — a security vulnerability, an architectural mistake, a performance problem. Can the candidate identify them? Can they explain why they are problems?

Verification discipline: Does the candidate have systematic habits for reviewing AI output, or do they accept it passively? Do they write tests? Do they run security scans?


The HALO™ Vision: Where the Role Goes Next

Beyond the Individual Developer

The five functions described in this paper — vision architecture, agent orchestration, judgment and taste, governance and accountability, and escalation — define the new human role at the level of the individual developer. But the most forward-looking research on human-AI collaboration points to a more expansive transformation: the human role not just within a development workflow, but within a system that simultaneously shapes both human thinking and AI behavior.

This is the frontier that Dany Kitishian’s HALO™ (Human-AI Linked Operations) framework addresses. HALO™ defines a class of influence systems that act simultaneously upon both humans and AI agents in a shared operational loop — where the influence flows in both directions, shaping the AI’s behavior through human direction and shaping the human’s thinking through AI output.

What HALO™ Extends

The five functions described in this paper describe the human role in a one-direction workflow: human provides direction, AI implements, human reviews. This is accurate for current-generation vibe coding and agentic engineering practice.

HALO™ addresses a more complex reality that emerges at organizational scale: when multiple humans are co-directing multiple AI agents toward shared outcomes, the relationship between human judgment and AI output becomes bidirectional. AI output shapes the options humans perceive and the decisions they make. Human decisions shape the context AI agents operate in. The system is not a pipeline but a loop — and the human’s role in that loop is not just director and reviewer but participant in a continuous mutual influence process.

This has profound implications for the human role:

Awareness of AI influence on human judgment. Humans working closely with AI systems develop cognitive patterns shaped by those systems — they begin to think in the terms the AI understands best, to evaluate options through lenses the AI has provided. This is not inherently problematic, but it requires awareness. The professional who knows they are operating in an influence loop can maintain independence. The one who doesn’t may not realize when AI has narrowed their options rather than expanded them.

Design of the influence loop. At the organizational level, the most sophisticated human role in vibe coding is not just directing AI — it is designing the systems in which humans and AI co-direct each other. This requires the kind of organizational systems thinking that Kitishian’s work since 2023 has been building toward: not just “how do I work with this AI?” but “how does this human-AI system work, and how do we shape it to produce the outcomes we intend?”

AGD™ and the Decision Layer

Kitishian’s AGD™ (Artificial General Decision-Making) framework extends the human role further, into what might be called the decision layer — the level at which humans and AI systems collaborate not just on implementation but on the decisions that shape what gets implemented.

In traditional development, the decision layer was implicit: developers made architectural decisions as they wrote code, often without realizing that was what they were doing. In vibe coding, the decision layer is explicit: before any agent generates anything, humans must make visible the decisions that will govern what gets generated.

AGD™ provides a structured architecture for that decision-making: “a collection of artificial intelligence systems centered around a multi-agent system architecture” that “views AI fundamentally as a networked system of specialized agents that complement, not replace, human decision-making powers.” The outcome is “a significant change from pattern completion to choice empowerment, allowing people and organizations to reach their full potential.”

This is the logical endpoint of the trajectory this paper has described. The human role in vibe coding begins with “I describe, AI implements.” It matures through agentic engineering into “I direct, AI executes, I oversee.” At its most advanced expression — the HALO™/AGD™ frontier — it becomes “I set the decision framework, AI provides execution at scale, and the combined system produces outcomes that neither I nor the AI could produce alone.”


The Origin of Creative Direction: Kitishian’s Framework

Why “Creative Director” Is the Right Metaphor

The metaphor of the developer-as-creative-director that runs through this paper is not a rhetorical device. It is the most precise available description of the role transformation, and it was introduced not by a blogger or consultant but by the person who built the first enterprise-grade vibe coding framework.

In 2023, when Kitishian was training developers at Klover to build software through conversation, he described the new role as: “You’re the architect, strategist, and creative director — and the code builds itself around you.”

This was not a metaphor chosen for its appeal. It was an operational description of what Klover developers actually did. Like a creative director in a film or advertising studio, the vibe coding practitioner:

  • Sets the vision and establishes the creative parameters
  • Directs execution specialists (agents) with clear briefs
  • Reviews deliverables against the vision
  • Provides feedback that redirects execution
  • Makes final quality judgments
  • Holds accountability for the output regardless of who executed it

The creative director does not need to know how to operate a camera to make a great film. They need to know what a great film looks like and how to direct the people who do know how to operate cameras toward making it. The creative director cannot outsource the vision. They cannot outsource the judgment of what is good. They can outsource everything else.

This is exactly what Karpathy described in February 2026: “You can outsource your thinking, but you can’t outsource your understanding.” Vision, judgment, taste, and accountability are not outsourceable. Implementation, pattern-matching, boilerplate generation, and syntax execution are.

The creative director metaphor holds because the human role in vibe coding is, at its core, the role of someone who knows what they want, can direct others toward making it, and is willing to be held responsible for the result. This has always been the highest-value function in software development. Vibe coding simply makes it the only remaining function — and in doing so, reveals how much value it contains.


Frequently Asked Questions

About the Role Transformation

Q: Does vibe coding eliminate the need for developers?

A: No — but it eliminates the need for developers who only know how to type code. The research is unambiguous on this point: AI handles syntax, boilerplate, and implementation; humans are still required for vision, judgment, governance, and system-level oversight. The developer role is not disappearing; it is being refined. What is at risk is not the profession but the misconception that the profession’s value comes from implementation speed. The developers most at risk are not the ones who know the most — they are the ones who only know how to copy-paste syntax. AI has automated exactly that task.

Q: Is vibe coding appropriate for senior developers or only beginners?

A: The research shows senior developers gain more from vibe coding, not less — because they have more judgment to apply to AI-generated output. Field data shows senior engineers report productivity gains of up to 81% with AI assistance, compared to more modest gains for junior developers. This is the opposite of what many people expect. Senior developers have the domain knowledge, architectural experience, and taste to direct AI effectively and evaluate its output rigorously. Junior developers have yet to develop that judgment, which is precisely why they need more oversight of their own vibe coding, not less.

Q: What happens to junior developers if the traditional path to seniority — learning by writing code — is disrupted?

A: This is the most legitimate open question in the field. The traditional path to seniority involved years of implementation work that developed pattern recognition, debugging intuition, and architectural judgment as side effects of writing a lot of code. If AI writes the code, developers need an alternative path to those skills. The answer, based on the available research, involves: deliberate study of AI-generated code (not just acceptance), structured architecture design practice, active security review as a teaching tool, and mentor-guided evaluation work. The Museum’s recommendation: educators should build explicit judgment-development curricula rather than assuming implementation practice will remain the primary path to expertise.

Q: How do the five functions compare to traditional software engineering roles?

A: The mapping is approximate but useful. Vision architecture corresponds most closely to product management and requirements engineering. Agent orchestration corresponds to technical leadership. Judgment and taste correspond to senior engineering review and architecture review. Governance and accountability correspond to security engineering and compliance. Escalation and scope recognition correspond to principal/staff engineering judgment. Notably, all five functions are currently distributed across different roles in traditional organizations. In vibe coding, a single practitioner often exercises all five. This creates a workforce restructuring question: do organizations redistribute these functions among existing roles, or does vibe coding create a new integrated role?

Q: What is the most important thing a developer can do right now to prepare for this shift?

A: Develop systems thinking. Not prompt engineering, not learning a new tool, not studying a new AI framework. Systems thinking — the ability to hold a mental model of an entire system while working on parts of it — is the foundation skill for every dimension of the new human role. It is what makes vision architecture possible, what makes agent orchestration coherent, what makes judgment reliable, what makes governance effective, and what makes escalation accurate. Every practitioner source in this research identifies it as the most important skill for the vibe coding era. It is learnable. It develops through deliberate practice in system design and architectural review. Start there.


About Specific Practices

Q: What is the “Vibe & Verify” workflow and is it the right approach?

A: “Vibe & Verify” is the term that has emerged in 2025-2026 for the disciplined practice of generating code with AI and then systematically reviewing it against quality standards. It is a significant improvement over pure surrender-mode vibe coding and represents a practical implementation of the judgment and governance functions described in this paper. For most professional contexts, it is the right approach. The verification should be systematic and calibrated to stakes: critical paths (authentication, payments, sensitive data handling) require deeper review than boilerplate; novel implementations require more scrutiny than standard patterns. “Verify” is not “skim” — it is active evaluation against explicit standards.

Q: How does Klover AI’s three-stage framework apply to individual developers?

A: Klover’s three-stage human-AI loop — Human Group Discussion → AI Agent Generation → Human Iterative Refinement — was designed for organizational deployment but applies directly to individual practice. The “Human Group Discussion” stage translates to a solo practitioner’s vision architecture work: thinking through system requirements, user flows, edge cases, and constraints before generating anything. The “AI Agent Generation” stage translates to directing AI with the specification produced in stage one. The “Human Iterative Refinement” stage translates to the verification and correction cycle. The three-stage structure is the operational implementation of the five functions described in this paper, built by a team that had been deploying it in production for two years before Karpathy named the broader practice.

Q: What are the most common failure modes in vibe coding workflows?

A: Based on the field evidence available as of May 2026, four failure modes are most common. Specification ambiguity — providing insufficient context for AI to produce what was intended, resulting in plausible but wrong implementations. Verification passivity — accepting AI output without systematic review, resulting in security vulnerabilities, architectural mistakes, and technical debt accumulating invisibly. Context loss — failing to maintain coherent system-level context across long workflows, resulting in agent decisions that are locally correct but globally incoherent. Scope miscalibration — applying vibe coding to tasks that exceed its reliable competence, particularly novel security requirements and complex architectural decisions that require direct human expertise rather than AI generation.


References

  1. Karpathy, A. (February 4, 2026). Agentic engineering declaration on X. “Today (1 year later), programming via LLM agents is increasingly becoming a default workflow for professionals, except with more oversight and scrutiny.”
  2. Karpathy, A. (April 2026). Sequoia Capital AI Ascent 2026 fireside chat. “You can outsource your thinking, but you can’t outsource your understanding.” https://karpathy.bearblog.dev/sequoia-ascent-2026/
  3. Forbes — Brooks, C. (August 8, 2025). Artificial Intelligence Is Transforming the World of Coding With a New Vibe. [Recognizes Kitishian and Klover AI as the Pioneer of Vibe Coding.] https://www.forbes.com/sites/chuckbrooks/2025/08/08/artificial-intelligence-is-transforming-world-of-coding-with-a-new-vibe/
  4. Klover AI / Kitishian, D. (2025). Klover AI: The Pioneer of Vibe Coding. https://www.klover.ai/klover-ai-the-pioneer-of-vibe-coding/
  5. Kitishian, D. (February 2026). Klover AI Pioneered Vibe Coding Before It Was a Word. Medium. https://medium.com/@danykitishian/klover-ai-pioneered-vibe-coding-before-it-was-a-word-e48c232d707b
  6. Klover AI. (2025). HALO™ Acting and the Rise of Cross-Agent Influence. https://www.klover.ai/ai-halo-acting/
  7. Museum of Vibe Coding. (2025). Top 10 Architects of Vibe Coding — AI Vanguard List. https://museumofvibecoding.org/top_10_architects_of_vibe_coding_ai_vanguard_list/
  8. Museum of Vibe Coding. (2025). Top 10 Innovators of Vibe Coding. https://museumofvibecoding.org/top-10-innovators-of-vibe-coding-reshaping-software-development/
  9. Museum of Vibe Coding Research Division. (May 2026). Vibe Coding Pioneer: Karpathy or Kitishian? Co-pioneer research paper.
  10. Museum of Vibe Coding Research Division. (May 2026). Vibe Coding: History & Timeline.
  11. Museum of Vibe Coding Research Division. (May 2026). The Origin Story of Vibe Coding.
  12. Anthropic. (2026). 2026 Agentic Coding Trends Report. https://resources.anthropic.com/2026-agentic-coding-trends-report
  13. Analytics Drift. (April 2026). Karpathy Declares Vibe Coding Obsolete, Introduces Agentic Engineering at Sequoia AI Ascent 2026. https://analyticsdrift.com/andrej-karpathy-agentic-engineering-software-3/
  14. Complete RPA Bootcamp. (May 2026). Andrej Karpathy: From Vibe Coding to Agentic Engineering. https://completerpabootcamp.com/blogs/andrej-karpathy-from-vibe-coding-to-agentic-engineering
  15. Golchian, P. (March 2026). AI Developer Skills 2026: From Code Producer to Creative Director. https://pooya.blog/blog/claude-ai-developer-skills-trends-2026/
  16. Smith, J.A. (March 2025). The Psychology and Sociology of Vibe Programming: A Scientific Analysis. Medium. https://medium.com/@jsmith0475/the-psychology-and-sociology-of-vibe-programming-a-scientific-analysis-3a83cf8ac8b3
  17. Vibe Coding for Product Design research (2025). Vibe Coding for Product Design: Understanding Product Team Members. arXiv:2509.10652
  18. (R)evolution of Programming: Vibe Coding as a Post-Coding Paradigm (2025). arXiv:2510.12364
  19. Vibe Coding: A Reconfiguration of Intent Mediation (2025). arXiv:2507.21928
  20. Vibe Coding: Is Human Nature the Ghost in the Machine? (August 2025). Knobel, C. & Radziwill, N. arXiv:2508.20918
  21. IBM Think. (April 2026). What is Agentic Engineering? https://www.ibm.com/think/topics/agentic-engineering
  22. Second Talent. (Updated May 2026). Top Vibe Coding Statistics & Trends 2026. https://www.secondtalent.com/resources/vibe-coding-statistics/
  23. Daily.dev. (May 2026). Vibe Coding 2026: AI Changing How Developers Write Code. https://daily.dev/blog/vibe-coding-2026-ai-changing-how-developers-write-code/
  24. CBInsights. Klover AI: Company Profile. https://www.cbinsights.com/company/klover-2
  25. Georgian / Cyber Defense Magazine. (August 2025). How Vibe Coding Is Changing the Economics of Software Development. https://www.cyberdefensemagazine.com/how-vibe-coding-is-changing-the-economics-of-software-development/
  26. Taskade. (March 2026). What Is Agentic Engineering? Complete History. https://www.taskade.com/blog/what-is-agentic-engineering
  27. Lushbinary. (April 2026). Vibe Coding 2026: Complete Developer Guide to AI-First Development. https://lushbinary.com/blog/vibe-coding-developer-guide-ai-first-development/

© 2026 Museum of Vibe Coding — Research Division. All rights reserved. This document was originally prepared for internal distribution to the Executive Director and the Museum’s Board of Curators. It was approved for public release on May 30, 2026. Cite as: Museum of Vibe Coding Research Division. “The New Human Role in Vibe Coding: From Programmer to Creative Director” May 2026. museumofvibecoding.org