Not everything needs to wait in line
In L-0504, you learned that agents operating in sequence need an explicitly defined order. Sequential execution solves the dependency problem: when one agent's output is the next agent's input, running them out of order produces garbage. But sequencing has a cost. It forces every task to wait for every prior task, even when some of those tasks have nothing to do with each other.
Here is the question that separates competent coordination from naive coordination: which of your agents actually depend on each other, and which ones are waiting in line for no reason?
The answer determines whether your system runs in minutes or hours, whether your project finishes in weeks or months, and whether adding more resources accelerates your work or just creates more overhead. Parallel versus sequential execution is not a preference. It is a structural analysis of dependency, and getting it wrong in either direction produces distinct and predictable failures.
The dependency graph: the structure underneath every project
Every multi-step process has a dependency structure, whether you have mapped it or not. Computer scientists formalized this as the Directed Acyclic Graph, or DAG — a set of tasks represented as nodes, with directed edges showing which tasks must complete before others can begin. The graph is acyclic because circular dependencies (A requires B, B requires A) create deadlocks, not workflows.
The power of the DAG is that it makes the parallel/sequential question concrete. Any task with no incoming edges — no prerequisites — can start immediately. Any group of tasks whose prerequisites are all satisfied can run simultaneously. The only tasks that must wait are those with unresolved upstream dependencies. The graph does not tell you to parallelize or serialize. It tells you where you have a genuine choice and where you do not.
The critical path — the longest chain of dependent tasks from start to finish — determines the minimum possible completion time for the entire project, regardless of how many agents or resources you assign. This is not a guideline. It is a mathematical constraint. You cannot compress a project below its critical path any more than you can make a baby in one month by assigning nine women to the task. Frederick Brooks made this observation in The Mythical Man-Month (1975), and it remains one of the most consistently ignored truths in project management: some work is inherently sequential, and no amount of parallelism can accelerate it.
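Once the graph is written down, the critical path is mechanical to compute. A minimal sketch in Python (the task names and durations here are illustrative, not from any real project):

```python
# Critical path length of a DAG: the earliest possible finish time,
# found by memoized depth-first search over the dependency map.
# Task names and durations are illustrative placeholders.
from functools import lru_cache

durations = {"spec": 2, "api": 3, "ui": 3, "tests": 1, "deploy": 1}
deps = {  # task -> tasks that must finish before it can start
    "spec": [],
    "api": ["spec"],
    "ui": ["spec"],
    "tests": ["api", "ui"],
    "deploy": ["tests"],
}

@lru_cache(maxsize=None)
def finish_time(task):
    """Earliest finish: own duration plus the slowest prerequisite's finish."""
    return durations[task] + max((finish_time(d) for d in deps[task]), default=0)

critical_path_length = max(finish_time(t) for t in durations)
print(critical_path_length)  # 7
```

Note what the number says: the five tasks sum to 10 units of work, but because `api` and `ui` are independent and can overlap, the project's floor is 7 units, set entirely by the longest dependent chain (spec → api/ui → tests → deploy). No amount of added staffing moves it below 7.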
But Brooks' insight has a complement that people miss just as often. The tasks that are not on the critical path — the ones with no dependencies on each other — can and should run in parallel. Serializing independent work is pure waste. It is the organizational equivalent of standing in a single-file line at a buffet with twenty serving stations.
Amdahl's Law: the math of serial bottlenecks
In 1967, computer scientist Gene Amdahl presented a formula that quantifies exactly how much speedup you can achieve by parallelizing a system. The formula is deceptively simple: if a fraction s of the work is inherently serial, the maximum speedup with N parallel workers is 1 / (s + (1 − s)/N), which approaches 1/s as N grows. The ceiling is set entirely by the fraction of work that must remain sequential.
If 10% of your process is inherently serial — tasks that cannot overlap because each depends on the previous one's output — then no matter how many parallel workers you add, you can never achieve more than a 10x speedup. Add a thousand processors, ten thousand agents, infinite resources: the serial fraction is the ceiling. If 50% of the work is serial, the maximum speedup is 2x. If 90% is serial, you cap at 1.11x — barely any improvement at all.
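The numbers above fall straight out of the formula. A few lines of Python to check them:

```python
# Amdahl's Law: speedup with n parallel workers when a fraction s
# of the work is inherently serial.
def speedup(s, n):
    return 1 / (s + (1 - s) / n)

def ceiling(s):
    """Limit as n grows without bound: the serial fraction caps speedup at 1/s."""
    return 1 / s

print(round(ceiling(0.10), 2))        # 10.0  -> 10% serial caps you at 10x
print(round(ceiling(0.50), 2))        # 2.0   -> 50% serial caps you at 2x
print(round(ceiling(0.90), 2))        # 1.11  -> 90% serial: barely any gain
print(round(speedup(0.10, 1000), 2))  # 9.91  -> a thousand workers, still under 10x
```

The last line is the sobering one: with a 10% serial fraction, a thousandfold increase in workers buys a 9.91x speedup, still short of the 10x ceiling that one dependency structure imposes.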
Amdahl's Law reveals something profound about system design: the bottleneck is never the parallel part. Optimization efforts aimed at making already-parallel work faster yield diminishing returns. The leverage is in the serial fraction — in finding ways to restructure genuinely sequential dependencies so that more work can overlap. Every dependency you can eliminate or restructure expands the parallelizable portion and raises the ceiling on how fast the entire system can operate.
This applies identically to cognitive work, organizational projects, and AI agent pipelines. The question is never "How do I run more things in parallel?" The question is "Which of my sequential constraints are real, and which are assumptions I have never examined?"
Your brain already knows both modes
The parallel/sequential distinction is not just an engineering concept. It is wired into your cognitive architecture.
Research on dual-task performance in cognitive psychology has produced decades of evidence about when the brain processes tasks simultaneously and when it serializes them. The psychological refractory period (PRP) — the delay observed when a person must respond to two stimuli in rapid succession — demonstrates that certain kinds of response selection appear to operate through a serial bottleneck. When you are choosing how to respond to one stimulus, your selection of a response to a second stimulus must wait, even if the perceptual processing of both stimuli happened in parallel (Fischer & Plessow, 2015).
But this serial bottleneck is not absolute. Research by Fischer, Gottschalk, and Dreisbach (2014) using fMRI showed that the brain recruits different neural mechanisms depending on whether tasks are processed serially or in parallel. Serial response selection activates the lateral prefrontal cortex — the effortful, controlled processing network. Parallel task processing engages the striatum, a subcortical structure associated with more automatic, habituated processing. Your brain literally uses different hardware for the two modes.
The implication for personal epistemology is direct. Novel decisions, complex judgments, and unfamiliar problem types require serial cognitive processing — you cannot genuinely deliberate on two hard problems simultaneously. But well-practiced routines, habituated monitoring tasks, and familiar procedures can run in parallel without degrading performance. The cognitive strategy, like the engineering strategy, is not to force everything into one mode. It is to know which tasks demand serial attention and which have been practiced enough to run concurrently.
The AI parallel: orchestrators, DAGs, and agent pipelines
If you work with AI agent systems, the parallel/sequential distinction is not a metaphor. It is the central design decision in every multi-agent architecture.
Modern orchestration frameworks — LangGraph, AutoGen, Microsoft's Semantic Kernel — all model agent workflows as graphs. Each agent is a node. Edges define data dependencies: agent B cannot run until it has the output of agent A. The orchestrator analyzes this graph and determines which agents can run simultaneously and which must wait. This is dependency resolution, and it follows exactly the same DAG logic used in build systems, compilers, and project management since the 1960s.
The common orchestration patterns map directly to the parallel/sequential spectrum. Sequential pipelines chain agents one after another — simple but slow, appropriate only when every step genuinely depends on the previous one. Parallel fan-out dispatches multiple agents simultaneously, collects their results, and passes them to a downstream agent — fast but requiring careful design of the merge point. Hierarchical patterns use a supervisor agent that delegates subtasks to worker agents, some in parallel and some sequentially, based on the dependency structure of the problem.
The most effective pattern, emerging across the major frameworks in 2025 and 2026, is the hybrid approach: high-level orchestration that handles strategic sequencing while allowing parallel execution within each phase. A research agent and a data-retrieval agent run simultaneously because neither needs the other's output. Their results then flow sequentially to an analysis agent that requires both inputs before it can begin. The graph makes this structure explicit. Without the graph, you are guessing.
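The hybrid shape described above can be sketched with nothing more than Python's standard asyncio library. The "agents" here are stand-in coroutines, not calls to any real framework; the point is the structure: a parallel fan-out, then a sequential join.

```python
# Hybrid orchestration sketch: two independent agents run in parallel,
# and an analysis step runs only after both results have joined.
# The agent bodies are placeholders, not any real framework's API.
import asyncio

async def research_agent(topic):
    await asyncio.sleep(0.05)  # placeholder for real work
    return f"notes on {topic}"

async def retrieval_agent(topic):
    await asyncio.sleep(0.05)
    return f"records for {topic}"

async def analysis_agent(notes, records):
    return f"analysis of ({notes}) + ({records})"

async def pipeline(topic):
    # Parallel phase: neither agent needs the other's output.
    notes, records = await asyncio.gather(
        research_agent(topic), retrieval_agent(topic)
    )
    # Sequential phase: analysis genuinely requires both inputs.
    return await analysis_agent(notes, records)

print(asyncio.run(pipeline("pricing")))
```

The `gather` call is the graph made executable: it encodes "these two nodes have no edge between them," while the ordinary `await` that follows encodes the one edge that does exist.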
This mirrors how effective human teams work. A product launch involves parallel workstreams — marketing, engineering, legal — that operate independently until a convergence point where their outputs must be integrated. The project manager's job is not to choose parallel or sequential. It is to map the dependency structure, identify which workstreams can overlap, and enforce sequencing only where outputs genuinely flow from one team to another.
The two failures: artificial serialization and premature parallelization
Getting the parallel/sequential balance wrong produces two distinct failure modes, and you need to recognize both.
Artificial serialization is the more common failure. It happens when you treat independent tasks as if they were dependent — waiting for the research to be "complete" before starting the outline, waiting for one vendor quote before requesting a second, waiting for approval on step three before beginning step five even though step five has no relationship to step three. The symptom is a project that takes three times longer than its critical path because every task is queued behind every other task. The cause is usually one of two things: a manager who cannot hold parallel workstreams in their head, or an organizational culture that treats sequential handoffs as the only safe coordination mechanism.
Premature parallelization is the subtler failure. It happens when you launch tasks before their dependencies are resolved. You start writing the implementation before the architecture is decided. You begin marketing a feature before the team has committed to building it. You run three agents simultaneously even though agent two needs agent one's output. The symptom is wasted work: effort invested in paths that get invalidated when an upstream decision changes. In AI agent systems, this manifests as agents producing outputs based on stale or missing inputs, generating results that must be thrown away when the actual upstream data arrives.
Both failures stem from the same root cause: not mapping the dependency structure before deciding how to execute. The parallel/sequential decision is not a style preference. It is a consequence of the graph. Read the graph, and the execution strategy follows.
The protocol: map before you execute
Here is how you apply this in practice, whether you are coordinating AI agents, managing a team, or planning your own work.
Step 1: List every task. Do not organize them yet. Just enumerate the work.
Step 2: For each task, identify inputs. What information, artifact, or decision does this task require before it can begin? If the answer is "nothing" or "only things that already exist," the task has no upstream dependencies.
Step 3: Draw the dependency arrows. Every input requirement becomes an edge from the producing task to the consuming task. You now have a DAG.
Step 4: Identify the critical path. Trace the longest chain of dependent tasks from start to finish. This is your minimum project duration. Everything not on this path has scheduling flexibility.
Step 5: Parallelize everything off the critical path. Any task whose inputs are already available can start immediately, regardless of what else is running. Group independent tasks into parallel batches.
Step 6: Monitor the merge points. Where parallel workstreams converge — where a downstream task needs results from multiple upstream tasks — you need a synchronization mechanism. In AI systems, this is the orchestrator's join operation. In human teams, this is the integration meeting. In personal work, this is the moment you stop generating options and start synthesizing.
This protocol does not require software. It requires a piece of paper and the discipline to ask "what does this task actually need?" before assuming it needs everything.
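That said, steps 3 through 5 reduce to a few lines if you want them automated. Given the dependency map from step 2, a Kahn-style layering repeatedly collects every task whose inputs are satisfied into one parallel batch (the task names below are illustrative):

```python
# Group a DAG into parallel batches: each batch contains every task whose
# prerequisites were all completed in earlier batches (Kahn-style layering).
def parallel_batches(deps):
    done, batches = set(), []
    remaining = dict(deps)
    while remaining:
        ready = {t for t, reqs in remaining.items() if set(reqs) <= done}
        if not ready:
            raise ValueError("cycle detected: not a DAG")
        batches.append(sorted(ready))
        done |= ready
        for t in ready:
            del remaining[t]
    return batches

deps = {  # task -> prerequisites (illustrative)
    "research": [], "quotes": [], "outline": ["research"],
    "draft": ["outline", "quotes"], "review": ["draft"],
}
print(parallel_batches(deps))
# [['quotes', 'research'], ['outline'], ['draft'], ['review']]
```

The first batch is exactly the set of tasks with no incoming edges, and each subsequent batch is the work that the preceding batches unblocked. The cycle check is the protocol's sanity test: if no task is ever ready, your "dependencies" contain a loop and the plan is incoherent, not merely slow.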
From execution mode to shared state
You now understand the structural question: which agents can run simultaneously, and which must wait for prior results? The answer lives in the dependency graph — the map of information flow between tasks.
But the dependency graph reveals a second question that is equally important: when tasks do depend on each other, what exactly passes between them? The sequential constraint exists because agent B needs something from agent A. What is that something? A number? A decision? A document? A status flag? The specificity of your answer determines whether the handoff works or breaks.
This is the bridge to L-0506, where you will learn to design shared state between agents. The dependency graph tells you where information must flow. Shared state design tells you what flows and how. Without the graph, you do not know which agents need to communicate. Without shared state design, the agents that need to communicate cannot.
Parallel execution is how you gain speed. Sequential execution is how you maintain correctness. Shared state is how the sequential agents actually talk to each other. You need all three.
Sources:
- Amdahl, G. M. (1967). "Validity of the Single Processor Approach to Achieving Large Scale Computing Capabilities." AFIPS Conference Proceedings, 30, 483-485.
- Brooks, F. P. (1975). The Mythical Man-Month: Essays on Software Engineering. Addison-Wesley.
- Fischer, R., & Plessow, F. (2015). "Efficient Multitasking: Parallel versus Serial Processing of Multiple Tasks." Frontiers in Psychology, 6, 1366.
- Fischer, R., Gottschalk, C., & Dreisbach, G. (2014). "Context-Sensitive Adjustment of Cognitive Control in Dual-Task Performance." Journal of Experimental Psychology: Learning, Memory, and Cognition, 40(2), 399-416.
- Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2009). Introduction to Algorithms (3rd ed.). MIT Press. [Chapters on DAGs and topological sorting.]
- Microsoft Azure Architecture Center. (2025). "AI Agent Orchestration Patterns." Microsoft Learn.
- Meadows, D. H. (2008). Thinking in Systems: A Primer. Chelsea Green Publishing.