Category: Blog

Your blog category

Governing the Autonomous: A Deep Dive into AI Agent Governance Frameworks

The landscape of artificial intelligence is undergoing a profound transformation. We are rapidly moving beyond static, reactive models to a new generation of autonomous AI agents—systems capable of independent decision-making, tool use, and goal-seeking behavior in dynamic environments. This shift, while promising unprecedented productivity gains, introduces a critical new challenge: how do we govern systems that are designed to act on their own?

The answer lies in establishing robust AI Agent Governance Frameworks. Without them, organizations risk accumulating a significant “governance debt”—the costly and ineffective process of retrofitting security, compliance, and ethical controls onto a functional prototype. This article explores the urgency of this new governance model, details the four essential pillars of an effective framework, outlines best practices for implementation, and examines the emerging regulatory landscape.

The Urgency: Why AI Agents Demand a New Governance Model

Traditional AI governance, focused primarily on model training data and deployment, is insufficient for the agentic paradigm. The core difference is autonomy. An agent’s ability to sense, reason, plan, and act independently in a complex ecosystem elevates the risk profile significantly.

The key challenges that necessitate a specialized governance framework include:

Unpredictable Autonomy: Unlike a fixed application, an agent’s actions are not entirely predetermined. Its ability to choose tools, modify its plan, and learn from interactions can lead to emergent and unpredictable behaviors that are difficult to trace and control.
Expanded Attack Surface: Agents are often granted access to a suite of external tools, APIs, and sensitive data sources to perform their tasks. This broad access, combined with the agent’s autonomy, creates a tempting target for malicious use and increases the potential for catastrophic unintended actions.
Goal Misalignment and Drift: The risk that an agent, in its pursuit of a specific, narrow objective, may take actions that violate broader organizational policies, ethical standards, or legal requirements. This is a form of “optimization gone wrong” that requires constant oversight.

Technical Risks Unique to Agentic Systems

The technical architecture of AI agents introduces specific vulnerabilities that traditional security models fail to address. These risks are directly tied to the agent’s ability to reason and use tools:

Prompt Injection: This is arguably the most critical security risk. An attacker can manipulate an agent’s behavior by injecting malicious instructions into a user prompt or even into data the agent processes. Because autonomous agents make decisions without constant human input, a successful prompt injection can lead to unauthorized actions, data exfiltration, or systemic compromise.
Tool Misuse and Privilege Compromise: Agents are defined by their ability to use tools (e.g., calling APIs, executing code, accessing databases). If an agent’s credentials are stolen or its logic is compromised, an attacker can leverage the agent’s broad access to perform unauthorized actions, such as deleting data or making financial transactions. This is compounded by the principle of Least Privilege Access being violated in the rush to deploy.
Memory Poisoning: Agents often maintain a “memory” or context of past interactions to inform future decisions. An attacker can “poison” this memory with malicious or biased information, leading to persistent, harmful behavior that is difficult to detect and remediate.

To mitigate these risks, governance must be a first-class citizen from the moment an agent is conceived, not a final, rushed checklist item before deployment.

The Four Pillars of AI Agent Governance

Effective AI agent governance rests on four interconnected pillars, each addressing a specific dimension of the agent’s lifecycle and operation. These pillars move beyond simple policy documents to encompass technical controls and continuous processes.

Pillar	Guiding Principle	Core Focus	Implementation Tools/Practices
1. Lifecycle Management	Separation of Duties	Governing how an agent is built, updated, and maintained across environments.	Version control (Git), CI/CD pipelines, distinct Dev/Staging/Prod environments, mandatory code/change reviews, and deployment tools with instant rollback capabilities.
2. Risk Management	Defense in Depth	Protecting the agent from failure modes, unintended consequences, and compliance violations.	Data quality monitoring, PII detection and masking, behavioral guardrails, compliance checks, model validation suites, and content filters on inputs and outputs.
3. Security	Least Privilege Access	Controlling and verifying access to the agent, its tools, and the data it interacts with.	Granular access controls (RBAC), API key management, Single Sign-On (SSO), Multi-Factor Authentication (MFA), and secure secret management systems.
4. Observability	Audit Everything	Providing the capability to understand the agent’s actions, decisions, and complete chain of reasoning.	Comprehensive logging (audit, inference, access), data lineage tracking, monitoring systems, and complete traceability to enable forensic analysis and debugging.

1. Lifecycle Management: The Path to Production

The principle of Separation of Duties is paramount here. No single team or individual should have unilateral control over an agent’s deployment. This pillar mandates distinct, isolated environments (Development, Staging, Production) and rigorous change management processes. Changes must move systematically through these environments, with mandatory review and testing at each stage. The ability to instantly roll back a deployment is a non-negotiable requirement for autonomous systems. This ensures that every change is reviewed, tested, and approved in a controlled manner, preventing the introduction of vulnerabilities or unintended behavior into the production environment.

2. Risk Management: Building Resilient Systems

Defense in Depth is the core strategy for managing risk. This means employing multiple, overlapping layers of protection. If one layer—such as a prompt injection filter—fails, another layer—such as a behavioral guardrail preventing external API calls—should catch the problem. This includes proactive measures like continuous data quality monitoring, PII detection to prevent data leakage, and compliance checks to ensure the agent’s actions align with regulatory mandates. Behavioral guardrails, in particular, are crucial for agents, as they define the boundaries of acceptable action and can halt an agent’s execution if it attempts to perform a high-risk or unauthorized task.

3. Security: Minimizing the Blast Radius

The guiding principle of Least Privilege Access is crucial for autonomous agents. Every user, and the agent itself (via its service principal), should possess only the minimum permissions necessary to perform its function. This limits the potential damage from both accidental errors and malicious attacks. Implementing granular, role-based access control (RBAC) for all tools, data sources, and APIs the agent can access is essential. Furthermore, the agent’s identity and credentials must be managed with the same rigor as any human or system administrator, utilizing secure secret management systems and strong authentication protocols.

4. Observability: The Forensic Imperative

For autonomous agents, Audit Everything is the only acceptable standard. Observability goes beyond simple application logs; it requires capturing the agent’s entire chain of reasoning. Every interaction, tool use, data access, and decision point must be logged and traceable. This comprehensive logging is not just for debugging; it is a forensic imperative for compliance, security incident response, and understanding why an agent chose a particular course of action. Standards like OpenTelemetry can provide a foundation, but a full agent governance platform must offer deeper lineage tracking, allowing for the complete reconstruction of any agent’s activity timeline.

The Emerging Regulatory Landscape

As AI agents move from research labs to the enterprise, regulatory bodies are adapting existing frameworks to address their unique risks. Organizations must align their governance frameworks with these global standards.

The NIST AI Risk Management Framework (AI RMF)

The National Institute of Standards and Technology (NIST) AI RMF provides a voluntary, non-sector-specific framework for managing risks associated with AI systems. For AI agents, the AI RMF is particularly relevant because it emphasizes a continuous, lifecycle-based approach to risk management.

The core functions of the AI RMF—Govern, Map, Measure, and Manage—apply directly to the four pillars of agent governance:

Govern: Establishes the culture of risk management, aligning with the Lifecycle Management and Security pillars.
Map: Identifies and analyzes AI risks, directly supporting the Risk Management pillar.
Measure: Quantifies the risks and evaluates controls, providing the metrics needed for the Observability pillar.
Manage: Allocates resources and implements risk controls, ensuring the continuous operation of the entire governance framework.

The EU AI Act

The European Union’s AI Act is the world’s first comprehensive legal framework for AI, adopting a risk-based approach that has significant implications for AI agents. The Act classifies AI systems into four risk categories: Unacceptable, High, Limited, and Minimal.

For AI agents, the key implications are:

High-Risk Classification: Many enterprise AI agents, especially those used in critical areas like employment, credit scoring, or public services, will likely fall under the High-Risk category. This mandates strict compliance requirements, including quality management systems, logging capabilities, transparency, and human oversight.
General-Purpose AI (GPAI) Models: Since most agents are built on top of powerful GPAI models (like large language models), the providers of these foundational models must also comply with specific transparency and risk mitigation requirements, especially if the model is deemed to pose a systemic risk.
Four Pillars of Governance: The EU AI Act governs agents through four primary pillars: risk assessment, transparency tools, technical deployment controls, and human oversight design. This regulatory structure reinforces the need for the technical controls outlined in the four pillars of agent governance.

Best Practices for Implementation

Implementing an effective AI agent governance framework requires a cultural and technical shift. Here are key best practices:

Integrate Governance from Day One: Treat governance as a core architectural requirement, not a post-development task. While it may add an initial 20-30% to the development time, it dramatically reduces the total time and cost required to safely deploy to production by preventing costly rework and security incidents.
Define Clear Decision Boundaries: Explicitly set the scope of the agent’s autonomy. For any action that is high-risk, irreversible, or outside a predefined boundary, the agent must have an established escalation protocol—a mechanism to pause, flag the action, and seek human review or approval.
Establish Shared Responsibility: Agent governance is not solely the domain of the security or compliance team. It requires a collaborative structure involving AI developers, MLOps engineers, security officers, legal counsel, and business stakeholders, with clear ownership defined for each of the four pillars.
Implement Continuous Adaptation: The governance framework must be as dynamic as the agents it oversees. Conduct formal quarterly reviews, but also implement continuous monitoring to adapt policies and controls as new risks emerge, regulations change, and the agent’s capabilities evolve.

Conclusion: From Prototype to Production-Ready

The move to autonomous AI agents is inevitable, but their safe and responsible deployment is not. The difference between a fragile prototype and a robust, trustworthy system is a comprehensive AI Agent Governance Framework.

Investing in the four pillars—Lifecycle Management, Risk Management, Security, and Observability—is not a cost center; it is a strategic investment that accelerates safe deployment and prevents catastrophic failure. The question for every organization is no longer if they will build an AI agent, but how they will govern it.

November 20, 2025

Mastering Parallel AI Agents: Turbocharge Your Workflows with Concurrent Execution

The landscape of artificial intelligence is rapidly evolving, moving beyond single-query interactions to sophisticated, multi-step workflows orchestrated by AI agents. These agents, powered by large language models (LLMs), can autonomously plan, use tools, and iterate to solve complex problems. However, as the complexity of tasks increases, a critical bottleneck emerges: sequential processing. A workflow where Agent A must wait for Agent B, which in turn waits for Agent C, quickly becomes inefficient and slow.

This challenge has given rise to the paradigm of Parallel AI Agents. In this model, multiple agents or sub-tasks are executed simultaneously, working in concert to tackle a single, overarching goal. This approach is not merely an optimization; it is the key to unlocking the true potential of multi-agent systems, leading to faster, more robust, and more complex AI applications. This post will explore the architecture, benefits, and provide practical code examples across three popular frameworks: LangGraph, Google’s Agent Development Kit (ADK), and CrewAI.

The Power of Parallelism: Why It Matters

The shift from sequential to parallel execution offers profound advantages that fundamentally change the performance profile of agentic systems.

Speed and Efficiency

The most immediate benefit is the dramatic reduction in latency. For tasks like comprehensive research or complex code generation, where multiple sources or components need to be analyzed, running these steps concurrently can cut the total execution time by a factor proportional to the number of parallel branches. This is crucial for applications requiring near real-time responsiveness.

Handling Complexity

Parallelism allows for the effective decomposition of large, complex problems. A single, monolithic task can be broken down into smaller, manageable sub-tasks that are inherently independent. For example, a financial analysis task can simultaneously dispatch agents to gather stock data, read news sentiment, and analyze quarterly reports. This divide-and-conquer strategy simplifies the design of individual agents while enabling the system to handle a far greater degree of complexity.

Robustness and Cross-Validation

A parallel architecture inherently introduces a mechanism for cross-validation and increased robustness. By having multiple agents approach the same problem from different angles—perhaps using different tools or different LLM prompts—the system can compare and reconcile their findings. This redundancy helps to mitigate the risk of a single agent hallucinating or failing, leading to a more reliable final output.

Advanced Parallelism Concepts

While simple parallel execution is powerful, advanced multi-agent systems leverage more sophisticated concepts:

Conditional Parallelism: The ability to dynamically decide which branches of a workflow should run in parallel based on the outcome of a preceding step. For example, a planning agent might decide to run two research agents in parallel only if the initial search query is ambiguous.
Dynamic Forking: Creating an arbitrary number of parallel agents at runtime. This is crucial for tasks like processing a list of documents or analyzing a batch of data points, where the number of parallel tasks is not fixed beforehand.
Shared Memory Management: In complex scenarios, parallel agents may need to read from and write to a shared state simultaneously. This requires careful implementation of concurrency controls (like locks or atomic operations) to prevent race conditions and ensure data integrity. Frameworks abstract this complexity, but understanding the underlying mechanism is key to debugging and scaling.

Architectural Deep Dive: Sequential vs. Parallel MAS

A Multi-Agent System (MAS) is defined by its core components: the agents (LLMs with reasoning capabilities), the tools they can use, the memory that maintains state, and the orchestrator that manages the flow. The difference between a sequential and a parallel MAS lies entirely in the orchestration layer.

Feature	Sequential Multi-Agent System	Parallel Multi-Agent System
Execution Flow	Linear: Agent A → Agent B → Agent C	Concurrent: Agent A and Agent B run at the same time
Dependency	High: Each step depends on the output of the previous one.	Low: Branches are independent until the merge point.
Latency	High: Sum of all agent execution times.	Low: Determined by the longest-running parallel branch.
Use Case	Step-by-step refinement (e.g., Plan, Execute, Review).	Information gathering, cross-validation, simultaneous task execution.

The Orchestrator is the central component that enables parallelism. It must be capable of forking, state management, and joining.

Practical Implementation: A Multi-Framework Approach

The concept of parallel agents is implemented differently across various frameworks, but the core principle remains the same: execute independent tasks concurrently. We will explore how this is achieved in three major frameworks: LangGraph, Google ADK, and CrewAI.

1. LangGraph: Graph-Based Parallelism

The LangGraph framework, an extension of LangChain, is perfectly suited for building these stateful, multi-actor applications. It models the workflow as a graph, where nodes are agents or functions, and edges define the flow of execution. The key to implementing parallelism in LangGraph is to define multiple outgoing edges from a single node (or the START node) to the nodes that should run concurrently.

Code Sample: Parallel Agents with LangGraph

This example runs two distinct agents in parallel to answer a single query and then merges their findings .

import os
from typing import TypedDict
from dotenv import load_dotenv
from langchain_openai import ChatOpenAI
from langchain_google_genai import ChatGoogleGenerativeAI # For direct Gemini usage
from langchain.agents import AgentExecutor, create_tool_calling_agent
from langchain_core.prompts import PromptTemplate
from langchain_community.tools import WikipediaQueryRun, DuckDuckGoSearchRun
from langchain_community.utilities import WikipediaAPIWrapper
from langgraph.graph import StateGraph, END, START

# --- 1. Define Graph State (Shared Memory) ---
class GraphState(TypedDict):
    input: str
    agent_a_result: str
    agent_b_result: str
    output: str

# --- 2. Define Agent Nodes (Functions) ---
# Note: In a real implementation, you would initialize the LLM here:
# llm = ChatGoogleGenerativeAI(model="gemini-2.5-flash") 
# and use it within the run_agent_a/b functions.

def run_agent_a(state: GraphState) -> dict:
    # ... Agent A execution logic using the LLM ...
    return {"agent_a_result": "Result from Agent A (e.g., historical data)"}

def run_agent_b(state: GraphState) -> dict:
    # ... Agent B execution logic using the LLM ...
    return {"agent_b_result": "Result from Agent B (e.g., current events)"}

def merge_results(state: GraphState) -> dict:
    # ... Merging logic, potentially using a final LLM call ...
    return {"output": f"Merged: {state['agent_a_result']} and {state['agent_b_result']}"}

# --- 3. Build the LangGraph ---
workflow = StateGraph(GraphState)
workflow.add_node("run_agent_a", run_agent_a)
workflow.add_node("run_agent_b", run_agent_b)
workflow.add_node("merge_results", merge_results)

# The key to parallelism: Add two edges from START
workflow.add_edge(START, "run_agent_a")
workflow.add_edge(START, "run_agent_b")

# The key to joining: Add edges from parallel nodes to a single merge node
workflow.add_edge("run_agent_a", "merge_results")
workflow.add_edge("run_agent_b", "merge_results")
workflow.add_edge("merge_results", END)

app = workflow.compile()

Code Explanation:

GraphState: This TypedDict is the single source of truth. The parallel agents write to separate keys (agent_a_result, agent_b_result) to avoid conflicts.
Parallel Edges (START to Agents): By defining multiple outgoing edges from the START node, LangGraph automatically executes the connected nodes (run_agent_a and run_agent_b) concurrently.
Joining Edges (Agents to merge_results): The merge_results node has two incoming edges. LangGraph’s default behavior is to wait for all incoming edges to complete before executing the node, ensuring both parallel results are available for merging.

This graph-based approach provides explicit, visual control over the parallel flow, making it highly suitable for complex, multi-step agentic reasoning.

2. Google ADK: The `ParallelAgent`

Google’s Agent Development Kit (ADK) provides a dedicated ParallelAgent class, a workflow agent designed to execute its sub-agents concurrently. This is ideal for scenarios where tasks are independent and speed is critical.

Code Sample: Parallel Web Research with Google ADK

This example sets up two specialized research agents and runs them in parallel, followed by a final synthesis agent

# Conceptual example based on ADK documentation.
from adk.agents.workflow_agents import ParallelAgent, SequentialAgent
from adk.agents.llm_agent import LlmAgent
from adk.tools import GoogleSearchTool # Example tool

# Define the model to be used
GEMINI_MODEL = "gemini-2.5-flash"

# --- 1. Define Researcher Sub-Agents (LlmAgents) ---
# Each agent is specialized and configured to store its result in a specific state key
researcher_agent_1 = LlmAgent(
    name="RenewableEnergyResearcher",
    model=GEMINI_MODEL, # Explicitly set the model
    instruction="Research the latest advancements in renewable energy.",
    tools=[GoogleSearchTool()],
    output_key="renewable_energy_result" # Key for result storage
)

researcher_agent_2 = LlmAgent(
    name="EVResearcher",
    model=GEMINI_MODEL, # Explicitly set the model
    instruction="Research the latest developments in electric vehicle technology.",
    tools=[GoogleSearchTool()],
    output_key="ev_technology_result"
)

# --- 2. Create the ParallelAgent (The Orchestrator) ---
parallel_research_agent = ParallelAgent(
    name="ParallelWebResearchAgent",
    sub_agents=[researcher_agent_1, researcher_agent_2],
    description="Runs multiple research agents in parallel."
)

# --- 3. Define the Merger Agent (Sequential Step) ---
merger_agent = LlmAgent(
    name="SynthesisAgent",
    model=GEMINI_MODEL, # Explicitly set the model
    instruction="Synthesize the findings from {renewable_energy_result} and {ev_technology_result} into a single report.",
    # Note: The input instruction implicitly uses the results stored by the parallel agents
)

# --- 4. Create the SequentialAgent to orchestrate the flow ---
# This ensures the parallel step runs first, followed by the merger.
sequential_pipeline_agent = SequentialAgent(
    name="ResearchAndSynthesisPipeline",
    sub_agents=[parallel_research_agent, merger_agent]
)

Code Explanation:

model=GEMINI_MODEL: The model is explicitly defined for each LlmAgent, ensuring the fast and efficient gemini-2.5-flash is used for the parallel research tasks.
LlmAgent with output_key: Each sub-agent is an LlmAgent that, upon completion, automatically writes its final output to the shared session state using the specified output_key.
ParallelAgent: This non-LLM workflow agent is the core of the parallelism. It simply takes its list of sub_agents and executes them all concurrently. It waits for all of them to finish before it is considered complete.
SequentialAgent: This agent is used to enforce the join operation. By placing the parallel_research_agent first and the merger_agent second, the system guarantees that the merger only runs after the parallel step has completed and all results have been written to the shared state.

This clear separation of concerns—parallel execution handled by ParallelAgent and sequential flow control by SequentialAgent—makes ADK workflows highly modular.

3. CrewAI: Asynchronous Task Execution

CrewAI achieves parallelism at the task level through asynchronous execution. Any task can be marked for concurrent execution by setting async_execution=True.

Code Sample: Parallel Tasks with CrewAI

This example shows two tasks that will run concurrently, followed by a final sequential task that synthesizes the results.

# Conceptual example based on CrewAI documentation.
from crewai import Agent, Task, Crew, Process
from crewai_tools import SerperDevTool # Example tool

# --- 1. Define Agent ---
researcher = Agent(
    role='Senior Research Analyst',
    goal='Gather and synthesize market and competitor data for a new product launch.',
    backstory='Expert in rapid, multi-source data collection and executive summary writing.',
    llm="gemini-2.5-flash", # Explicitly set the model
    tools=[SerperDevTool()],
    verbose=True
)

# --- 2. Define Parallel Tasks ---
market_task = Task(
    description='Analyze the current market trends for AI agents, focusing on growth rate and key adoption drivers.',
    agent=researcher,
    expected_output='A 3-point summary of current market trends and growth projections.',
    async_execution=True # KEY: Marks this task for concurrent execution
)

competitor_task = Task(
    description='Identify and summarize the top 3 competitors in the AI agent space, detailing their core product and market share.',
    agent=researcher,
    expected_output='A list of 3 competitors with a one-sentence description and estimated market share.',
    async_execution=True # KEY: Marks this task for concurrent execution
)

# --- 3. Define Sequential Synthesis Task (The Join) ---
synthesis_task = Task(
    description='Synthesize the market and competitor analysis into a single executive summary report for the CEO.',
    agent=researcher,
    expected_output='A final executive summary report, formatted in Markdown.',
    # KEY: The context explicitly tells the agent to wait for the results of the parallel tasks
    context=[market_task, competitor_task] 
)

# --- 4. Form the Crew ---
research_crew = Crew(
    agents=[researcher],
    tasks=[market_task, competitor_task, synthesis_task],
    process=Process.sequential # The tasks are processed in the order they are listed
)

# To run the crew:
# result = research_crew.kickoff()

Code Explanation:

llm="gemini-2.5-flash": The model is explicitly set in the Agent definition, ensuring the fast and efficient gemini-2.5-flash is used for all tasks executed by this agent.
async_execution=True: This is the core mechanism for parallelism in CrewAI. When the crew is kicked off, any task marked with this flag will be executed concurrently.
context=[market_task, competitor_task]: This is the explicit join mechanism. The synthesis_task will not begin until the tasks listed in its context have completed. The agent assigned to the synthesis task will then use the outputs of the parallel tasks as its input.

This task-centric approach to parallelism is highly intuitive for defining collaborative workflows where multiple pieces of information need to be gathered before a final, single action is taken.

Conclusion

The future of AI automation is collaborative and concurrent. By understanding the different approaches to parallelism offered by frameworks like LangGraph, Google ADK, and CrewAI, developers can select the right tool for the job. Whether it’s the explicit graph-based forking of LangGraph, the dedicated ParallelAgent of ADK, or the asynchronous task execution of CrewAI, mastering concurrent execution is the essential next step in building the next generation of intelligent, high-performance AI applications. Start experimenting with parallel agents today to turbocharge your workflows.

November 19, 2025

Building Sequential AI Agents: A Guide to Predictable and Scalable Automation

In the rapidly evolving landscape of artificial intelligence, a new architectural paradigm is gaining prominence: the sequential AI agent. This approach, which involves chaining together specialized AI agents in a predefined order, offers a powerful way to build predictable, scalable, and maintainable AI systems. By breaking down complex tasks into a series of manageable steps, each handled by a dedicated agent, developers can create robust workflows that deliver high-quality results with greater reliability.

This blog post provides a comprehensive guide to building sequential AI agents. We will explore the core concepts behind this architectural pattern, discuss various implementation strategies using popular frameworks, and share best practices for designing, testing, and deploying these systems in production. Whether you are a seasoned AI developer or just starting your journey, this guide will equip you with the knowledge and tools you need to harness the power of sequential AI agents for your own projects.

The Core of Sequential Agents: Predictability and Specialization

At its heart, the sequential AI agent pattern is about creating a predictable and structured workflow. It draws inspiration from the classic “pipes and filters” architecture in software engineering, where data flows through a series of components, each performing a specific transformation. In the context of AI, these components are specialized agents, each with a clearly defined role and a specific set of capabilities.

This approach stands in contrast to monolithic, single-agent systems, which can become unwieldy and difficult to manage as task complexity increases. By breaking down a large problem into smaller, more manageable sub-tasks, sequential agents offer several key advantages:

Advantage	Description
Modularity	Each agent is a self-contained unit with a specific responsibility, making it easier to develop, test, and maintain.
Specialization	Agents can be optimized for specific tasks, leading to higher-quality outputs and more efficient use of resources.
Predictability	The workflow is deterministic, with a clear and explicit sequence of steps, which simplifies debugging and troubleshooting.
Scalability	New agents can be easily added to the pipeline to extend its functionality, without requiring a complete redesign of the system.

This modular and specialized approach not only improves the reliability of AI systems but also makes them more transparent and easier to understand. By observing the output of each agent in the sequence, developers can gain valuable insights into the system’s decision-making process and identify areas for improvement.

Architecture and Implementation: A Tale of Three Frameworks

While the concept of sequential agents is straightforward, the implementation can vary significantly depending on the chosen framework. To illustrate this, we will explore how to build a simple code generation pipeline using three popular frameworks: LangGraph, Google’s Agent Development Kit (ADK), and CrewAI. Our pipeline will consist of three agents:

Code Writer Agent: Generates the initial Python code based on a user’s request.
Code Reviewer Agent: Reviews the generated code for errors and suggests improvements.
Code Refactorer Agent: Refactors the code based on the reviewer’s feedback.

LangGraph: The Composable Approach

LangGraph, a library from LangChain, provides a flexible and composable way to build agentic workflows. It uses a state-based approach, where each agent is a node in a graph that modifies a shared state object. This makes it easy to create complex, multi-agent systems with clear data flow.

Here is a simplified example of how to implement our code pipeline using LangGraph:

from typing import TypedDict, List
from langchain_openai import ChatOpenAI
from langgraph.graph import StateGraph, START, END

# Define a more detailed state to pass between agents
class CodeGenerationState(TypedDict):
    user_request: str
    generated_code: str
    review_comments: str
    refactored_code: str

# Initialize the language model
llm = ChatOpenAI(model="gemini-2.5-flash")

# Code Writer Agent: Generates the initial code.
def run_code_writer(state: CodeGenerationState) -> CodeGenerationState:
    prompt = f"You are a senior Python developer. Based on the user's request, write the initial Python code. Request: {state["user_request"]}"
    response = llm.invoke(prompt)
    return {"generated_code": response.content}

# Code Reviewer Agent: Reviews the code from the writer.
def run_code_reviewer(state: CodeGenerationState) -> CodeGenerationState:
    prompt = f"You are a code reviewer. Review the following Python code and provide constructive feedback on errors, style, and improvements. Code: {state["generated_code"]}"
    response = llm.invoke(prompt)
    return {"review_comments": response.content}

# Code Refactorer Agent: Refactors the code based on the review.
def run_code_refactorer(state: CodeGenerationState) -> CodeGenerationState:
    prompt = f"You are a code refactorer. Take the original code and the review comments, then refactor the code to address the feedback. Original Code: {state["generated_code"]}nReview Comments: {state["review_comments"]}"
    response = llm.invoke(prompt)
    return {"refactored_code": response.content}

# Build the graph
graph = StateGraph(CodeGenerationState)
graph.add_node("writer", run_code_writer)
graph.add_node("reviewer", run_code_reviewer)
graph.add_node("refactorer", run_code_refactorer)

# Connect the nodes in a sequence
graph.add_edge(START, "writer")
graph.add_edge("writer", "reviewer")
graph.add_edge("reviewer", "refactorer")
graph.add_edge("refactorer", END)

# Compile the graph into a runnable application
app = graph.compile()

In this example, each agent is a node in the StateGraph, and the edges define the sequential flow of the workflow. The AgentState object is passed from one agent to the next, allowing them to share information and build upon each other’s work.

Google’s Agent Development Kit (ADK): The Programmatic Approach

Google’s ADK offers a more programmatic and structured way to build sequential agents. It provides a SequentialAgent class that orchestrates a list of sub-agents in a predefined order. This approach is well-suited for production environments where reliability and explicit control are paramount.

Here is how our code pipeline can be implemented using ADK:

from google.adk.agents.sequential_agent import SequentialAgent
from google.adk.agents.llm_agent import LlmAgent

# Code Writer Agent: Writes the initial Python code.
code_writer_agent = LlmAgent(
    name="CodeWriterAgent",
    model="gemini-2.5-flash",
    instruction="""You are a senior Python developer.
    Based on the user's request, write the initial Python code.
    Output *only* the raw code block.""",
    output_key="generated_code"
)

# Code Reviewer Agent: Reviews the code and provides feedback.
code_reviewer_agent = LlmAgent(
    name="CodeReviewerAgent",
    model="gemini-2.5-flash",
    instruction="""You are a code reviewer.
    Review the Python code provided in the session state under the key 'generated_code'.
    Provide constructive feedback on potential errors, style issues, or improvements.""",
    output_key="review_comments"
)

# Code Refactorer Agent: Refactors the code based on review comments.
code_refactorer_agent = LlmAgent(
    name="CodeRefactorerAgent",
    model="gemini-2.5-flash",
    instruction="""You are a code refactorer.
    Take the original Python code from 'generated_code' and the review comments from 'review_comments'.
    Refactor the original code to address the feedback and improve its quality.""",
    output_key="refactored_code"
)

# Create the sequential agent that orchestrates the pipeline
code_pipeline_agent = SequentialAgent(
    name="CodePipelineAgent",
    sub_agents=[code_writer_agent, code_reviewer_agent, code_refactorer_agent]
)

The SequentialAgent in ADK handles the orchestration automatically, executing each sub-agent in the order they are provided in the list. It also manages the state, passing the InvocationContext from one agent to the next, which allows them to share data through a shared state mechanism.

CrewAI: The Declarative Approach

CrewAI provides a high-level, declarative API for building multi-agent systems. It focuses on making the development process more intuitive and human-readable, with a clear separation of concerns between agents, tasks, and processes. [4]

Here is the CrewAI implementation of our code pipeline:

from crewai import Crew, Process, Agent, Task

# Code Writer Agent: A senior developer who writes the initial code.
code_writer = Agent(
  role='Senior Python Developer',
  goal='Write clean, efficient, and functional Python code based on user specifications.',
  backstory='You are a seasoned Python developer with over 10 years of experience, specializing in writing robust and scalable code.',
  llm='gemini-2.5-flash'
)

# Code Reviewer Agent: A detail-oriented reviewer who checks for quality.
code_reviewer = Agent(
  role='Code Reviewer',
  goal='Thoroughly review Python code for bugs, style issues, and potential improvements.',
  backstory='You are a meticulous code reviewer with a keen eye for detail, ensuring all code meets the highest standards of quality and follows PEP 8 conventions.',
  llm='gemini-2.5-flash'
)

# Code Refactorer Agent: An optimization expert who improves the code.
code_refactorer = Agent(
  role='Code Refactorer',
  goal='Refactor Python code to implement reviewer suggestions and optimize for performance and readability.',
  backstory='You are an optimization expert who excels at transforming code to be more efficient, maintainable, and elegant.',
  llm='gemini-2.5-flash'
)

# Task for the Code Writer Agent
writing_task = Task(
  description='Write a Python function that implements: {user_request}.',
  agent=code_writer,
  expected_output='A single Python code block containing the complete function.'
)

# Task for the Code Reviewer Agent, which depends on the writer's output
review_task = Task(
  description='Review the code for bugs, style issues, and areas for improvement.',
  agent=code_reviewer,
  expected_output='A list of constructive comments and suggestions for the code.',
  context=[writing_task]
)

# Task for the Code Refactorer Agent, which depends on the previous tasks
refactor_task = Task(
  description='Refactor the code based on the provided review feedback.',
  agent=code_refactorer,
  expected_output='The final, refactored Python code block that incorporates all feedback.',
  context=[writing_task, review_task]
)

# Form the crew with a sequential process
code_crew = Crew(
  agents=[code_writer, code_reviewer, code_refactorer],
  tasks=[writing_task, review_task, refactor_task],
  process=Process.sequential
)

# Execute the crew with a sample request
result = code_crew.kickoff(inputs={"user_request": "Create a function to calculate the nth Fibonacci number recursively"})

With CrewAI, you define the agents and their corresponding tasks separately, and then assemble them into a Crew. By setting the process to Process.sequential, you instruct the crew to execute the tasks in the order they are provided. CrewAI handles the state management implicitly, allowing tasks to access the outputs of previous tasks in a contextual manner.

Best Practices and Design Patterns

Building robust and effective sequential agent systems requires more than just choosing the right framework. It also involves adhering to a set of best practices and understanding the broader landscape of agentic design patterns. This section provides practical guidance for designing, building, and deploying sequential agents in production.

Core Principles for Success

Based on insights from industry leaders like Anthropic and Google, several core principles have emerged for building successful agentic systems:

Start Simple: Begin with the simplest possible solution and only add complexity when necessary. Often, a well-designed prompt or a simple workflow is more effective than a complex multi-agent system.
Embrace Composability: Build your system from small, reusable components that can be easily combined and reconfigured. This modular approach makes your system more flexible and easier to maintain.
Prioritize Predictability: For many production use cases, predictability and reliability are more important than full autonomy. Sequential workflows, with their deterministic nature, are often the preferred choice for this reason.

Key Design Patterns for Sequential Agents

While the sequential pattern is powerful on its own, it can be combined with other design patterns to create even more sophisticated and robust systems. Here are some of the most relevant patterns to consider:

Pattern	Description
ReAct (Reason and Act)	Agents alternate between reasoning about a problem and taking action to solve it. This iterative approach allows agents to adapt to new information and changing conditions.
Reflection	Agents critique their own work, identifying errors and areas for improvement. This self-evaluation process can significantly improve the quality of the final output.
Planning	Agents create a detailed plan before they begin executing a task. This upfront planning helps to ensure that the agent has a clear path to success and can avoid potential pitfalls.
Tool Use	Agents are given access to external tools, such as APIs, databases, and code interpreters. This allows them to interact with the outside world and perform a wider range of tasks.
Human-in-the-Loop	At critical junctures in the workflow, the agent pauses and requests input from a human. This ensures that a human is always in control of high-stakes decisions.

By thoughtfully combining these patterns, you can create sequential agent systems that are not only predictable and scalable but also adaptive, robust, and safe.

Practical Recommendations for Implementation

Here are some practical recommendations to keep in mind as you build your sequential agent systems:

Define Clear Agent Specialization: Each agent in your pipeline should have a clear and focused role. This makes it easier to design, test, and debug your system.
Establish Explicit Data Flow: The data dependencies between your agents should be explicit and well-defined. This will help to prevent unexpected errors and make your system easier to understand.
Implement Robust Error Handling: Failures are inevitable in any complex system. Be sure to include mechanisms to handle failures at each step of your pipeline, such as retries, fallbacks, and notifications.
Test Each Agent Individually: Before you assemble your pipeline, be sure to test each agent individually to ensure that it is working as expected.
Monitor Your System in Production: Once your system is deployed, be sure to monitor its performance to identify any potential issues and ensure that it is meeting your expectations.

Conclusion: The Future is Sequential

Sequential AI agents represent a significant step forward in the development of reliable and scalable AI systems. By embracing a modular, predictable, and specialized approach, developers can build robust workflows that are easier to design, test, and maintain. While the allure of fully autonomous, generalist agents is strong, the practical reality is that for many real-world applications, the structured and deterministic nature of sequential agents offers a more pragmatic and effective path to success.

As the field of AI continues to evolve, we can expect to see even more sophisticated tools and frameworks for building sequential and other multi-agent systems. However, the core principles of modularity, specialization, and predictability will remain as important as ever. By mastering these principles and thoughtfully applying the design patterns and best practices discussed in this guide, you will be well-equipped to build the next generation of intelligent and reliable AI applications.

November 16, 2025

Category: Blog

Governing the Autonomous: A Deep Dive into AI Agent Governance Frameworks

The Urgency: Why AI Agents Demand a New Governance Model

Technical Risks Unique to Agentic Systems

The Four Pillars of AI Agent Governance

1. Lifecycle Management: The Path to Production

2. Risk Management: Building Resilient Systems

3. Security: Minimizing the Blast Radius

4. Observability: The Forensic Imperative

The Emerging Regulatory Landscape

The NIST AI Risk Management Framework (AI RMF)

The EU AI Act

Best Practices for Implementation

Conclusion: From Prototype to Production-Ready

Mastering Parallel AI Agents: Turbocharge Your Workflows with Concurrent Execution

The Power of Parallelism: Why It Matters

Speed and Efficiency

Handling Complexity

Robustness and Cross-Validation

Advanced Parallelism Concepts

Architectural Deep Dive: Sequential vs. Parallel MAS

Practical Implementation: A Multi-Framework Approach

1. LangGraph: Graph-Based Parallelism

Code Sample: Parallel Agents with LangGraph

2. Google ADK: The ParallelAgent

Code Sample: Parallel Web Research with Google ADK

3. CrewAI: Asynchronous Task Execution

Code Sample: Parallel Tasks with CrewAI

Conclusion

Building Sequential AI Agents: A Guide to Predictable and Scalable Automation

The Core of Sequential Agents: Predictability and Specialization

Architecture and Implementation: A Tale of Three Frameworks

LangGraph: The Composable Approach

Google’s Agent Development Kit (ADK): The Programmatic Approach

CrewAI: The Declarative Approach

Best Practices and Design Patterns

Core Principles for Success

Key Design Patterns for Sequential Agents

Practical Recommendations for Implementation

Conclusion: The Future is Sequential

2. Google ADK: The `ParallelAgent`