GitHub Copilot ⚠ 古い情報の可能性

AIエージェントに仕事を任せる前に「実行契約」を書く: ツール・権限・停止条件をコード化する実務入門 AIエージェントに仕事を任せる前に「実行契約」を書く: ツール・権限・停止条件をコード化する実務入門

Qiita GitHub Copilot tag · qiita.com · 2026/06/01 21:53 · 2w ago · 📖 2 min

AI 3 行サマリ

結論: これから大事になるのは「プロンプト」より、実行前の契約かもしれない最近のAI開発ツールを見ていると、流れがかなり変わってきたなと思います。
コード補完からチャットへ。
チャットからエージェントへ。
さらに今は、ローカルアプリ、ブラウザ

AIエージェントが実際の業務タスクを自律的にこなす時代が近づく中、「どう指示するか」よりも「どんな権限で、どこまで動かすか」を事前に決めておくことの重要性が増している。

従来のAI活用は、チャットでの対話やコード補完が中心だった。しかし最近はローカル環境やブラウザ、外部APIをまたいで自律動作するエージェント型の利用が増えており、一度動き出したエージェントが意図しない操作を行うリスクも現実のものとなってきた。こうした背景から生まれたのが「実行契約（Execution Contract）」という概念だ。

実行契約とは、エージェントに仕事を委任する前に、使用を許可するツールの種類、アクセスできるリソースの範囲、処理を中断・停止すべき条件などをコードや設定として明示的に定義しておくアプローチを指す。自然言語のプロンプトだけに頼るのではなく、実行環境の制約をコードレベルで埋め込むことで、エージェントの暴走や想定外の副作用を未然に防ごうという考え方だ。

MCP（Model Context Protocol）のようなエージェント間通信の標準化が進む現在、ツールの呼び出しインターフェースが統一されつつある。これにより、どのツールを「呼び出せる状態にするか」をコードで制御することが技術的に現実的になった。実行契約はこのトレンドと親和性が高く、MCPサーバーの設定やシステムプロンプトの構造化と組み合わせることで、より堅牢なエージェント運用が実現できると見られる。

結論: これから大事になるのは「プロンプト」より、実行前の契約かもしれない最近のAI開発ツールを見ていると、流れがかなり変わってきたなと思います。

🧠 GitHub Copilot · 本記事のポイント

停止条件の設計も重要なポイントだ。「ファイルを削除しようとした場合」「外部送信のバイト数が一定を超えた場合」など、具体的なトリガーをあらかじめ定義しておくことで、人間の監視コストを下げながら安全性を確保できる可能性がある。これはソフトウェアエンジニアリングにおける「フェイルセーフ設計」の思想をエージェント運用に応用したものといえる。

海外でも同様の議論は活発で、Anthropicが提唱する「Constitutional AI」やOpenAIの「Preparedness Framework」など、AIの行動範囲を事前定義する取り組みが企業レベルで進んでいる。個人開発者・小規模チームでも、今後はエージェントを動かす前に「実行契約」を書く習慣が標準的なプラクティスになっていく可能性がある。プロンプトエンジニアリングの次のフェーズとして注目しておきたい概念だ。

As AI agents become capable of handling real-world tasks autonomously, the question is shifting from 'how do we prompt them well?' to 'what are they actually allowed to do, and when should they stop?' This reframing — defining tool access, permissions, and stopping conditions in code before an agent starts running — is what the author calls an 'execution contract.'

The trajectory of AI tooling has moved quickly: from code completion to chat interfaces, from chat to agentic workflows that can span local filesystems, browsers, and external APIs. With that autonomy comes risk. An agent that misinterprets a task, or is given broader access than necessary, can cause unintended side effects that are hard to roll back. The execution contract concept is a direct response to this challenge.

Rather than relying solely on natural-language prompts to guide behavior, an execution contract explicitly encodes constraints at the system level: which tools the agent may invoke, which resources it can read or write, and what conditions should trigger a pause or full stop. Think of it as a formalized pre-flight checklist — written not for humans to read aloud, but for the runtime environment to enforce.

This approach aligns naturally with the rise of the Model Context Protocol (MCP), which is standardizing how AI agents discover and call tools. When tool invocation interfaces are uniform, it becomes technically tractable to control which tools are even exposed to an agent in the first place. Combining MCP server configuration with structured system prompts and explicit permission schemas gives developers a layered defense against unintended behavior.

Stopping conditions deserve particular attention. Defining triggers such as 'halt if the agent attempts to delete a file,' 'pause if outbound data exceeds a threshold,' or 'require human confirmation before any write to a production database' borrows directly from fail-safe design patterns in traditional software engineering. Applied to agents, these guards can reduce the need for constant human oversight without sacrificing safety.

The broader industry is converging on similar ideas. Anthropic's Constitutional AI work, OpenAI's Preparedness Framework, and Google DeepMind's safety research all grapple with bounding AI behavior before it executes. What this Qiita article contributes is a ground-level, practitioner-oriented framing: individual developers and small teams can and should adopt execution-contract thinking now, before agentic workflows become too complex to govern retroactively.

The implication is that prompt engineering, while still valuable, may become a secondary concern. The primary discipline for reliable agent deployment could well be contract engineering — specifying, in code, the exact envelope within which an agent is permitted to act. For anyone building with MCP or similar agentic stacks, this is a mental model worth internalizing sooner rather than later.