Papers / Benchmarks ⚠ 古い情報の可能性

TrajAudit: エージェント型コーディングシステムの障害自動診断 TrajAudit: Automated Failure Diagnosis for Agentic Coding Systems

arXiv cs.SE · arxiv.org · 2026/05/27 13:00 · 3w ago · 📖 1 min

元記事を読む古い情報の可能性

AI 3 行サマリ

バグ修正などを行うエージェント型AIシステムの失敗原因を自動診断するフレームワーク「TrajAudit」を提案した研究論文。

English summary

TrajAudit is a proposed framework for automated failure diagnosis in agentic coding systems such as AI-driven bug fixers, helping explain why tasks go wrong.

エージェント型AIシステムはバグ修正などのソフトウェアエンジニアリング作業を自動化する手段として広く研究されている。しかし複雑なタスクに取り組む場合、どの時点でなぜ失敗したかを把握することが困難であり、改善の妨げとなっていた。

TrajAuditはエージェントの実行軌跡（トラジェクトリ）を解析し、障害箇所と原因を自動的に特定することを目的としたフレームワークである。arXiv:2605.26563として2026年5月に公開された。詳細な手法や評価結果は原論文を参照されたい。

Agentic AI systems designed to automate software engineering tasks—such as bug fixing—have attracted considerable research interest. However, as these systems handle increasingly complex tasks, diagnosing why and where they fail becomes a significant challenge, limiting iterative improvement.

TrajAudit addresses this gap by proposing an automated failure diagnosis framework that analyzes agent execution trajectories to pinpoint the step and root cause of failures. The paper was announced on arXiv (2605.26563) in late May 2026. Specific methodology details, benchmarks, and quantitative results are available in the full paper and should be consulted before drawing conclusions about the framework's scope or performance.

#agent #arxiv #paper #failure-diagnosis #software-engineering #trajectory-analysis #automated-debugging

SourcearXiv cs.SET1
Source Avg ★ 2.0
Type論文
Importance ★ 通常 (top 93% in Papers / Benchmarks)
Half-life 🏛️ 長期 (アーキテクチャ)
LangEN
Collected2026/05/27 20:00

元記事を読む

arxiv.org

本ページの本文・要約は AI による自動生成です。正確性は元記事 (arxiv.org) をご確認ください。

🔬 Papers / Benchmarks の他の記事 もっと見る →

🔬 Papers / Benchmarks の他の記事もっと見る →