#llm-agent — TECH Dashboard

Entries page 1/1 · 5 total

Sun, Jun 28 1 entries

blog claude 1d ago ·

zenn-claude

LangChainなしでLLMエージェントを作る【Claude API + Python実装】 A practical tutorial showing how to build an LLM agent from scratch with Claude API and Py…

通常 Normal 深掘り候補 · 技術記事 · Claude / Claude Code Deep-dive candidate · technical post · Claude / Claude Code 公開 6月28日 Published Jun 28

AI要約 LangChainなどのフレームワークに依存せず、Claude APIとPythonのみを使ってLLMエージェントをゼロから実装する方法を解説した技術記事。エージェントの仕組みを根本から理解したい開発者にとって有益な実践的ガイドとなっている。

EN A practical tutorial showing how to build an LLM agent from scratch with Claude API and Python, skipping LangChain to give developers a clearer view of how agents actually work.

#claude #zenn #llm-agent +3

zenn.dev →

fallback

Sat, Jun 20 1 entries

NEW blog copilot 1w ago ·

github-blog-ai

社内データ分析エージェント「Qubot」の構築事例 How we built an internal data analytics agent

重要度 Medium Medium priority 重要度 Medium · 技術記事 · GitHub Copilot Medium priority · technical post · GitHub Copilot 公開 6月20日 Published Jun 20

AI要約 GitHubが社内向けに構築したCopilot搭載データ分析エージェント「Qubot」の事例を解説。社員が自然言語でデータを問い合わせできる仕組みと、構築過程で得た設計や運用上の知見を共有している。

EN GitHub shares how it built Qubot, an internal Copilot-powered analytics agent that lets employees query company data in plain language, detailing its architecture and the lessons learned.

#agent #github #copilot +7

github.blog →

fallback

Mon, Jun 1 1 entries

paper research 4w ago ·

arxiv-cs-lg

LongDS-Bench：長期的なエージェント型データ分析が失敗する理由を検証 LongDS-Bench: On the Failure of Long-Horizon Agentic Data Analysis

重要度 Medium Medium priority 重要度 Medium · 論文/研究 · Papers / Benchmarks Medium priority · paper/research · Papers / Benchmarks 公開 6月1日 Published Jun 1

AI要約 LongDS-Benchは現実のデータ分析が持つ反復的で長期的な性質を再現した新ベンチマークで、短い単発タスクに偏った既存評価では見落とされていたAIエージェントの弱点を体系的に明らかにする。

EN LongDS-Bench is a new benchmark capturing the iterative, long-horizon nature of real-world data analysis, systematically exposing AI agent weaknesses that prior short-task evaluations overlooked.

#agent #arxiv #paper +8

arxiv.org →

fallback

Fri, May 29 2 entries

paper research 4w ago ·

arxiv-cs-cl

LCO: 実世界タスクにおけるエージェント型LLMの安全性向上のためのLLMベース制約最適化 LCO: LLM-based Constraint Optimization for Safer Agentic LLMs in Real-world Tasks

重要度 Medium Medium priority 重要度 Medium · 論文/研究 · Papers / Benchmarks Medium priority · paper/research · Papers / Benchmarks 公開 5月29日 Published May 29

AI要約自律エージェントとして動作するLLMは環境との継続的な相互作用で安全性が損なわれる恐れがある。本論文はLLMベースの制約最適化フレームワーク「LCO」を提案し、実世界タスクでのより安全なエージェント挙動を実現する。

EN As agentic LLMs interact continuously with environments, safety can degrade. The paper proposes LCO, an LLM-based constraint optimization framework for safer agent behavior in real-world tasks.

#agent #arxiv #paper +7

arxiv.org →

og fallback

paper research 4w ago ·

arxiv-cs-cl

OralAgent: 推論・ツール・知識を統合したインタラクティブな歯科画像解析 OralAgent: Integrating Reasoning, Tools, and Knowledge for Interactive Dental Image Analysis

重要度 Medium Medium priority 重要度 Medium · 論文/研究 · Papers / Benchmarks Medium priority · paper/research · Papers / Benchmarks 公開 5月29日 Published May 29

AI要約歯科画像解析における正確な診断と治療計画を支援するため、推論・外部ツール・専門知識を統合したエージェント型AIシステム「OralAgent」を提案した研究論文。対話的な画像解析を可能にする点が特徴である。

EN This research paper proposes OralAgent, an agent-based AI system that integrates reasoning, external tools, and domain knowledge to enable interactive dental image analysis, supporting more accurate diagnosis and treatment planning.

#arxiv #paper #dental-ai +8

arxiv.org →

og fallback

#llm-agent 5 total

Entries page 1/1 · 5 total

LangChainなしでLLMエージェントを作る【Claude API + Python実装】 A practical tutorial showing how to build an LLM agent from scratch with Claude API and Py…

社内データ分析エージェント「Qubot」の構築事例 How we built an internal data analytics agent

LongDS-Bench：長期的なエージェント型データ分析が失敗する理由を検証 LongDS-Bench: On the Failure of Long-Horizon Agentic Data Analysis

LCO: 実世界タスクにおけるエージェント型LLMの安全性向上のためのLLMベース制約最適化 LCO: LLM-based Constraint Optimization for Safer Agentic LLMs in Real-world Tasks

OralAgent: 推論・ツール・知識を統合したインタラクティブな歯科画像解析 OralAgent: Integrating Reasoning, Tools, and Knowledge for Interactive Dental Image Analysis