#efficiency — TECH Dashboard

paper research 3w ago ·

arxiv-cs-ai

どれだけ考えれば十分か？LLM推論における冗長性の定量化と理解 How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning

重要度 Medium Medium priority 重要度 Medium · 論文/研究 · Papers / Benchmarks Medium priority · paper/research · Papers / Benchmarks 公開 5月26日 Published May 26

AI要約 LLMの長い思考チェーンに含まれる冗長性を定量化し、レイテンシ・GPU時間・エネルギーコストを削減する手法を研究した論文。

EN A research paper quantifying redundancy in LLM chain-of-thought reasoning, aiming to reduce latency, GPU time, and energy costs without sacrificing accuracy.

#arxiv #paper #chain-of-thought +4

arxiv.org →

og fallback

blog tech-news 3w ago ·

microsoft-source

小型モデル向けに最適化された、より賢いAIエージェント Smarter AI agents, built to run on smaller models

重要度 Medium Medium priority 重要度 Medium · 技術記事 · Industry & Policy Medium priority · technical post · Industry & Policy 公開 5月23日 Published May 23

AI要約 MicrosoftがMagenticLite・MagenticBrain・FARA 1.5を発表。小型モデルで動作するエージェント体験を実現する研究成果。

EN The post Smarter AI agents, built to run on smaller models appeared first on Source .

#microsoft #news #ai-agents +5

microsoft.com →

fallback

#efficiency 2 total

Entries page 1/1 · 2 total

どれだけ考えれば十分か？LLM推論における冗長性の定量化と理解 How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning

小型モデル向けに最適化された、より賢いAIエージェント Smarter AI agents, built to run on smaller models