Timeline · p.3 — TECH Dashboard

TODAY 30 entries

NEW paper research 5h ago · arxiv-cs-ai

LLM Reasoning Is Latent, Not the Chain of Thought

AI summary: This paper argues that LLM reasoning resides in latent representations rather than in the explicit Chain of Thought (CoT) output. The CoT text is merely a surface trace of a deeper latent reasoning process, so understanding a model's actual reasoning mechanism requires analyzing its latent space.

#arxiv #paper #llm #reasoning

NEW paper research 5h ago · arxiv-cs-ai

Structured Abductive-Deductive-Inductive Reasoning for LLMs via Algebraic Invariants

AI summary: This paper proposes a framework that uses algebraic invariants to have LLMs perform structured abductive, deductive, and inductive reasoning. It casts hypothesis generation as invariant discovery and combines it with deductive verification and inductive generalization, aiming to improve the consistency and verifiability of reasoning.

#arxiv #paper #llm-reasoning #abductive-reasoning

NEW paper research 5h ago · arxiv-cs-ai

KWBench: Measuring Unprompted Problem Recognition in Knowledge Work

AI summary: KWBench is a new benchmark that measures whether LLMs can recognize problems in knowledge work on their own, without explicit prompting. It evaluates whether a model notices issues hidden in real-world tasks, complementing traditional instruction-following evaluations.

#arxiv #benchmark #paper #llm-evaluation

NEW paper research 5h ago · arxiv-cs-ai

Stein Variational Black-Box Combinatorial Optimization

AI summary: This paper proposes a new method for black-box combinatorial optimization based on Stein variational inference. In discrete search spaces where gradient information is unavailable, it uses particle-based distribution approximation to search efficiently for optimal solutions, and it is reported to outperform prior methods.

#arxiv #paper #combinatorial-optimization #stein-variational

NEW paper research 5h ago · arxiv-cs-ai

Discover and Prove: An Open-source Agentic Framework for Hard Mode Automated Theorem Proving in Lean 4

AI summary: Proposes Discover and Prove, an open-source agentic framework for hard mode automated theorem proving in Lean 4. It combines discovery and proving stages to tackle difficult theorem-proving tasks.

#agent #arxiv #paper #lean4

NEW paper research 5h ago · arxiv-cs-ai

Experience Compression Spectrum: Unifying Memory, Skills, and Rules in LLM Agents

AI summary: This paper proposes a unified framework that treats experience accumulation in LLM agents as a spectrum of memory, skills, and rules, which differ in their degree of compression. It lays out the abstraction level and use cases of each form, offering guidance for using experience in agent design.

#agent #arxiv #paper #llm-agents

NEW paper research 5h ago · arxiv-cs-ai

Towards Rigorous Explainability by Feature Attribution

AI summary: This paper works toward a rigorous mathematical foundation for explainability via feature attribution. Moving beyond heuristic methods, it builds a formal framework that guarantees the correctness and consistency of explanations, with the goal of trustworthy AI interpretation.

#arxiv #paper #explainability #feature-attribution
