#activation-steering — TECH Dashboard

paper · research · 3w ago

arxiv-cs-cl

Cultural Value Alignment Via Latent Activation Steering in Large Language Models

Medium priority · paper/research · Papers / Benchmarks · Published May 27

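AI summary: This study proposes a method that uses latent-space manipulation to correct the homogenized cultural bias LLMs exhibit, with the World Values Survey (WVS) as the reference standard.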

EN arXiv:2605.26365v1 Announce Type: new Abstract: Large Language Models (LLMs) often exhibit homogenized cultural perspectives. While the World Values Survey (WVS) provides a gold standard for mapping h

#arxiv #paper #cultural-alignment +4

arxiv.org →

