#leaderboard — TECH Dashboard

Entries page 1/1 · 3 total

Wed, May 27 · 1 entry

🔥 HOT · blog · tech-news · 3w ago

microsoft-source

MAI-Image-2.5 launches at No. 3 on Arena text-to-image leaderboard

High priority · technical post · Industry & Policy · Published May 27

AI summary: Microsoft's MAI-Image-2.5 debuted at No. 3 on the Arena text-to-image leaderboard, demonstrating the competitiveness of the company's image-generation technology.

#microsoft #news #image-generation +4

microsoft.ai →

Tue, Apr 21 · 1 entry

blog · local-llm · 1mo ago

huggingface-blog

QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard

Medium priority · technical post · Local LLM / Open Models · Published Apr 21

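AI summary: TII (Technology Innovation Institute) has released QIMMA, a new leaderboard for evaluating Arabic LLMs. It puts quality first, evaluating models on benchmarks that reflect cultural and linguistic characteristics and making their practical usefulness in the Arabic-speaking world visible.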

#huggingface #open-model #arabic-nlp +4

huggingface.co →

Wed, Feb 4 · 1 entry

blog · local-llm · 4mo ago

huggingface-blog

Community Evals: Because we're done trusting black-box leaderboards over the community

Medium priority · technical post · Local LLM / Open Models · Published Feb 4

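AI summary: Hugging Face announced Community Evals, a community-driven LLM evaluation platform. It aims to build an open evaluation ecosystem that emphasizes transparency and reproducibility.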

#huggingface #open-model #llm-evaluation +7

huggingface.co →
