#dataset — TECH Dashboard

Entries page 1/1 · 4 total

TODAY · 1 entry

NEW paper research 3h ago

arXiv cs.AI

Nigeria Machinery: A Low-Resource Industrial Dataset with a Domain-Grounded Reasoning Layer

Medium priority · paper/research · Papers / Benchmarks · Published Jul 11

AI summary: This paper introduces a low-resource NLP dataset focused on Nigeria's machinery industry, augmented with a domain-grounded reasoning layer, to address the challenges of applying AI in under-resourced industrial settings.

#africa #arxiv #dataset +4

arxiv.org →

Sun, Jun 21 · 1 entry

blog tech-news 2w ago

The Verge

The Atlantic created a searchable database of the music used to train AI

Normal · Deep-dive candidate · technical post · Industry & Policy · Published Jun 21

AI summary: The Atlantic reporter Alex Reisner uncovered four datasets of music used to train AI models and made them publicly searchable. Two of the datasets exceed 12 million entries, and the release feeds into debates over music copyright and the transparency of AI training data.

#ai-training-data #alex-reisner #copyright +7

theverge.com →

Mon, Jun 1 · 1 entry

paper research 1mo ago

arXiv cs.LG

QASM-Eval: A Dataset to Train and Evaluate LLMs on OpenQASM-3 Beyond Quantum Circuits

Normal · Deep-dive candidate · paper/research · Papers / Benchmarks · Published Jun 1

AI summary: The paper proposes QASM-Eval, a dataset for training and evaluating how well LLMs understand and generate code in the OpenQASM-3 quantum programming language. It covers a broad range of tasks beyond quantum circuits, with the aim of supporting LLM-assisted quantum software development in the NISQ era.

#arxiv #benchmark #code-generation +8

arxiv.org →

Tue, Sep 16 · 1 entry

NEW blog local-llm 9mo ago

Hugging Face Blog

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

Medium priority · technical post · Local LLM / Open Models · Published Sep 16

AI summary: LeRobotDataset v3.0 adds large-scale dataset support to the lerobot framework, allowing large robotics datasets to be managed and used efficiently and making it easier for researchers to scale up training on real-robot data.

#dataset #huggingface #lerobot +2

huggingface.co →
