LeRobot v0.5.0リリース、ロボット学習基盤を全方位で拡張 LeRobot v0.5.0: Scaling Every Dimension

Hugging Face Blog · huggingface.co · 2026/03/09 09:00 · 3mo ago · 📖 2 min

AI 3 行サマリ

Hugging FaceがオープンソースのロボットライブラリLeRobot v0.5.0を公開。
プロセッサパイプライン刷新、非同期推論、強化学習対応、SmolVLAなど最新方策、ハードウェア互換拡充を備え、研究から実機運用までを一貫支援する。

English summary

Hugging Face released LeRobot v0.5.0, a major update to its open-source robot library with a new processor pipeline, async inference, RL support, SmolVLA policies, and broader hardware compatibility.

Hugging Faceは、オープンソースのロボット学習ライブラリ「LeRobot」のメジャーアップデートとなるv0.5.0を公開した。模倣学習や強化学習をはじめとする多様な学習手法と、実機ロボットでの推論・運用を統合的に扱う基盤として、研究コミュニティと実応用の橋渡しを狙うものだ。

今回のリリースでは、データ前処理から方策推論、後処理までを一貫して扱う新しいプロセッサパイプラインが導入された。これにより観測・行動の変換処理が標準化され、学習時と推論時のミスマッチを抑えやすくなる。さらに非同期推論(async inference)が組み込まれ、高頻度制御を要するロボット制御で方策の遅延を緩和できる構成となっている。

方策面では、軽量なビジョン言語行動モデルであるSmolVLAをはじめ、ACTやDiffusion Policy、Pi0系などの最新手法に対するサポートが拡充された。加えて強化学習(RL)向けのワークフローも追加され、模倣学習で得た方策をRLで微調整するといった研究で広がる手法に対応する。Hugging Face Hubと統合されたデータセット形式LeRobotDatasetも改善され、大規模データの取り扱いがしやすくなった。

プロセッサパイプライン刷新、非同期推論、強化学習対応、SmolVLAなど最新方策、ハードウェア互換拡充を備え、研究から実機運用までを一貫支援する。

🏠 Local LLM / Open Models · 本記事のポイント

ハードウェア面では、SO-100/SO-101など低価格な教育用アームから、より本格的な双腕ロボットまで対応の幅が広がっている。これは、Hugging FaceがPollen RoboticsやReachy 2を通じて自社のロボットスタック整備を進めてきた流れと整合的だと見られる。

背景として、ロボット学習領域ではPhysical IntelligenceのPi0、GoogleのRT-2、NVIDIA GR00Tなど大規模VLAモデルの公開が相次いでおり、再現実装と評価基盤の重要性が増している。LeRobotはこれらを共通インターフェースで扱える「ロボット版transformers」を志向しており、v0.5.0はその方向性をさらに強める位置付けと言える。

Hugging Face has shipped LeRobot v0.5.0, a major update to its open-source robotics learning library that aims to be a common substrate for imitation learning, reinforcement learning, and real-world robot deployment. The release touches nearly every layer of the stack, from data handling to policies to hardware integration.

The headline change is a unified processor pipeline that standardises how observations and actions are transformed between datasets, training, and inference. In practice, mismatches between the preprocessing used at train time and at deploy time are a frequent source of silent failures in robot learning, so making this pipeline explicit and reusable should reduce a class of subtle bugs. Alongside this, async inference has been added, decoupling policy computation from the control loop so that high-frequency robots are not bottlenecked by model latency.

On the policy side, v0.5.0 expands support for state-of-the-art approaches including SmolVLA, a compact vision-language-action model developed within the LeRobot ecosystem, as well as ACT, Diffusion Policy and Pi0-style architectures. A reinforcement learning workflow has also been integrated, reflecting a broader research trend of using RL to fine-tune policies first bootstrapped via imitation learning. The LeRobotDataset format, tightly coupled with the Hugging Face Hub, has been refined to scale better to larger episodic datasets.

Hardware coverage continues to grow, spanning low-cost educational arms such as the SO-100 and SO-101 up to more capable bimanual setups. This dovetails with Hugging Face's acquisition of Pollen Robotics and ongoing work around the Reachy 2 humanoid, suggesting a deliberate strategy of co-developing software and accessible hardware rather than treating them as separate concerns.

The wider context matters here. The last year has seen a surge of large vision-language-action models, including Physical Intelligence's Pi0, Google DeepMind's RT-2 and Gemini Robotics work, and NVIDIA's GR00T initiative for humanoids. These systems are powerful but often hard to reproduce or compare, and evaluation in robotics remains notoriously inconsistent. LeRobot is positioning itself as something like a transformers library for robotics: a place where heterogeneous policies, datasets and embodiments can be expressed through shared abstractions and benchmarked on common ground.

Whether the project can sustain that ambition will depend on community uptake and on how cleanly the new processor and async-inference abstractions hold up as more exotic robots and policies are added. For now, v0.5.0 looks like a substantive step in that direction, and it is likely to lower the barrier for researchers and hobbyists who want to move from notebook-scale experiments to actual hardware without rewriting their stack from scratch.