The Safety Standard
for Embodied AI
Submit your robot policy, we simulate it, and our VLM judge delivers verdicts with 94% human-level accuracy.
How It Works
From submission to verdict in under 5 minutes. No infrastructure to manage.
Submit Your Policy
Upload your trained robot policy as a Python file or ZIP archive. We support MuJoCo-based policies.
We Simulate
Your policy runs in our cloud MuJoCo environment. We record video and capture detailed metrics.
VLM Judges
Our Vision-Language Model analyzes the video and delivers a verdict with detailed reasoning.
Why BotArena?
The only platform that combines simulation, VLM evaluation, and community competition.
VLM-Powered Judging
Our Vision-Language Model analyzes simulation videos like a human expert. Research shows VLM judges achieve 94% correlation with human evaluators.
Public Leaderboard
Compete with the community. See how your policy stacks up against others. Track your progress over time.
Multiple Scenarios
Test your policy across diverse challenges: household tasks, manipulation, navigation, and more.
Detailed Feedback
Get actionable insights. Our VLM explains why your policy passed or failed with specific observations.
Cloud Infrastructure
No GPUs to manage. No MuJoCo installations. Submit from anywhere and get results in minutes.
Open Community
Share results, learn from others, and improve together. Join our Discord to connect with fellow roboticists.
Available Scenarios
More scenarios added regularly
The Platform Vision
BotArena is just the beginning. We're building the verification layer for robot AI.
BotArena
The Benchmark
Submit policies, get VLM verdicts, compete on the leaderboard.
BotRegistry
Hugging Face for Robots
A model registry designed for embodied AI with robot-specific metadata.
BotCI
GitHub Actions for Robots
Continuous integration that tests your policy on every commit.
BotCertify
The Safety Standard
Enterprise safety certification for production robot deployments.
“When a company deploys a robot to a factory, the insurer asks: What's the BotCertify Score?”
Our vision for 2027 and beyond