Q: What's the difference between the RoboMME Challenge leaderboard and the regular RoboMME leaderboard?
A: The regular leaderboard is evaluated by participants themselves, who run the evaluation and open a pull request to update their results. Submissions can be made at any time. The RoboMME Challenge leaderboard is evaluated by the organizers using held-out test episodes (a total of 800 episodes), and the results are updated and maintained on this page. Submissions are only accepted during the challenge period.
Q: Can I use external data, other VLA backbones, LLM APIs, etc.?
A: Yes. Any methods or resources are allowed, but you may not use the RoboMME repository itself to generate additional training data, as this would be unfair. You are welcome to use training data beyond RoboMME, as long as all external resources are clearly described in your method description.
Q: Can I use human-in-the-loop methods during testing?
A: No. Participants must not attempt to manually intervene policy rollouts, as this would unfairly influence the evaluation results.
Q: Can I write rules or design prompts to improve policy performance?
A: The goal of RoboMME is to evaluate robotic generalist policies, so we do not encourage participants to write hard-coded task-specific rules or prompts solely to boost performance on particular tasks.
Q: Is there a team size limit?
A: There is no strict limit, but each team should submit under a single team name and submit only one model.
Q: Will top teams need to provide extra details?
A: Yes. Top teams will be asked to share a brief method description and reproducibility details.
Q: Can I present my work at the workshop?
A: Workshop presentations follow the official procedure. The challenge is independent of the workshop paper track, so if you want to present at the workshop, you must submit your paper separately.
Q: What if my internet connection is unstable for remote evaluation?
A: We will work with you together to help you complete setup before Phase 2 starts, so we recommend submitting early to
leave enough time to debug connection issues. If your own server does not allow public IP, you can
either rent a cloud server (e.g., Lambda Labs) that allows public IP or choose other options instead.
Q: How can I contact the organizers if I have issues?
A: For any RoboMME Challenge-related questions, please email robomme2026@gmail.com.
You can also join our mailing list, robomme-cvpr-challenge-2026@googlegroups.com, to receive the latest updates.
For real-time discussion, please join the WeChat and Discord channels linked above.