ModelZoo.md

February 9, 2026 ยท View on GitHub

General Pretrained Models

ModelDescriptionInput ImagesAction DimModel SizeLink
Dexbotic-BaseDiscrete vision-language action model (similar to OpenVLA)Single ViewNA7B๐Ÿค— Hugging Face
Dexbotic-CogACT-SArmSingle-arm CogACT modelSingle View7D7B๐Ÿค— Hugging Face
Dexbotic-CogACT-HArmDual-arm CogACT model with multiple views inputMain View + Left Hand-View + Right Hand-View16D7B๐Ÿค— Hugging Face
DM0-baseDM0 base model with Flow Matching action generationUp to 3 Views32D2.4B๐Ÿค— Hugging Face

It is recommended to download the pretrained models into the following folders.

mkdir checkpoints
cd checkpoints
git clone https://huggingface.co/Dexmal/Dexbotic-Base Dexbotic-Base
git clone https://huggingface.co/Dexmal/Dexbotic-CogACT-SArm Dexbotic-CogACT-SArm
git clone https://huggingface.co/Dexmal/Dexbotic-CogACT-HArm Dexbotic-CogACT-HArm
git clone https://huggingface.co/Dexmal/DM0-base DM0-base

Action Dimension Description

Users need to map their data to the action dimensions of the pretrained models. If the data dimension is smaller than the pretrained model dimension, padding will be conducted automatically.

We recommend using the following data formats to fully utilize the pretrained models:

  1. Single-arm end-effector pose: Organize 7D action data as [xyz + rpy + gripper]
  2. Single-arm joint angles: Organize 8D action data as [joints + gripper]
  3. Dual-arm end-effector pose: Organize 14D action data as [left_arm_xyz + left_arm_rpy + left_arm_gripper + right_arm_xyz + right_arm_rpy + right_arm_gripper]
  4. Dual-arm joint angles: Organize 16D action data as [left_arm_joints + left_arm_gripper + right_arm_joints + right_arm_gripper]

Other Dexbotic Models

ModelLink
Dexbotic-ฯ€0๐Ÿค— HF
Dexbotic-ฯ€05๐Ÿค— HF
Dexbotic-NaVILA๐Ÿค— HF
Dexbotic-RL-Base๐Ÿค— HF

Benchmark Results

Libero

ModelLibero-SpatialLibero-ObjectLibero-GoalLibero-10AverageConfigCheckpoint Link
CogACT97.298.090.288.893.6--
DB-CogACT93.897.896.291.894.9libero_cogact.py๐Ÿค— HF
ฯ€096.898.895.885.294.2--
DB-ฯ€09798.29486.493.9libero_pi0.py๐Ÿค— HF
MemVLA98.498.496.493.496.7-
DB-MemVLA97.299.298.493.297.0libero_memvla.py๐Ÿค— HF

CALVIN

Our training and evaluation are conducted under the ABC->D setting.

Model12345Average LengthConfigCheckpoint Link
CogACT83.872.96455.9483.246--
DB-CogACT93.586.780.37669.84.063calvin_cogact.py๐Ÿค— HF
OFT89.179.467.459.851.53.472--
DB-OFT92.880.769.260.251.13.540calvin_oft.py๐Ÿค— HF

SimplerEnv

Our training uses the Bridge dataset and is tested on the WidowX environment.

ModelPut Spoon on TowelPut Carrot on PlateStack Green Block on Yellow BlockPut Eggplant in Yellow BasketAverageConfigCheckpoint Link
CogACT71.750.81567.551.25--
DB-CogACT87.565.2829.1795.8369.45simpler_cogact.py๐Ÿค— HF
OFT12.54.24.210030.23--
DB-OFT91.6776.3943.0694.4476.39simpler_oft.py๐Ÿค— HF
MemVLA75.075.037.5100.071.9--
DB-MemVLA100.066.770.8100.084.4simpler_memvla.py๐Ÿค— HF

ManiSkill2

ModelPickCubeStackCubePickSingleYCBPickSingleEGADPickClutterYCBAverageConfigCheckpoint Link
CogACT557030252040--
DB-CogACT906565403058maniskill2_cogact.py๐Ÿค— HF
OFT404555021--
DB-OFT907555653063maniskill2_oft.py๐Ÿค— HF
ฯ€0958555851066--
DB-ฯ€0958565503065maniskill2_pi0.py๐Ÿค— HF

RoboTwin2.0

Our training uses the RoboTwin2.0 demo_clean dataset and is tested on the Aloha-AgileX demo_clean environment.

ModelAdjust BottleGrab RollerPlace Empty CupPlace Phone StandAverageConfigCheckpoint Link
CogACT877211543.8--
DB-CogACT9989281858.5robotwin2_cogact.py๐Ÿค— HF