Cortexbench

March 31, 2023 ยท View on GitHub

CORTEXBENCH is a collection of 17 EAI tasks drawn from 7 existing benchmarks, designed to assess the efficacy of pre-trained visual representations (PVRs) in supporting a variety of EAI applications. Each task includes a downstream policy learning paradigm and evaluation protocol adhering to community standards. By using fixed tasks and learning methods, CORTEXBENCH evaluates the contribution of PVRs and measures progress towards developing an artificial visual cortex for embodied intelligence. This benchmark conducts the largest and most comprehensive study of PVRs to-date.

CortexBench Workflow

CORTEXBENCH includes tasks from 7 diverse benchmarks with different combinations of observations, actions, and goals as well as different standard policy learning paradigms:

Benchmark SuiteObservation SpaceAction SpaceGoal SpecificationPolicy Learning
AdroitRGB + proprio.Continuous-IL
MetaworldRGB + proprio.Continuous-IL
DMControlRGB + proprio.Continuous-IL
TrifingerRGB + proprio.ContinuousGoal Image/PositionIL
ObjectNavRGB + proprio.DiscreteObject CategoryIL
ImageNavRGBDiscreteGoal ImageRL
MobilePickRGB + proprio.ContinuousGoal PositionRL

Installation

To install CortexBench, please follow the instructions in ../INSTALLATION.md.

Datasets

Benchmarks from CortexBench use datasets described in DATASETS.md.