Benchmark Datasets

July 22, 2025 ยท View on GitHub

We evaluated Llama3-MS-CLIP on several Earth Observation datasets, using their test splits.

All benchmark datasets besides BigEarthNet_RGB, BigEarthNet_MS and ForestNet_RGB have to be organized in an ImageFolder format with the following folder structure:

  benchmark_datasets/dataset_name/class_1/xxx.png
  benchmark_datasets/dataset_name/class_1/xxy.png
  benchmark_datasets/dataset_name/class_1/xxz.png
  .
  .
  .
  benchmark_datasets/dataset_name/class_x/xxx.tif
  benchmark_datasets/dataset_name/class_x/xxy.tif
  benchmark_datasets/dataset_name/class_x/xxz.tif

The dataset_name from the above folder structure should match the following names:

METERML_RGB,
EuroSAT_RGB,
RESISC45_RGB,
AID_RGB,
METERML_NAIP,
METERML_MS,
EuroSAT_MS,

Both BigEarthNet_RGB and BigEarthNet_MS use a single dataset called BigEarthNet and depending on whether the mode is RGB or MS the appropriate bands will be selected automatically. The dataset is organized in the following format:

images are composed of multiple single channel geotiffs
labels are multiclass, stored in a single json file per image

The following is the folder structure for BigEarthNet:

benchmark_datasets/BigEarthNet/sentinel-2/xxx_image_folder/band_1.tif
.
.
benchmark_datasets/BigEarthNet/sentinel-2/xxx_image_folder/band_12.tif
benchmark_datasets/BigEarthNet/sentinel-2/xxx_image_folder_name/labels_metadata.json
.
.
.
.
benchmark_datasets/BigEarthNet/sentinel-2/xxz_image_folder/band_12.tif
benchmark_datasets/BigEarthNet/sentinel-2/xxz_image_folder_name/labels_metadata.json

ForestNet_RGB contains 3 csv files for train test and validation and these csv files contain the image paths and the class labels. Benchmarking is done on the test split. The following is the folder structure for ForestNet_RGB:

benchmark_datasets/ForestNet_RGB/examples/xxx_image_folder/images/visible/composite.png
.
.
benchmark_datasets/ForestNet_RGB/examples/xxz_image_folder/images/visible/composite.png
.
.
.
.
benchmark_datasets/ForestNet_RGB/test_csv
benchmark_datasets/ForestNet_RGB/val_csv
benchmark_datasets/ForestNet_RGB/train_csv