SigFormer: Sparse Signal-Guided Transformer for MultiModal Human Action Segmentation

November 29, 2023

Introduction

This is the implementation repository for our work, SigFormer: Sparse Signal-Guided Transformer for MultiModal Human Action Segmentation.

Installation

Clone the repository and change into its folder:

git clone https://github.com/LIUQI-creat/SigFormer.git

cd SigFormer

To use this source code, you need Python 3.8+ and a few Python 3 packages:

  • pytorch 1.12.1
  • torchvision 0.13.1
  • openpack-torch
  • openpack-toolkit
  • ......
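Before training, it can help to confirm that the pinned packages are actually installed. The following is a minimal sketch, not part of the repository: the helper name check_requirements is hypothetical, and the distribution names are assumptions (PyTorch is published on PyPI as "torch"; the two OpenPack packages are assumed to be installed under the names listed above). Packages hidden behind the "......" entry are not covered.

```python
from importlib import metadata

# Versions taken from the list above; None means "any version".
# Distribution names are assumptions: PyTorch is published on PyPI as "torch".
REQUIRED = {
    "torch": "1.12.1",
    "torchvision": "0.13.1",
    "openpack-torch": None,
    "openpack-toolkit": None,
}


def check_requirements(required):
    """Return {name: problem} for every missing or mismatched package."""
    problems = {}
    for name, wanted in required.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            problems[name] = "not installed"
            continue
        # Local build tags such as "+cu113" are ignored in the comparison.
        if wanted is not None and installed.split("+")[0] != wanted:
            problems[name] = f"found {installed}, expected {wanted}"
    return problems


if __name__ == "__main__":
    for name, problem in check_requirements(REQUIRED).items():
        print(f"{name}: {problem}")
```

An empty result means every listed package was found with the expected version.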

Data

Download the OpenPack dataset using:

optk-download -d ./data

Train and Test

Training

Use the following command for training:

python src/train.py

Testing

Obtain the final prediction results:

python src/ensemble_mean.py
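The script name suggests that predictions from several models are combined by averaging. The sketch below illustrates that general idea only; it is an assumption about the technique, not a copy of src/ensemble_mean.py, and the function name and input layout (one list of per-frame class probabilities per model) are hypothetical.

```python
def ensemble_mean(model_probs):
    """Average per-frame class probabilities over models, then take the argmax.

    model_probs: list over models of [frame][class] probability lists.
    Returns one predicted class index per frame.
    """
    if not model_probs:
        raise ValueError("need at least one model's predictions")
    n_models = len(model_probs)
    predictions = []
    # zip(*model_probs) groups the rows that belong to the same frame.
    for frame_rows in zip(*model_probs):
        # zip(*frame_rows) groups the scores that belong to the same class.
        mean = [sum(scores) / n_models for scores in zip(*frame_rows)]
        predictions.append(max(range(len(mean)), key=mean.__getitem__))
    return predictions


# Two hypothetical models, two frames, two classes.
model_a = [[0.6, 0.4], [0.2, 0.8]]
model_b = [[0.5, 0.5], [0.1, 0.9]]
print(ensemble_mean([model_a, model_b]))  # prints [0, 1]
```

Averaging probabilities rather than voting on hard labels lets a confident model outweigh an uncertain one on the same frame.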

To obtain the results in the table below, submit the generated submission.zip file for online review.

Our submitted file is available on baiduyun (passcode: ubfo).

Main results

F1 (Macro Average)  U0104  U0108  U0110  U0203  U0204  U0207  ALL
SigFormer           0.971  0.969  0.960  0.966  0.903  0.923  0.958
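For readers unfamiliar with the metric in the table, macro-averaged F1 is the unweighted mean of the per-class F1 scores. The sketch below shows the standard definition on toy labels; the function name is hypothetical, and the exact class set and any ignored classes used by the online review are not specified here, so this is an illustration rather than the official scoring code.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over the classes present in y_true."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equally long")
    scores = []
    for c in sorted(set(y_true)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == c and p == c for t, p in pairs)  # true positives
        fp = sum(t != c and p == c for t, p in pairs)  # false positives
        fn = sum(t == c and p != c for t, p in pairs)  # false negatives
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy example: class 0 has F1 = 2/3, class 1 has F1 = 4/5.
score = macro_f1([0, 0, 1, 1], [0, 1, 1, 1])
print(round(score, 3))  # prints 0.733
```

Because every class counts equally, rare actions influence the score as much as frequent ones.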

Acknowledgement

We greatly appreciate the OpenPack-Challenge-1st repository.