Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables

July 18, 2025 · View on GitHub

Introduction

The project provides the official PyTorch implementation with pretrained models for the paper "Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables" (accepted by ICCV 2025).

A 3D lookup table (3D LUT) is a classic yet effective tool for image enhancement and restoration tasks, even in the deep learning era. The 3D LUT efficiently reduces model size and runtime by instantly transforming an input color value into another color value through interpolation of pre-calculated values at the vertices. However, a limitation of 3D LUT transforms is their lack of spatial information, as they convert color values on a point-by-point basis. To address this weakness, researchers have explored spatial-aware 3D LUT methods, which provide spatial features through additional modules. While spatial-aware 3D LUT methods show promising performance, the extra modules introduce a substantial number of parameters and an increased runtime, particularly as the resolution of the input image rises. To tackle this issue, we propose a method for generating image-adaptive 3D LUTs by considering the redundant parts of tables. We introduce an efficient framework that decomposes the 3D LUT into a linear sum of low-dimensional LUTs and utilizes singular value decomposition (SVD). Additionally, we modify the modules for spatial features to be more cache-efficient and image-adaptive, thereby reducing both runtime and improving performance. Our model effectively reduces the number of parameters and runtime, while maintaining competitive performance, as demonstrated by extensive experimental results.

Environment

Ubuntu 18.04.5 LTS
pytorch
CUDA 11.8 & CUDA toolkit
Python 3.8 or later

Structure

dataset/fiveK
       /ppr10k
project/LUTwithBGrid
       demo_img/                      # image for demo
       kerenel_code/                  # extension codes for lut transform and slicing
       lpips/
       pretrained/                    # pretrainded models
       utils/                         # matlab files for evaluation etc.
       cpp_ext_interface.py           # python interfaces for extension code
       datasets.py                    # codes for dataset classes
       demo.py                        # simple demostration
       evaluate.py                    # evaluation code
       model_losses.py                # losses for our model
       models.py                      # implementaion code for our model
       torchvision_x_functional.py
       train.py                       # train code

Installation

Setup proper version of Pytorch and CUDA-toolkit depend on your environment.
Install the other packages that are required for our model. Refer to requirements.txt or using following command:

pip install scipy kornia tqdm opencv-python pillow

Compile and install the extension for LUT transform and Bilateral grid slicing.

python ./kernel_code/bilateral_slicing_LUTTransform/setup.py install

Datasets

dataset/fiveK
        input/
             JPG/480p/                 # sRGB inputs
             PNG/480p_16bits_XYZ_WB/   # XYZ inputs
        expertC/JPG/480p/              # GT
        train.txt                      
        test.txt

       /ppr10k
        train/
             masks/                    # masks for train
             source/                   # sRGB inputs for train
             source_aug/               # sRGB inputs with 5 versions of augmented images for train 
             target_a/                 # GT of expert a for train
             target_b/                 # GT of expert b for train
             target_c/                 # GT of expert c for train
        val/
             masks/                    # masks for test
             source/                   # sRGB inputs for test
             target_a/                 # GT of expert a for test
             target_b/                 # GT of expert b for test
             target_c/                 # GT of expert c for test

The FiveK and PPR10K datasets are used for experiments.
The 3D-LUT project also provides 480p version of FiveK to speed up in the training stage.

Demo

You can execute a simple demonstration using the following command:

python demo.py

This demonstration code enhances an input image (./demo_img/input/a4632.jpg) and saves the result (./demo_img/result/a4632.png).
Or you can specify the pretrained model, input image path, and output image path.

python demo.py --pretrained_path=[pretrained model path] --input_path=[input image path] --output_path=[output image path]

Train

General Train

python train.py --dataset_name=[fiveK of ppr10k] --input_color_space=[sRGB or XYZ] --batch_size=[batch size]

FiveK sRGB photo retouching

python train.py --dataset_name=fiveK --input_color_space=sRGB

FiveK XYZ tone mapping

python train.py --dataset_name=fiveK --input_color_space=XYZ

PPR10K sRGB photo retouching

python train.py --dataset_name=ppr10k --input_color_space=sRGB --batch_size=16 --lut_n_ranks=10 --lut_weight_ranks=10

Evaluation

Generate and save the test images using the following command:

python evaluate.py --pretrained_path=[path of pretrained model] --dataset_name=[fivek or ppr10k] --input_color_space=[sRGB or XYZ]

We provide pretrained models in ./pretrained/.

Calculate the score using Matlab code

FiveK

(matlab) > ./utils/fivek_calculate_metrics.m [evaluate image dir] [GT dir]

PPR10K

(matlab) > ./utils/ppr10k_calculate_metrics.m [evaluate image dir] [GT dir] [mask dir]

Citation

@inproceedings{kim2025SVDLUT,
  title={Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables},
  author={Kim, Wontae, Lee, Keuntek, and Cho, Nam Ik},
  booktitle={International Conference on Computer Vision},
  year={2025}
}

Acknowledgements

This project is based on the following open-source projects. We sincerely appreciate their authors for allowing us to use their excellent works as open-source.