๐Ÿš€ MixLinear

April 24, 2026 ยท View on GitHub

ICLR 2026 PyTorch Parameters License

Welcome to the official repository of MixLinear, the ICLR 2026 paper:

MixLinear: Extreme Low-Resource Multivariate Time Series Forecasting with 0.1K Parameters
Aitian Ma, Dongsheng Luo, Mo Sha
1Knight Foundation School of Computing and Information Sciences, Florida International University

๐Ÿ“„ Paper โ€ข ๐ŸŽฅ Presentation & Slides โ€ข ๐Ÿ‘ค Profile โ€ข ๐ŸŽ“ Google Scholar

๐ŸŒ Additional Materials

๐ŸŽ‰ Published as a conference paper at ICLR 2026


๐Ÿ”ฅ TL;DR

MixLinear is a dual-domain forecasting model that achieves state-of-the-art long-term time series forecasting performance using just 0.1K parameters โ€” up to 98% fewer parameters and 3.2ร— faster inference than existing efficient baselines.

By processing local trends in the time domain and global trends in the frequency domain, MixLinear reduces model complexity from O(nยฒ) to O(n) while preserving accuracy, enabling deployment on edge devices, embedded systems, and low-resource environments.


โœจ Why MixLinear?

FeatureMixLinear
Parameters~0.1K
Time ComplexityO(n log n)
Space ComplexityO(n)
ArchitectureTime + Frequency Dual Pathway
Target ScenariosEdge / IoT / Small Data / Real-Time

๐Ÿ† Up to 16.2% MSE improvement over SparseTSF
โšก Up to 3.2ร— faster inference
๐Ÿ“ฆ 81โ€“98% parameter reduction vs. lightweight baselines
๐ŸŒ Strong cross-domain generalization


๐Ÿง  Core Idea

MixLinear is motivated by a key observation about time series structure:

Local trends are best modeled in the time domain, while global trends are sparse in the frequency domain.

Instead of forcing a single architecture to model both, MixLinear introduces a dual-domain framework:

๐Ÿ”น Segment-Based Trend Extraction (Time Domain)

Local temporal patterns are captured using factorized linear transformations that disentangle:

  • Intra-segment correlations (local shape, slopes)
  • Inter-segment correlations (long-range drift)

This reduces dense forecasting layers from O(nยฒ) parameters to O(n).

๐Ÿ”น Adaptive Low-Rank Spectral Filtering (Frequency Domain)

Global trends are processed via learnable rank-constrained complex filters, compressing spectral representations into an ultra-low-dimensional latent space while preserving dominant frequency modes.

๐Ÿ”น Unified Forecasting

Final prediction combines both pathways:

Y = F_segment(X) + F_frequency(X)

This additive fusion preserves domain-specific representations while enabling joint optimization, achieving an unprecedented efficiencyโ€“accuracy tradeoff.


๐Ÿ—๏ธ Architecture Overview

MixLinear Architecture


๐Ÿ“Š Results

Parameter Efficiency

Parameter Comparison

MixLinear maintains near-linear parameter growth and uses as few as 45โ€“176 parameters, compared to 1K+ for SparseTSF and 10K+ for FITS.


Accuracy vs Efficiency

Main Results

MixLinear achieves competitive or superior forecasting accuracy across eight benchmarks while using orders of magnitude fewer parameters.


Runtime Performance

Inference Time

MixLinear delivers:

  • Up to 3.2ร— speedup in low-dimensional datasets
  • Up to 2.6ร— speedup in high-dimensional datasets

โšก Getting Started

Environment Setup

conda create -n MixLinear python=3.8
conda activate MixLinear
pip install -r requirements.txt

Data Preparation

Download datasets from Autoformer:

https://drive.google.com/drive/folders/1ZOYpTUa82_jCcxIdTmyr0LXQfvaM9vIy

Place CSV files in:

./dataset

Example:

./dataset/ETTh1.csv

Training

Run single dataset:

sh scripts/MixLinear/etth1.sh

๐Ÿงช Using MixLinear on Your Own Data

MixLinear does not require strong periodicity assumptions and generalizes well across domains.

Recommended hyperparameters:

  • segment_len โ€” local granularity
  • rank โ€” spectral compression level (default: 2)

The model is stable across wide ranges of both.


๐Ÿ“š Citation

@inproceedings{ma2026mixlinear,
  title={MixLinear: Extreme Low-Resource Multivariate Time Series Forecasting with 0.1K Parameters},
  author={Ma, Aitian and Luo, Dongsheng and Sha, Mo},
  booktitle={International Conference on Learning Representations (ICLR)},
  year={2026}
}

๐Ÿ“ฌ Contact

Aitian Ma โ€” ama003@fiu.edu
Dongsheng Luo โ€” dluo@fiu.edu
Mo Sha โ€” msha@fiu.edu

Knight Foundation School of Computing and Information Sciences
Florida International University


๐Ÿ™ Acknowledgement

We thank the following repositories for datasets and baseline implementations: