README.md
January 23, 2026 ยท View on GitHub
Skywork-UniPic
Unified multimodal model for image editing, generation, and understanding
๐ Overview
Welcome to the Skywork-UniPic repository!
This repository hosts the model weights and official implementations of unipic unified multimodal series, featuring three distinct modeling paradigms:
-
UniPic-3 (README) โ ๐ฅ Open-source SOTA Multi-Image Editing Model. Unified framework for single-image editing & multi-image composition. Supports 1โ6 input images with flexible resolutions. 8-steps inference with 12.5ร speedup via CM + DMD distillation.
-
UniPic-2(README) โ SD3.5M-Kontext and MetaQuery variants based on Efficient Architectures with Diffusion Post-Training, delivering state-of-the-art performance in text-to-image generation, fine-grained image editing, and multimodal reasoning.
-
UniPic-1(README) โ 1.5B parameters, Unified Autoregressive Modeling for joint visual understanding and generation, enabling a single transformer to handle both perception and synthesis tasks.
๐ฅ Latest News
โจ Key Features
- ๐จ Text-to-Image Generation โ High-fidelity synthesis from natural language prompts.
- ๐ Image Editing โ Seamless inpainting, outpainting, and object manipulation.
- ๐ผ Image Understanding โ Robust perception capabilities for various visual tasks.
- โก Efficient Architecture โ Optimized for both accuracy and deployability.
๐ License
This project is licensed under the MIT License โ see the LICENSE file for details.