Generative Models

May 1, 2020 ยท View on GitHub

First of all, we thank following repositories for their work on high-quality image synthesis

Compared to InterFaceGAN repo, this repo optimizes the common API for generator, improves the pipeline to convert tensorflow weights to pytorch version, and involves model for StyleGAN2.

Pre-trained tensorflow weights (either officially released or trained by ourselves) can be found from following links. Please download them and save to folder pretrain/tensorflow/ before using.

NOTE: The officially released models are simply mirrored by us from the above three repositories, just in case they are not available from the official links.

PGGAN Official
Face
celebahq-1024x1024
Indoor Scene
bedroom-256x256livingroom-256x256diningroom-256x256kitchen-256x256
Outdoor Scene
churchoutdoor-256x256tower-256x256bridge-256x256
Other Scene
restaurant-256x256classroom-256x256conferenceroom-256x256
Animal
person-256x256cat-256x256dog-256x256bird-256x256
horse-256x256sheep-256x256cow-256x256
Transportation
car-256x256bicycle-256x256motorbike-256x256bus-256x256
train-256x256boat-256x256airplane-256x256
Furniture
bottle-256x256chair-256x256pottedplant-256x256tvmonitor-256x256
diningtable-256x256sofa-256x256
StyleGAN Official
Model (Dataset)Training SamplesTraining Duration (K Images)FID
ffhq-1024x102470,00025,0004.40
celebahq-1024x102430,00025,0005.06
bedroom-256x2563,033,04270,0002.65
cat-256x2561,657,26670,0008.53
car-512x3845,520,75646,0003.27
StyleGAN Ours
Model (Dataset)Training SamplesTraining Duration (K Images)FID
Face
ffhq-256x25670,00025,0005.70
ffhq-512x51270,00025,0005.15
LSUN Indoor Scene
livingroom-256x2561,315,80230,0005.16
diningroom-256x256657,57125,0004.13
kitchen-256x2561,000,00030,0005.06
LSUN Indoor Scene Mixture
apartment-256x2564 * 200,00060,0004.18
LSUN Outdoor Scene
churchoutdoor-256x256126,22730,0004.82
tower-256x256708,26430,0005.99
bridge-256x256818,68725,0006.42
LSUN Other Scene
restaurant-256x256626,33150,0004.03
classroom-256x256168,10350,00010.10
conferenceroom-256x256229,06950,0006.20
StyleGAN2 Official
Model (Dataset)Training SamplesTraining Duration (K Images)FID
ffhq-1024x102470,00025,0002.84
church-256x256126,22748,0003.86
cat-256x2561,657,26688,0006.93
horse-256x2562,000,340100,0003.43
car-512x3845,520,75657,0002.32