SEMA: Semantic Distance Adversarial Learning for Text-to-Image Synthesis (TMM' 23)

Official Pytorch implementation for our paper Semantic Distance Adversarial Learning for Text-to-Image Synthesis

Requirements

python 3.8
Pytorch 1.9
transformers 4.8.1

Installation

Clone this repo.

git clone https://github.com/yuanrr/SEMA

conda create -n SEMA
conda activate SEMA
pip install -r requirements.txt

Preparation

Datasets

Download the preprocessed metadata for birds coco and extract them to data/
Download the birds image data. Extract them to data/birds/
Download coco2014 dataset and extract the images to data/coco/images/

Training

Code for training SEMA will be released soon. Hope to get your continued attention.

Evaluation

Download Pretrained Model

SEMA w/o BERT for coco (password: guvx)

Evaluate SEMA

We synthesize about 30k images from the test descriptions and evaluate the FID between synthesized images and test images of each dataset.

synthesize images by the given pretrained model

python sampling.py

evaluate the FID score

python test_fid.py

Performance

The released model achieves better performance than SEMA paper version.

Model	COCO-FID↓
SEMA w/o BERT (paper)	17.51
SEMA w/o BERT (released model)	~16.5
SEMA (paper)	16.31

The code is released for academic research use only. Please contact us if you have any questions. Bowen Yuan

Reference

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
cfg		cfg
miscc		miscc
.gitignore		.gitignore
DAMSM.py		DAMSM.py
LICENSE		LICENSE
README.md		README.md
datasets.py		datasets.py
framework.png		framework.png
img_data.py		img_data.py
inception.py		inception.py
model.py		model.py
requirements.txt		requirements.txt
sampling.py		sampling.py
test_fid.py		test_fid.py
truncation.py		truncation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SEMA: Semantic Distance Adversarial Learning for Text-to-Image Synthesis (TMM' 23)

Requirements

Installation

Preparation

Datasets

Training

Evaluation

Download Pretrained Model

Evaluate SEMA

Performance

About

Releases

Packages

Languages

License

yuanrr/SEMA

Folders and files

Latest commit

History

Repository files navigation

SEMA: Semantic Distance Adversarial Learning for Text-to-Image Synthesis (TMM' 23)

Requirements

Installation

Preparation

Datasets

Training

Evaluation

Download Pretrained Model

Evaluate SEMA

Performance

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages