Artificial rapper

A Transformer-based rap lyrics generator, closely based on the Rapformer paper.

Data preprocessing

We use a dataset from Kaggle. To create the train and test datasets, do the following:

Download the files artists-data.csv and lyrics-data.csv from Kaggle and place them in the data/ folder in the project root.
Run the following to generate the files that contain the train and test examples:

python3 data.py

This creates four files in the data/ folder: pretrain_[x|y].txt and finetune_[x|y]. After this step, the datasets are available through LyricsDatasetProvider and LyricsDataset from dataset.py.
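
As a rough illustration of how the paired x and y files relate, the sketch below reads two files and zips them into (input, target) pairs. It assumes one example per line and line-aligned files, which this README does not state. `read_pairs` is a hypothetical helper for illustration only; it is not the API of LyricsDatasetProvider or LyricsDataset.

```python
from pathlib import Path
from typing import List, Tuple


def read_pairs(x_path: str, y_path: str) -> List[Tuple[str, str]]:
    """Hypothetical helper: pair line i of x_path with line i of y_path.

    Assumes one example per line and that both files are line-aligned.
    """
    x_lines = Path(x_path).read_text(encoding="utf-8").splitlines()
    y_lines = Path(y_path).read_text(encoding="utf-8").splitlines()
    if len(x_lines) != len(y_lines):
        raise ValueError(
            f"{x_path} has {len(x_lines)} lines but {y_path} has {len(y_lines)}"
        )
    return list(zip(x_lines, y_lines))
```

Raising on a length mismatch is a deliberate choice here: silently truncating to the shorter file would misalign inputs and targets without any warning.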

After that, run the following to reduce the dataset sizes and the number of distinct tokens:

python3 filter.py

This will overwrite the pretrain_[x|y] and finetune_[x|y] files.
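
One common way to shrink both a dataset and its vocabulary is to keep only the examples made up entirely of the most frequent tokens. The sketch below shows that idea under stated assumptions: whitespace tokenization, and the hypothetical parameters `max_vocab` and `max_examples`. It is not a description of what filter.py actually does.

```python
from collections import Counter
from typing import List, Tuple


def filter_pairs(
    pairs: List[Tuple[str, str]],
    max_vocab: int,
    max_examples: int,
) -> List[Tuple[str, str]]:
    """Hypothetical filter: keep up to max_examples pairs whose tokens all
    fall within the max_vocab most frequent whitespace-separated tokens."""
    # Count token frequencies over inputs and targets together.
    counts = Counter(tok for x, y in pairs for tok in (x + " " + y).split())
    vocab = {tok for tok, _ in counts.most_common(max_vocab)}
    # Drop any pair that contains a token outside the kept vocabulary.
    kept = [
        (x, y)
        for x, y in pairs
        if all(tok in vocab for tok in (x + " " + y).split())
    ]
    return kept[:max_examples]
```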

Training

Wandb

To use Wandb, install the package and log in:

pip install wandb

wandb login

Then paste your API key when prompted.

Training script

train.py is configured through command-line parameters. To list the available ones, run:

python train.py --help

To train the model with the chosen parameters, run:

python train.py [params]

Alternatively, use the training script, which makes it easy to change previously used parameters:

./run_train.sh

Inference

Run infer.py with the appropriate parameters. This generates a results.txt file.

Rhyme enhancement

Run rhyme_enhancement.py. This assumes that a file named results.txt containing the generated examples already exists. The script prints the rhyme-enhanced examples to the screen.
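
To get a quick feel for how much consecutive generated lines rhyme, one crude option is to compare the endings of their last words. The suffix check below is a stand-in chosen for illustration, with hypothetical function names and a hypothetical `suffix_len` parameter. It is not the method used by rhyme_enhancement.py, whose internals this README does not describe.

```python
from typing import List


def last_word(line: str) -> str:
    """Return the final word of a line, lowercased and stripped of punctuation."""
    words = line.split()
    return words[-1].strip(".,!?;:\"'()").lower() if words else ""


def crude_rhyme(a: str, b: str, suffix_len: int = 2) -> bool:
    """Treat two lines as rhyming when their last words differ but share a suffix."""
    wa, wb = last_word(a), last_word(b)
    if len(wa) < suffix_len or len(wb) < suffix_len or wa == wb:
        return False
    return wa[-suffix_len:] == wb[-suffix_len:]


def rhyming_pair_ratio(lines: List[str], suffix_len: int = 2) -> float:
    """Fraction of consecutive line pairs that pass the crude suffix check."""
    pairs = list(zip(lines, lines[1:]))
    if not pairs:
        return 0.0
    return sum(crude_rhyme(a, b, suffix_len) for a, b in pairs) / len(pairs)
```

A spelling-based check like this misses rhymes that are spelled differently and accepts some that only look alike, so it is only suitable as a rough sanity check on results.txt.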
