- This project is an implementation of the "Attention Is All You Need" paper.
- Transformer Based (124M): Chatur-GPT is a Generative Pre-Trained model built from scratch and trained on a multilingual dataset, enabling effective translation across various languages. (A quick way to verify the parameter count is sketched below.)
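A minimal sketch of how a 124M parameter count can be checked for any PyTorch model; `build_transformer` is a hypothetical stand-in for whatever factory function the repository actually exposes:

```python
import torch.nn as nn

def count_parameters(model: nn.Module) -> int:
    """Sum the number of trainable parameters in a PyTorch module."""
    return sum(p.numel() for p in model.parameters() if p.requires_grad)

# Hypothetical usage -- substitute the repository's actual model constructor:
# model = build_transformer(src_vocab_size, tgt_vocab_size, seq_len, seq_len)
# print(f"{count_parameters(model) / 1e6:.0f}M parameters")
```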
- Visualization: Added an attention visualizer to dynamically display the attention patterns of the pre-trained model, providing insights into how the model interprets and prioritizes different parts of input sequences (a minimal heatmap sketch follows below).
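A minimal sketch of how one head's attention weights can be rendered as a heatmap with matplotlib; the repository's visualizer may differ, and `attn` here is assumed to be a `[tgt_len, src_len]` attention matrix extracted from the model:

```python
import matplotlib.pyplot as plt
import torch

def plot_attention(attn: torch.Tensor, src_tokens: list[str], tgt_tokens: list[str]) -> None:
    """Render one head's attention weights: rows are target tokens, columns are source tokens."""
    fig, ax = plt.subplots()
    ax.imshow(attn.detach().cpu().numpy(), cmap="viridis")
    ax.set_xticks(range(len(src_tokens)), labels=src_tokens, rotation=90)
    ax.set_yticks(range(len(tgt_tokens)), labels=tgt_tokens)
    ax.set_xlabel("source tokens")
    ax.set_ylabel("target tokens")
    plt.tight_layout()
    plt.show()
```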
- Dataset: The model has been trained on Opus Books, a multilingual dataset from the OPUS collection (a minimal loading sketch follows below).
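Opus Books is distributed through the Hugging Face `datasets` library; a minimal loading sketch, assuming the en-it language pair used in the examples below:

```python
from datasets import load_dataset

# Load the English-Italian subset of Opus Books from the Hugging Face Hub.
ds = load_dataset("opus_books", "en-it", split="train")
print(ds[0]["translation"])  # e.g. {'en': '...', 'it': '...'}
```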
- Metrics: The model has been trained for 20 epochs and the resulting weights are stored for later inference and visualization. A word-level tokenizer is used for text pre-processing, and 512-dimensional input embeddings combined with positional encodings capture token positions. The model achieved a BLEU score of 0.68, indicating strong translation accuracy and context preservation across languages. (A tokenizer sketch follows below.)
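A sketch of how such a word-level tokenizer can be built with the Hugging Face `tokenizers` library; the exact special tokens and minimum frequency here are assumptions, not the repository's verified settings:

```python
from tokenizers import Tokenizer
from tokenizers.models import WordLevel
from tokenizers.pre_tokenizers import Whitespace
from tokenizers.trainers import WordLevelTrainer

# Word-level model: every whitespace-separated token gets its own id.
tokenizer = Tokenizer(WordLevel(unk_token="[UNK]"))
tokenizer.pre_tokenizer = Whitespace()

# Special tokens and min_frequency are illustrative assumptions.
trainer = WordLevelTrainer(special_tokens=["[UNK]", "[PAD]", "[SOS]", "[EOS]"], min_frequency=2)

# `sentences` is assumed to be an iterable of training strings:
# tokenizer.train_from_iterator(sentences, trainer=trainer)
# tokenizer.save("tokenizer_en.json")
```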
- Install the requirements: `pip install -r requirements.txt`
- To run inference with the model, open and run `Inference.ipynb` (a greedy-decoding sketch of roughly what the notebook does follows below).
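For reference, a greedy-decoding sketch; every name here (`model`, `tokenizer_src`, `tokenizer_tgt`, the `[SOS]`/`[EOS]` tokens) is a hypothetical stand-in for the repository's actual objects:

```python
import torch

@torch.no_grad()
def greedy_translate(model, tokenizer_src, tokenizer_tgt, text, max_len=350, device="cpu"):
    """Encode the source, then grow the target one most-likely token at a time."""
    src_ids = torch.tensor([tokenizer_src.encode(text).ids], device=device)
    sos_id = tokenizer_tgt.token_to_id("[SOS]")
    eos_id = tokenizer_tgt.token_to_id("[EOS]")
    tgt_ids = torch.tensor([[sos_id]], device=device)
    for _ in range(max_len):
        logits = model(src_ids, tgt_ids)          # assumed shape: [1, tgt_len, vocab]
        next_id = logits[0, -1].argmax().item()   # greedy choice of the next token
        tgt_ids = torch.cat([tgt_ids, torch.tensor([[next_id]], device=device)], dim=1)
        if next_id == eos_id:
            break
    return tokenizer_tgt.decode(tgt_ids[0].tolist())
```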
- To change the language of translation:
>> Go to `config.py`
>> Change `"lang_tgt"` to the desired language (the parameters are customizable; an illustrative config sketch follows below)
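For illustration, a config along these lines; only `lang_tgt` is referenced above, so the other field names and all default values are assumptions:

```python
# config.py -- illustrative sketch, not the repository's verified contents.
def get_config():
    return {
        "lang_src": "en",   # source language (assumed field name)
        "lang_tgt": "it",   # change to the desired target language, e.g. "fr"
        "seq_len": 350,     # assumed default
        "d_model": 512,     # embedding dimension mentioned in Metrics above
        "num_epochs": 20,   # matches the training run described above
    }
```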
- To train the model: `python train.py`
Note: The code is device agnostic (it can run on both a CPU and a GPU); see the device-selection sketch below.
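Device-agnostic behaviour typically comes down to selecting the device at runtime, along these lines (a sketch, not necessarily the repository's exact code):

```python
import torch

# Use a GPU when one is available, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print(f"Using device: {device}")

# The model and every batch are then moved to the chosen device:
# model = model.to(device)
# batch = {k: v.to(device) for k, v in batch.items()}
```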
- Check the `output.txt` file for more examples (en-it translation).
Note: The output can be syntactically and semantically improved by training for more epochs (~30).