This project involves developing a chatbot using a pretrained GPT-2 model. The model has been fine-tuned on the OASST1 dataset and is designed to assist users with various tasks and queries. The fine-tuned model is uploaded to Hugging Face under the name KhantKyaw/Chat_GPT-2.
- Introduction
- Model
- Dataset
- Installation
- Usage
- Fine-Tuning Process
- Contributing
- License
Welcome to the GPT-2 Assistant AI Chatbot project! This project showcases how to leverage the capabilities of GPT-2, a state-of-the-art language model developed by OpenAI, to build an intelligent chatbot. By fine-tuning GPT-2 on the Open Assistant Conversations (OASST1) dataset, I've created a model capable of understanding and responding to a wide array of user inputs, making it a versatile assistant.
The GPT-2 model used in this project is a highly advanced text generation model known for its ability to produce coherent and contextually relevant sentences. The model has been fine-tuned specifically for conversational tasks, enabling it to provide meaningful and helpful responses. You can find the original pretrained model on Hugging Face under the name openai-community/gpt2.
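If you want to compare the chatbot's behavior against the base model, you can load the original checkpoint directly. The snippet below is a minimal sketch; the prompt and sampling settings are only illustrative.

```python
from transformers import GPT2Tokenizer, GPT2LMHeadModel

# Load the original pretrained GPT-2 checkpoint that this project fine-tunes.
base_tokenizer = GPT2Tokenizer.from_pretrained("openai-community/gpt2")
base_model = GPT2LMHeadModel.from_pretrained("openai-community/gpt2")

# Generate a short continuation to see the un-tuned model's behavior.
inputs = base_tokenizer("Hello, how can you help me today?", return_tensors="pt")
outputs = base_model.generate(
    **inputs,
    max_length=50,
    do_sample=True,
    top_p=0.9,
    pad_token_id=base_tokenizer.eos_token_id,
)
print(base_tokenizer.decode(outputs[0], skip_special_tokens=True))
```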
The OASST1 dataset used for fine-tuning contains conversations that simulate interactions between a user and an assistant. The dataset is split into two parts:
- df_train.csv: Training dataset
- df_val.csv: Validation dataset
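A quick way to inspect the splits before fine-tuning is to load them with pandas. This is a minimal sketch that assumes the CSV files sit in the repository root; the actual column names depend on how the conversations were exported.

```python
import pandas as pd

# Load the training and validation splits (paths assume the repository root).
train_df = pd.read_csv("df_train.csv")
val_df = pd.read_csv("df_val.csv")

# Check the sizes and columns of each split.
print(f"train: {len(train_df)} rows, val: {len(val_df)} rows")
print(train_df.columns.tolist())
print(train_df.head())
```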
To use the assistant chatbot, you'll need to install the required packages. You can do this using pip:
pip install transformers
pip install torch
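To confirm the environment is set up correctly, you can check that both libraries import and report their versions:

```python
import torch
import transformers

# Print library versions and GPU availability to verify the installation.
print("transformers:", transformers.__version__)
print("torch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
```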
To use the fine-tuned chatbot model for generating responses:
from transformers import GPT2Tokenizer, GPT2LMHeadModel

# Load the fine-tuned chatbot model and its tokenizer from Hugging Face.
model_name = 'KhantKyaw/Chat_GPT-2'
tokenizer = GPT2Tokenizer.from_pretrained(model_name)
model = GPT2LMHeadModel.from_pretrained(model_name)

def generate_response(input_text):
    # Tokenize the user message and sample a continuation from the model.
    inputs = tokenizer(input_text, return_tensors="pt")
    output_sequences = model.generate(
        input_ids=inputs['input_ids'],
        attention_mask=inputs['attention_mask'],
        max_length=100,  # total length cap (prompt + response tokens)
        temperature=0.3,
        top_k=40,
        top_p=0.85,
        num_return_sequences=1,
        no_repeat_ngram_size=2,
        pad_token_id=tokenizer.eos_token_id,
        eos_token_id=tokenizer.eos_token_id,
        do_sample=True,
        use_cache=True,
    )
    full_generated_text = tokenizer.decode(output_sequences[0], skip_special_tokens=True)
    # The fine-tuned model marks its turn with '[Bot]'; return the text after that
    # marker, or the full generation if the marker does not appear.
    marker_start = full_generated_text.find('[Bot]')
    if marker_start == -1:
        return full_generated_text
    return full_generated_text[marker_start + len('[Bot]'):]

user_input = "Hello, what can you do?"  # example prompt; replace with any user message
response = generate_response(user_input)
print("Chatbot:", response)
The repository includes the notebook used for fine-tuning the GPT-2 model. You can run Fine_Tuning_GPT2_Chatbot.ipynb to start the fine-tuning process.
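If you want a sense of what such a run involves before opening the notebook, the sketch below outlines a generic GPT-2 fine-tuning setup with the Hugging Face Trainer. It is not the notebook's exact code: the column name, hyperparameters, and output directory are assumptions for illustration, and it additionally requires the datasets (and, for recent transformers versions, accelerate) packages.

```python
from datasets import load_dataset
from transformers import (DataCollatorForLanguageModeling, GPT2LMHeadModel,
                          GPT2Tokenizer, Trainer, TrainingArguments)

# Load the CSV splits shipped with the repository (paths are assumptions).
data = load_dataset("csv", data_files={"train": "df_train.csv", "validation": "df_val.csv"})

tokenizer = GPT2Tokenizer.from_pretrained("openai-community/gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
model = GPT2LMHeadModel.from_pretrained("openai-community/gpt2")

def tokenize(batch):
    # "text" is a placeholder column name; use whichever column holds the conversation text.
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = data.map(tokenize, batched=True)

training_args = TrainingArguments(
    output_dir="gpt2-chatbot",       # illustrative output directory
    num_train_epochs=3,              # illustrative hyperparameters
    per_device_train_batch_size=4,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```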
Contributions are welcome! If you have any ideas, suggestions, or find a bug, please open an issue or submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for details.