NanoGPT is a lightweight implementation of the GPT (Generative Pre-trained Transformer) model, based on Andrej Karpathy's minGPT repository. This README explains how to clone and use the NanoGPT repository for text-generation tasks.
Before using NanoGPT, ensure that you have the following dependencies installed:
- PyTorch
- NumPy
- Transformers library from Hugging Face:
pip install transformers
- Tiktoken library from OpenAI:
pip install tiktoken
- Wandb library for optional logging:
pip install wandb
- tqdm library:
pip install tqdm
- Uvicorn, FastAPI, and Fire:
pip install uvicorn fastapi fire
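Optionally, you can verify that everything installed correctly with a short Python check (this script is not part of the repository, just a convenience):
# optional sanity check that the required packages import cleanly
import importlib

for pkg in ["torch", "numpy", "transformers", "tiktoken", "wandb", "tqdm", "fastapi", "uvicorn", "fire"]:
    try:
        mod = importlib.import_module(pkg)
        print(f"{pkg}: OK (version {getattr(mod, '__version__', 'unknown')})")
    except ImportError as err:
        print(f"{pkg}: missing ({err})")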
To get started, clone the NanoGPT repository from GitHub:
git clone https://github.com/vishwasg217/nanogpt.git
The repository provides a script to prepare data for training. You can use it by providing a Gist file URL:
python data/shakespeare/prepare.py --url <url>
Replace <url> with the Gist file URL containing your training data.
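For orientation, a prepare step of this kind typically downloads the raw text, encodes it, and writes binary train/val splits. The sketch below is a hypothetical simplification (it assumes GPT-2 BPE encoding via tiktoken and hard-codes a 90/10 split); the repository's prepare.py is the authoritative version:
# hypothetical sketch of a prepare step; the repository's prepare.py may differ
from urllib.request import urlopen
import numpy as np
import tiktoken
import fire

def prepare(url: str, out_dir: str = "data/shakespeare"):
    text = urlopen(url).read().decode("utf-8")   # download the raw training text
    split = int(0.9 * len(text))                 # 90% train / 10% validation
    enc = tiktoken.get_encoding("gpt2")
    train_ids = np.array(enc.encode_ordinary(text[:split]), dtype=np.uint16)
    val_ids = np.array(enc.encode_ordinary(text[split:]), dtype=np.uint16)
    train_ids.tofile(f"{out_dir}/train.bin")     # token ids stored as raw uint16
    val_ids.tofile(f"{out_dir}/val.bin")
    print(f"train: {len(train_ids):,} tokens, val: {len(val_ids):,} tokens")

if __name__ == "__main__":
    fire.Fire(prepare)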
To train the NanoGPT model, use the following command:
python train.py config/train_shakespeare_char.py --device=mps --compile=False --eval_iters=20 --log_interval=1 --block_size=64 --batch_size=12 --n_layer=4 --n_head=4 --n_embd=128 --max_iters=2000 --lr_decay_iters=2000 --dropout=0.0
Adjust the command-line arguments to suit your hardware and dataset; the command above trains a small model using the settings in config/train_shakespeare_char.py, with each --flag overriding the corresponding value.
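In nanoGPT-style training scripts, the positional config file and the --key=value flags both override defaults defined inside train.py. The snippet below is a simplified, hypothetical illustration of that override pattern, not the repository's actual configurator:
# simplified, hypothetical override logic for "python train.py <config.py> --key=value ..."
import sys
from ast import literal_eval

config = {"device": "cpu", "batch_size": 64, "block_size": 256,
          "n_layer": 6, "n_head": 6, "n_embd": 384, "max_iters": 5000}  # defaults

for arg in sys.argv[1:]:
    if not arg.startswith("--"):
        exec(open(arg).read(), {}, config)       # config file: plain Python assignments
    else:
        key, val = arg[2:].split("=", 1)
        try:
            config[key] = literal_eval(val)      # numbers, booleans, lists, ...
        except (ValueError, SyntaxError):
            config[key] = val                    # bare strings such as device names
print(config)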
To generate text using the trained NanoGPT model, use the following command:
python test.py --start <start_text>
Replace <start_text> with the desired starting text for text generation.
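Under the hood, generation typically means loading the checkpoint written by train.py, encoding the starting text, and sampling new tokens with the model's generate method. The sketch below assumes a nanoGPT-like model.py and checkpoint layout, plus GPT-2 BPE encoding; the repository's test.py is authoritative and may use a different tokenizer (for example, a character-level vocabulary):
# hypothetical sampling flow; paths, checkpoint keys, and tokenizer are assumptions
import torch
import tiktoken
from model import GPT, GPTConfig   # model definition assumed to ship with the repo

device = "cpu"
ckpt = torch.load("out/ckpt.pt", map_location=device)
model = GPT(GPTConfig(**ckpt["model_args"]))
model.load_state_dict(ckpt["model"])
model.eval()

enc = tiktoken.get_encoding("gpt2")
start = "To be, or not to be"
idx = torch.tensor([enc.encode_ordinary(start)], dtype=torch.long, device=device)

with torch.no_grad():
    out = model.generate(idx, max_new_tokens=200, temperature=0.8, top_k=200)
print(enc.decode(out[0].tolist()))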
To run the Uvicorn server and use the API for text generation, follow these steps:
- Start the server:
uvicorn app.main:app --reload
- Open a new terminal and use the curl command to request a text generation:
echo -e "Enter the start value: "
read start
echo -e "\nGenerating Text...\n"
curl -X 'POST' \
'http://127.0.0.1:8000/generate-text' \
-H 'accept: application/json' \
-H 'Content-Type: application/json' \
-d '{
"start": "'"$start"'"
}'
Enter the desired starting text when prompted; the generated text will be displayed in the terminal.
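For reference, the /generate-text endpoint in app/main.py accepts a JSON body with a single "start" field, matching the curl request above. Below is a minimal, hypothetical sketch of such an endpoint; the real handler calls the trained model instead of the placeholder:
# hypothetical shape of app/main.py; the real endpoint calls the trained NanoGPT model
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class GenerateRequest(BaseModel):
    start: str                      # the "start" field sent by the curl command

@app.post("/generate-text")
def generate_text(req: GenerateRequest):
    generated = req.start + " ..."  # placeholder for the actual model generation call
    return {"generated_text": generated}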
Alternatively, you can run the script.sh file to prepare the data, train the model, and run the API in one go.
chmod +x script.sh
./script.sh
To run the gRPC implementation, follow these steps (a hypothetical sketch of the server and client code appears after the list):
- Start the gRPC server by running the rpc.py script:
python rpc.py
This will start the gRPC server and make it ready to accept requests.
- Start the FastAPI server by running server2:app with uvicorn:
uvicorn server2:app --host 0.0.0.0 --port 8080
This will start the FastAPI server, which will act as a client to communicate with the gRPC server.
- Enter the start value for text generation when prompted:
echo -e "Enter the start value: "
read start
This will allow you to input the start value, which will be used as a parameter for text generation.
- Generate the text by making an HTTP request to the FastAPI server:
echo -e "\nGenerating Text...\n"
curl -X 'POST' \
'http://127.0.0.1:8080/generate-text' \
-H 'accept: application/json' \
-H 'Content-Type: application/json' \
-d '{
"start": "'"$start"'"
}'
echo -e "\n\ntext generation done!!\n"
This will send an HTTP POST request to the FastAPI server with the specified start value. The server will then forward the request to the gRPC server for text generation.
- Observe the generated text in the response from the FastAPI server.
Ensure that both the gRPC server and FastAPI server are running simultaneously before making the HTTP request for text generation.
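For orientation, the gRPC setup usually consists of a .proto file defining a text-generation service, a Python server (rpc.py) implementing it, and the FastAPI app (server2.py) calling it through a generated stub. The sketch below is hypothetical: the actual service, message, and module names come from the repository's .proto file, here assumed to be a textgen.proto compiled with grpcio-tools:
# hypothetical sketch; module, service, and message names depend on the repository's .proto
from concurrent import futures
import grpc
import textgen_pb2          # assumed generated message module
import textgen_pb2_grpc     # assumed generated service module

# --- server side (roughly what rpc.py does) ---
class TextGenService(textgen_pb2_grpc.TextGenServicer):
    def Generate(self, request, context):
        generated = request.start + " ..."          # placeholder for the real model call
        return textgen_pb2.GenerateReply(text=generated)

def serve():
    server = grpc.server(futures.ThreadPoolExecutor(max_workers=4))
    textgen_pb2_grpc.add_TextGenServicer_to_server(TextGenService(), server)
    server.add_insecure_port("[::]:50051")
    server.start()
    server.wait_for_termination()

# --- client side (roughly what server2.py does inside /generate-text) ---
def generate_via_grpc(start: str) -> str:
    with grpc.insecure_channel("localhost:50051") as channel:
        stub = textgen_pb2_grpc.TextGenStub(channel)
        reply = stub.Generate(textgen_pb2.GenerateRequest(start=start))
        return reply.text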
Alternatively, you can run the grpc.sh script to start the gRPC implementation in one go.
chmod +x grpc.sh
./grpc.sh
You have now successfully set up and used NanoGPT for text generation!
Note: Please refer to the original repository or documentation for any further details or updates on NanoGPT.