LLM Pre/Post processing

This project run inference for LLM model and profiles for TTFT

Machice Requirements

Arch: x86_64 or arm64
OS : Ubuntu 18 to 22
RAM : Min 6Gb
ROM : Min 64Gb

Prerequisite

Install python 3.10

Create a virtual environment

venv <path_to_python3.10> <env_name>
source <envpath>/bin/activate

Installation

git clone -b main https://github.com/sudhir-mcw/llm.git
cd llm
pip install -r requirements.txt

Download LLAMA2 Model

Place the model just outside llm repo

cd <to_folder_outside_llm_repo>
wget https://drive.google.com/file/d/1ny1GhyIvT-hPoad3ROewgqQ5YsrGWZ8f/view?usp=drive_link
unzip TinyLlama-1.1B-Chat-v0.6.zip

Run

cd llm
python run_inference.py

Run for TTFT

Replace the files from llm/util_changes with

path_to_pyvenv/lib/python3.12/site-packages/transformers/generation/utils.py
path_to_pyvenv/lib/python3.12/site-packages/transformers/generation/configuration_utils.py

cd llm
python run_inference_ttft.py

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
llama		llama
util_changes		util_changes
.gitignore		.gitignore
README.md		README.md
bench.py		bench.py
download.sh		download.sh
example_chat_completion.py		example_chat_completion.py
example_text_completion.py		example_text_completion.py
params.json		params.json
requirements.txt		requirements.txt
run_inference.py		run_inference.py
run_inference_ttft.py		run_inference_ttft.py
setup.py		setup.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Pre/Post processing

Machice Requirements

Prerequisite

Installation

Download LLAMA2 Model

Run

Run for TTFT

About

Releases

Packages

Contributors 2

Languages

sudhir-mcw/llm

Folders and files

Latest commit

History

Repository files navigation

LLM Pre/Post processing

Machice Requirements

Prerequisite

Installation

Download LLAMA2 Model

Run

Run for TTFT

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages