Mojo - LangTrain

We developed LangTrain to make it easier to teach LLMs your weird programming language. Internally we use a programming language called LL that is proprietary to us. No LLM like Code Llama or GPT-4 has been trained on such a language, which is why we built LangTrain.

Demo Video!

Video Link

Required Dependencies

Node
requirements.txt

Required Models

https://huggingface.co/kirp/TinyLlama-1.1B-Chat-v0.2-bin/resolve/main/tok_tl-chat.bin

https://huggingface.co/kirp/TinyLlama-1.1B-Chat-v0.2-bin/resolve/main/tl-chat.bin

These files should be placed in the repository root (/langtrain). They are used for inference in Mojo.
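As a quick sanity check before starting the servers, a small script can confirm that both files are where the inference code expects them. This is only a sketch: the `missing_model_files` helper is hypothetical and not part of this repository, and it assumes the downloads keep the file names from the URLs above.

```python
from pathlib import Path

# Assumption: the files are saved under the same names as in the download URLs.
MODEL_FILES = ("tok_tl-chat.bin", "tl-chat.bin")


def missing_model_files(root="."):
    """Return the names of required model files not found directly under root."""
    root_path = Path(root)
    return [name for name in MODEL_FILES if not (root_path / name).is_file()]


if __name__ == "__main__":
    missing = missing_model_files()
    if missing:
        print("Missing model files:", ", ".join(missing))
    else:
        print("All model files found.")
```

Run it from the repository root so that the default path `.` points at /langtrain.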

Startup

Install the pip dependencies from requirements.txt first: pip install -r requirements.txt
If you are using conda, make sure that environment is activated.

Frontend:

mojo ctk.mojo

Backend:

mojo api_server.mojo

At this point you should have both the backend inference server and the front-end customtkinter interface running.

Concepts

What exactly are we trying to do here? Here is a diagram to help explain:

The LLM has been informed in some way about the syntax and semantics of your language. That could be through simple prompt engineering, but it may be more effective and concise to teach it an EBNF grammar (we are looking into this).
When the LLM generates code based on the user prompt, that code is validated by the interpreter. The result is then sent to the human in the loop for review.
The human validator can approve the code if it matches the prompt correctly, or disapprove it.
The reviewed examples are saved, and the saved data is used to fine-tune the LLM using LoRA or QLoRA.
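The loop described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the code in this repository: `ReviewedSample`, `build_prompt`, `review_once`, and `to_jsonl_line` are hypothetical names, the three callables stand in for the LLM, the language interpreter, and the human reviewer, and JSON Lines is just one convenient way to store the reviewed examples for later fine-tuning.

```python
import json
from dataclasses import asdict, dataclass
from typing import Callable, Tuple


@dataclass
class ReviewedSample:
    """One generated snippet together with the interpreter and human verdicts."""
    prompt: str
    code: str
    interpreter_ok: bool
    interpreter_log: str
    approved: bool


def build_prompt(language_notes: str, user_prompt: str) -> str:
    """Put the language description (prose or an EBNF grammar) before the request."""
    return f"{language_notes}\n\nTask: {user_prompt}\nCode:"


def review_once(
    user_prompt: str,
    language_notes: str,
    generate: Callable[[str], str],                   # stand-in for the LLM
    interpret: Callable[[str], Tuple[bool, str]],     # stand-in for the interpreter
    ask_human: Callable[[str, str, bool, str], bool]  # stand-in for the reviewer
) -> ReviewedSample:
    """Generate code, validate it with the interpreter, then ask the human."""
    code = generate(build_prompt(language_notes, user_prompt))
    ok, log = interpret(code)
    approved = ask_human(user_prompt, code, ok, log)
    return ReviewedSample(user_prompt, code, ok, log, approved)


def to_jsonl_line(sample: ReviewedSample) -> str:
    """Serialize one reviewed sample as a single JSON Lines record."""
    return json.dumps(asdict(sample))


if __name__ == "__main__":
    # Stub callables so the sketch runs without a model or an interpreter.
    sample = review_once(
        user_prompt="print a greeting",
        language_notes="(description or grammar of your language goes here)",
        generate=lambda prompt: "say 'hello'",
        interpret=lambda code: (True, "hello"),
        ask_human=lambda prompt, code, ok, log: ok,
    )
    print(to_jsonl_line(sample))
```

Storing the `approved` flag next to each sample keeps both outcomes available, so the choice of which records go into LoRA or QLoRA fine-tuning can be made later.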

Roadmap

Known Bugs

Server crashes randomly
Console does not display interpreter logs

Citations

Thanks to Aydyn, who put together the llama2.mojo that we use for inference on our server!

@misc{llama2.mojo,
  author = {Aydyn Tairov}, 
  title = {Inference Llama2 in one file of pure Mojo},
  year = {2023},
  month = {09},
  howpublished = {\url{https://github.com/tairov/llama2.mojo}},
  note = {Llama2 Mojo, MIT License}
}
