Skip to content
View gqcao's full-sized avatar

Block or report gqcao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM

35 repositories

LLM inference in C/C++

C++ 72,717 10,476 Updated Feb 2, 2025

Inference Llama 2 in one file of pure C

C 17,961 2,181 Updated Aug 6, 2024

Llama 2 Everywhere (L2E)

C 1,509 43 Updated Jan 16, 2025

Train transformer language models with reinforcement learning.

Python 10,975 1,460 Updated Feb 2, 2025

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,127 89 Updated Dec 12, 2024

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

1,127 56 Updated Sep 25, 2024

On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent

291 16 Updated Mar 14, 2024

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 72,093 7,836 Updated Feb 1, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,913 5,442 Updated Feb 2, 2025

[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,329 124 Updated Jan 24, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,358 887 Updated Jul 1, 2024

[Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"

Python 228 12 Updated Nov 15, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 45,891 5,478 Updated Dec 18, 2024
Python 672 70 Updated Jan 28, 2025

Work with LLMs on a local environment using containers

TypeScript 192 44 Updated Jan 31, 2025

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,131 88 Updated Aug 6, 2024

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Python 1,609 220 Updated Apr 9, 2024

Build large language model (LLM) apps with Python, ChatGPT and other models. This is the companion repository for the book on generative AI with LangChain.

Jupyter Notebook 699 287 Updated Feb 2, 2025

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 19,352 1,888 Updated Feb 2, 2025

Distribute and run LLMs with a single file.

C++ 21,506 1,118 Updated Jan 30, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 117,671 9,388 Updated Feb 2, 2025

LLamaCare is a large medical language model designed for healthcare knowledge sharing.

Python 22 2 Updated Jun 5, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 38,913 5,128 Updated Feb 1, 2025

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 52,913 5,158 Updated Jan 21, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,199 991 Updated Nov 18, 2024

The official Meta Llama 3 GitHub site

Python 28,158 3,250 Updated Jan 26, 2025

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,528 130 Updated Jan 27, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 65,665 7,787 Updated Feb 2, 2025

[ICLR2025] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

Python 484 39 Updated Jan 24, 2025

NanoGPT (124M) in 3 minutes

Python 2,181 221 Updated Feb 2, 2025