Stars
Unified framework for robot learning built on NVIDIA Isaac Sim
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A bibliography and survey of the papers surrounding o1
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
WildEval / ZeroEval
Forked from allenai/WildBenchA simple unified framework for evaluating LLMs
Benchmarking LLMs with Challenging Tasks from Real Users
Apache Spark - A unified analytics engine for large-scale data processing
DuckDB is an analytical in-process SQL database management system
Implementation of Nougat Neural Optical Understanding for Academic Documents
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
[COLM 2024] A Survey on Deep Learning for Theorem Proving
Open-Sora: Democratizing Efficient Video Production for All
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge managemen…
✨ Local and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model