-
Antmicro
- Gdańsk, Poland
Highlights
- Pro
AI/ML
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code…
Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Official Code for Stable Cascade
Enjoy the magic of Diffusion models!
Utilities intended for use with Llama models.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
🔊 Text-Prompted Generative Audio Model
Official inference repo for FLUX.1 models
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge managemen…
📺 Discover the latest machine learning / AI courses on YouTube.
Text-to-Music Generation with Rectified Flow Transformers
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
You like pytorch? You like micrograd? You love tinygrad! ❤️
An open-source RAG-based tool for chatting with your documents.
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.