Popular repositories

  1. Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python 4.3k 385

  2. Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3.4k 200

  3. Show-1 Public

    Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python 1.1k 62

  4. Show-o Public

    Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 1k 44

  5. MotionDirector Public

    [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

    Python 845 51

  6. Image2Paragraph Public

    [A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

    Python 790 54

Repositories

Showing 10 of 71 repositories
  • Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3,430 200 0 0 Updated Nov 14, 2024
  • computer_use_ootb Public

    An out-of-the-box (OOTB) version of Anthropic Claude Computer Use for Windows and macOS

    Python 250 MIT 23 8 2 Updated Nov 13, 2024
  • BoxDiff Public

    [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

    Python 250 17 7 0 Updated Nov 12, 2024
  • Show-o Public

    Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 1,018 Apache-2.0 44 30 0 Updated Nov 11, 2024
  • sparseformer Public

    (ICLR 2024, CVPR 2024) SparseFormer

    Python 63 MIT 2 1 0 Updated Nov 10, 2024
  • LOVA3 Public

    (NeurIPS 2024) Learning to Visual Question Answering, Asking and Assessment

    Python 63 1 0 0 Updated Nov 7, 2024
  • Awesome-Unified-Multimodal-Models Public

    📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

    209 9 0 0 Updated Nov 7, 2024
  • VideoLISA Public

    [NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

    30 1 2 0 Updated Nov 3, 2024
  • ShowUI Public
    4 0 0 0 Updated Oct 31, 2024
  • VisInContext Public

    Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

    Python 12 2 1 0 Updated Oct 30, 2024

People

This organization has no public members.