This repository contains SoTA algorithms, models, and interesting projects in the area of multimodal understanding and content generation
ONE is short for "ONE for all"
Hello MindSpore from Stable Diffusion 3!
-
mindone/diffusers now supports Stable Diffusion 3. Give it a try yourself!
import mindspore from mindone.diffusers import StableDiffusion3Pipeline pipe = StableDiffusion3Pipeline.from_pretrained( "stabilityai/stable-diffusion-3-medium-diffusers", mindspore_dtype=mindspore.float16, ) prompt = "A cat holding a sign that says 'Hello MindSpore'" image = pipe(prompt)[0][0] image.save("sd3.png")
model | features |
---|---|
cambrian | working on it |
minicpm-v | working on v2.6 |
internvl | working on v1.0 v1.5 v2.0 |
llava | working on llava 1.5 & 1.6 |
vila | working on it |
pllava | working on it |
hpcai open sora | support v1.0/1.1/1.2 large scale training with dp/sp/zero |
open sora plan | support v1.0/1.1/1.2 large scale training with dp/sp/zero |
stable diffusion | support sd 1.5/2.0/2.1, vanilla fine tune, lora, dreambooth, text inversion |
stable diffusion xl | support sai style(stability AI) vanilla fine tune, lora, dreambooth |
dit | support text to image fine tune |
latte | support uncondition text to image fine tune |
animate diff | support motion module and lora training |
video composer | support conditional video generation with motion transfer and etc. |
ip adapter | refactoring |
t2i-adapter | refactoring |
mindone diffusers is under active development, most tasks were tested with mindspore 2.2.10 and ascend 910 hardware.
component | features |
---|---|
pipeline | support text2image,text2video,text2audio tasks 30+ |
models | support audoencoder & transformers base models same as hf diffusers |
schedulers | support ddpm & dpm solver 10+ schedulers same as hf diffusers |
- mindspore 2.3.0 version adaption
- hf diffusers 0.30.0 version adaption