Skip to content

🎄MindOne v0.5.0 - Major Release

Latest

Choose a tag to compare

@Fzilan Fzilan released this 24 Dec 15:10
· 1 commit to v0.5.0 since this release

We're excited to announce the official release of MindOne v0.5.0, with enhanced community integration​ and significant performance improvements.

🚀 Key Highlights

  • mindone.diffusers: Compatible with 🤗 diffusers v0.35.2, preview supports for sota v0.36 pipelines
  • mindone.transformers: Compatible with 🤗 transformers v4.57.1
  • ComfyUI: Added initial ComfyUI integration support
  • MindSpore: Compatible with MindSpore 2.6.0 - 2.7.1

mindone.transformers updates

  • Major upgrade: Enhanced compatibility with 🤗 transformers v4.54 and v4.57.1.
  • 70+ new models added: Check support list here.

Base Updates

  • Transformers 4.54 base support (#1387)
  • Transformers 4.57 base support (#1445)

New Models

  • Vision Models: AIMv2 (#1456), DINOv3 ViT/ConvNeXt (v4.57.1) (#1439), SAM-HQ (v4.57.1) (#1457), Bria (#1384), Florence2 (#1453), EfficientLoftr (#1456), HGNet_v2 (#1395), Ovis2 (#1454)

  • Audio/Speech Models: Granite Speech (#1406), Kyutai Speech-to-Text (#1407), Voxtral (#1456), Parakeet (#1451), XCodec (#1452), Dia (#1404), CSM (#1399)

  • Text/Language Models: Llama4 (#1470), Arcee (#1470), Falcon H1 (#1465), Dots1 (#1469), SmolLM3 (v4.54.1) (#1391), ModernBERT Decoder (v4.54.1) (#1397), Hunyuan V1 Dense/MoE (v4.57.1) (#1401), Evolla (v4.54.1) (#1440), EXAONE (#1396), Doge (#1392), ERNIE 4.5 & ERNIE 4.5 MoE (#1393), GLM4 MoE (#1409), Flex OLMo (#1442), T5Gemma (#1420), VaultGemma (#1450), BLT/Apertus/Ministral (#1462), EOMT/TimesFM (#1403), Seed OSS (#1441), xLSTM (#1466), d_fine, GraniteMoeHybrid, EfficientLoFTR Models (#1405)

  • Multimodal Models: Qwen3 Omni (#1411), Qwen3 Next (#1476), ColQwen2 (v4.54.1) (#1414), Cohere2 Vision (v4.57.1) (#1473), InternVL (v4.57) (#1463), Janus (v4.57) (#1463), Kosmos-2.5 (#1456), LFM2/LFM2-VL (#1456), MetaCLIP 2 (#1456), Mlcd (#1472), SAM2 (#1426), SAM2 Video Support (#1434), Olmo3 Model (#1467), DeepseekV2/DeepseekVL/DeepseekVLHybrid (#1477), MM Grounding DINO (#1486)

  • model updates: update Mistral3 to v4.57.1 (#1464), update Qwen2.5VL to v4.54.1 (#1421)

multimodal processors for vllm-mindspore community

  • Qwen2.5VL ImageProcessor Fast / VideoProcessor (#1429)
  • Qwen3_VL Video Processor & Qwen2_VL Image Processor Fast (#1419)
  • Phi4/Whisper/Ultravox/InternVL/Qwen2_audio/MiniCPMV/LLaVA-Next/LLaVA-Next-Video processors (#1471)

mindone.diffusers updates

New Features

  • 🚀 Context parallelism: Ring & Ulysses & Unified Attention (#1438)
  • Added AutoencoderMixin (#1444)

New Pipelines

  • Kandinsky5 (#1388), Lucy (#1390), etc.
  • Enable multi-card Inference for flux2 Pipeline (zero-3 sharding) #1446

ComfyUI Integration

  • Added ComfyUI root files and CLI args (#1480)
  • Added text encoder files (#1481)
  • Updated clip_model.py (#1479)

Examples Updates

  • Added Wan2.2 LoRA finetune support (#1418)
  • Updated Emu3 performance for MindSpore 2.6.0 and 2.7.0 (#1417)
  • Updated HunyuanVideo-I2V to mindspore 2.6.0 and 2.7.0 (#1385)
  • 🚀 Add accelerated dit pipelines compatible with mindspore Graph Mode (#1433)
  • 🚀 Added Fb cache taylorseer graph mode implementation for Flux.1 (#1475)
  • Qwenimage LoRA fintune supports.#1394)

Fixed

  • Fixed AIMv2/Arcee rely on torch bug (#1485)
  • Fixed bugs of mindone.transformers models that rely on torch (#1482)
  • Fixed Qwen2.5VLProcessor tokenizer converting tensor bug (#1483)
  • Fixed Qwen3_VL text attention selection bug (#1455)
  • Fixed GLM4.1V bs>1 generation index bug (#1437)
  • Fixed training issue in TrainOneStepWrapper (#1408)
  • Fixed import error if env contains accelerate module (#1431)
  • ZeRO: Support training with MS 2.6.0 and 2.7.0 (#1383)
  • Misc bugfixes (#1424)
  • Fixed some diffusers bugs (#1448)
  • Docs updates for mindone v0.5.0 release, and ut fixes (#1484)

Statistics

  • Total commits: 82
  • Files changed: 798
  • Lines added: 157,122
  • Lines deleted: 22,303

🙏 Acknowledgments

Special thanks to our amazing contributors who helped shape MindOne v0.5.0!

Andy Zhou, Chaoran Wei, Cheung Ka Wai, Cui-yshoho, Didan Deng, Feiran Zhang, Fzilan, GUOGUO, Rustam Khadipash, The-truthh, YMC, Yingshu CHEN, alien-0119, jijiarong, liuchuting, vigo999, zackcxb, zyd-ustc

Together We Build, Together We Grow. Thanks to every open source maintainer, contributor, and user. ✨

Start your AI model development journey with MindOne v0.5.0 today! 🚀

đź“– Full Changelog: CHANGELOG.md