🎄MindOne v0.5.0 - Major Release

We're excited to announce the official release of MindOne v0.5.0, with enhanced community integration and significant performance improvements.

🚀 Key Highlights

mindone.diffusers: Compatible with 🤗 diffusers v0.35.2, preview supports for sota v0.36 pipelines
mindone.transformers: Compatible with 🤗 transformers v4.57.1
ComfyUI: Added initial ComfyUI integration support
MindSpore: Compatible with MindSpore 2.6.0 - 2.7.1

mindone.transformers updates

Major upgrade: Enhanced compatibility with 🤗 transformers v4.54 and v4.57.1.
70+ new models added: Check support list here.

Base Updates

Transformers 4.54 base support (#1387)
Transformers 4.57 base support (#1445)

New Models

Vision Models: AIMv2 (#1456), DINOv3 ViT/ConvNeXt (v4.57.1) (#1439), SAM-HQ (v4.57.1) (#1457), Bria (#1384), Florence2 (#1453), EfficientLoftr (#1456), HGNet_v2 (#1395), Ovis2 (#1454)
Audio/Speech Models: Granite Speech (#1406), Kyutai Speech-to-Text (#1407), Voxtral (#1456), Parakeet (#1451), XCodec (#1452), Dia (#1404), CSM (#1399)
Text/Language Models: Llama4 (#1470), Arcee (#1470), Falcon H1 (#1465), Dots1 (#1469), SmolLM3 (v4.54.1) (#1391), ModernBERT Decoder (v4.54.1) (#1397), Hunyuan V1 Dense/MoE (v4.57.1) (#1401), Evolla (v4.54.1) (#1440), EXAONE (#1396), Doge (#1392), ERNIE 4.5 & ERNIE 4.5 MoE (#1393), GLM4 MoE (#1409), Flex OLMo (#1442), T5Gemma (#1420), VaultGemma (#1450), BLT/Apertus/Ministral (#1462), EOMT/TimesFM (#1403), Seed OSS (#1441), xLSTM (#1466), d_fine, GraniteMoeHybrid, EfficientLoFTR Models (#1405)
Multimodal Models: Qwen3 Omni (#1411), Qwen3 Next (#1476), ColQwen2 (v4.54.1) (#1414), Cohere2 Vision (v4.57.1) (#1473), InternVL (v4.57) (#1463), Janus (v4.57) (#1463), Kosmos-2.5 (#1456), LFM2/LFM2-VL (#1456), MetaCLIP 2 (#1456), Mlcd (#1472), SAM2 (#1426), SAM2 Video Support (#1434), Olmo3 Model (#1467), DeepseekV2/DeepseekVL/DeepseekVLHybrid (#1477), MM Grounding DINO (#1486)
model updates: update Mistral3 to v4.57.1 (#1464), update Qwen2.5VL to v4.54.1 (#1421)

multimodal processors for vllm-mindspore community

Qwen2.5VL ImageProcessor Fast / VideoProcessor (#1429)
Qwen3_VL Video Processor & Qwen2_VL Image Processor Fast (#1419)
Phi4/Whisper/Ultravox/InternVL/Qwen2_audio/MiniCPMV/LLaVA-Next/LLaVA-Next-Video processors (#1471)

mindone.diffusers updates

New Features

🚀 Context parallelism: Ring & Ulysses & Unified Attention (#1438)
Added AutoencoderMixin (#1444)

New Pipelines

Kandinsky5 (#1388), Lucy (#1390), etc.
Enable multi-card Inference for flux2 Pipeline (zero-3 sharding) #1446

ComfyUI Integration

Added ComfyUI root files and CLI args (#1480)
Added text encoder files (#1481)
Updated clip_model.py (#1479)

Examples Updates

Added Wan2.2 LoRA finetune support (#1418)
Updated Emu3 performance for MindSpore 2.6.0 and 2.7.0 (#1417)
Updated HunyuanVideo-I2V to mindspore 2.6.0 and 2.7.0 (#1385)
🚀 Add accelerated dit pipelines compatible with mindspore Graph Mode (#1433)
🚀 Added Fb cache taylorseer graph mode implementation for Flux.1 (#1475)
Qwenimage LoRA fintune supports.#1394)

Fixed

Fixed AIMv2/Arcee rely on torch bug (#1485)
Fixed bugs of mindone.transformers models that rely on torch (#1482)
Fixed Qwen2.5VLProcessor tokenizer converting tensor bug (#1483)
Fixed Qwen3_VL text attention selection bug (#1455)
Fixed GLM4.1V bs>1 generation index bug (#1437)
Fixed training issue in TrainOneStepWrapper (#1408)
Fixed import error if env contains accelerate module (#1431)
ZeRO: Support training with MS 2.6.0 and 2.7.0 (#1383)
Misc bugfixes (#1424)
Fixed some diffusers bugs (#1448)
Docs updates for mindone v0.5.0 release, and ut fixes (#1484)

Statistics

Total commits: 82
Files changed: 798
Lines added: 157,122
Lines deleted: 22,303

🙏 Acknowledgments

Special thanks to our amazing contributors who helped shape MindOne v0.5.0!

Andy Zhou, Chaoran Wei, Cheung Ka Wai, Cui-yshoho, Didan Deng, Feiran Zhang, Fzilan, GUOGUO, Rustam Khadipash, The-truthh, YMC, Yingshu CHEN, alien-0119, jijiarong, liuchuting, vigo999, zackcxb, zyd-ustc

Together We Build, Together We Grow. Thanks to every open source maintainer, contributor, and user. ✨

Start your AI model development journey with MindOne v0.5.0 today! 🚀

📖 Full Changelog: CHANGELOG.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly