Pulse · modelscope/ms-swift

December 18, 2024 – December 25, 2024

Overview

32 Active pull requests

49 Active issues

1 Release published by 1 person

v3.0.0
published Dec 23, 2024

31 Pull requests merged by 3 people

fix alpaca
#2771 merged Dec 26, 2024
support modern_bert & support bert deploy
#2767 merged Dec 26, 2024
fix app-ui
#2765 merged Dec 25, 2024
fix shell
#2764 merged Dec 25, 2024
fix bugs
#2761 merged Dec 25, 2024
fix web-ui
#2758 merged Dec 25, 2024
support SequenceClassification & update QVQ-72B-Preview
#2747 merged Dec 24, 2024
fix docs multimodal
#2742 merged Dec 24, 2024
Fix windows encoding gbk
#2741 merged Dec 24, 2024
support AI-ModelScope/Skywork-o1-Open-Llama-3.1-8B
#2739 merged Dec 23, 2024
support multi-modal llamapro
#2738 merged Dec 23, 2024
fix windows
#2733 merged Dec 23, 2024
support paligemma2
#2735 merged Dec 23, 2024
remove files
#2732 merged Dec 23, 2024
update examples
#2730 merged Dec 23, 2024
support iic/DocOwl2
#2728 merged Dec 23, 2024
Support more internvl2.5 awq/mpo & internvl2 pretrain model
#2726 merged Dec 22, 2024
Support qwen agent format
#2722 merged Dec 21, 2024
fix batch_infer pad_token & florence
#2725 merged Dec 21, 2024
Fix mplug owl2, molmo
#2724 merged Dec 21, 2024
Better error messages
#2721 merged Dec 20, 2024
fix gptq group_size
#2720 merged Dec 20, 2024
fix examples
#2719 merged Dec 20, 2024
fix deploy request_config
#2718 merged Dec 20, 2024
update examples
#2714 merged Dec 20, 2024
support Qwen/QVQ-72B-Preview
#2712 merged Dec 19, 2024
Fix multi lora
#2711 merged Dec 19, 2024
fix timeout & web-ui
#2709 merged Dec 19, 2024
qwen to Qwen
#2708 merged Dec 19, 2024
Update FAQ
#2706 merged Dec 19, 2024
fix eval strategy
#2707 merged Dec 19, 2024

17 Unresolved conversations

Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.

能否集成一些自动化调参工具，并且提供一下3.0的pip包，现在download还是2.6.1版本的包。
#2705 commented on Dec 19, 2024 • 0 new comments
merge multi loras
#2702 commented on Dec 19, 2024 • 0 new comments
多模态模型多轮对话训练数据格式
#2673 commented on Dec 20, 2024 • 0 new comments
BUG: 训练集中包含特殊字符<unk>时有报错
#2688 commented on Dec 23, 2024 • 0 new comments
3.0 --device_max_memory
#2655 commented on Dec 23, 2024 • 0 new comments
Full SFT 的EMA支持
#2685 commented on Dec 23, 2024 • 0 new comments
v2.3.2以后InternVL2-26B训练比之前慢15%左右
#2502 commented on Dec 23, 2024 • 0 new comments
GOT-OCR 2.0 训练新的prompt数据后发现推理没有任何效果
#2259 commented on Dec 24, 2024 • 0 new comments
qwen2-vl 系列无法awq量化
#2649 commented on Dec 24, 2024 • 0 new comments
Best Practices for Inference and Fine-Tuning with MiniCPM-V 2.6
#1613 commented on Dec 24, 2024 • 0 new comments
pretrain报错进度异常问题
#2692 commented on Dec 24, 2024 • 0 new comments
Best practice for Qwen2-Audio
#1653 commented on Dec 24, 2024 • 0 new comments
lora 微调的模型使用--resume_from_checkpoint参数，继续训练报显存不足；不使用--resume_from_checkpoint参数可以正常训练
#2505 commented on Dec 25, 2024 • 0 new comments
Visualization of Grounding Tasks
#2635 commented on Dec 25, 2024 • 0 new comments
mplug-owl3-7b-chat fine-tuning document
#1969 commented on Dec 25, 2024 • 0 new comments
qwen2-vl 的 pretrain 是否支持
#2222 commented on Dec 26, 2024 • 0 new comments
ms-swift3 Suggestion Box
#2217 commented on Dec 26, 2024 • 0 new comments

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

December 18, 2024 – December 25, 2024

Overview

Could not load contribution data

1 Release published by 1 person

31 Pull requests merged by 3 people

1 Pull request opened by 1 person

24 Issues closed by 8 people

25 Issues opened by 22 people

17 Unresolved conversations

Insights: modelscope/ms-swift

December 18, 2024 – December 25, 2024

Overview

Could not load contribution data

1 Release published by 1 person

31 Pull requests merged by 3 people

1 Pull request opened by 1 person

24 Issues closed by 8 people

25 Issues opened by 22 people

17 Unresolved conversations