swift

Get Started

SWIFT Installation
Quick Start
Web-UI

Instruction

Command Line Parameters
Pre-training and Fine-tuning
GRPO
GKD
RLHF
Inference and Deployment
Sampling
Evaluation
Export and Push
Ray Support
Reinforced Fine-Tuning
Agent Support
Supported Models and Datasets
Using Tuners
Frequently-asked-questions

Megatron-SWIFT

Quick Start
Command Line Arguments
LoRA Training
Multimodal Models
Mcore Bridge
Megatron GRPO
GKD
Ascend NPU
NPU Accuracy Data Collection
Custom Megatron Model

Customization

Architecture Introduction
Custom Model
Custom Dataset

Best Practices

Complete GRPO Experiment Process
Complete Multimodal GRPO Experiment Workflow
Code Training with GRPO
Qwen3 Best Practices
Qwen3-VL Best Practices
Qwen3.5 Best Practices
DeepSeek-V4 Training Support
Best Practices for Registering Multimodal Models
Embedding Training
Reranker Training
Best Practices for Rapidly Training Vision-Language (VL) Models
NPU Support
Metax Support
AMD GPU Support
More Best Practices

swift

GRPO
View page source

GRPO

Get Started

Get Started
- GRPO

Developer Guide

Developer Guide

Advanced Research

Advanced Research

Previous Next

© Copyright 2022-2025, Alibaba ModelScope.

Built with Sphinx using a theme provided by Read the Docs.