Developer Guide
Multi-Turn Rollout
Multi-Task Training
Reward Function
Reward Model