swift
Get Started
SWIFT Installation
Quick Start
Web-UI
Instruction
Command Line Parameters
Pre-training and Fine-tuning
GRPO
Get Started
Developer Guide
Multi-turn Training
Multi-Task Training
Reward Function
Reward Model
GYM Environment Training
Advanced Research
RLHF
Inference and Deployment
Sampling
Evaluation
Export and Push
Reinforced Fine-Tuning
Agent Support
Supported Models and Datasets
Using Tuners
Frequently-asked-questions
Megatron-SWIFT
Quick Start
Command Line Arguments
LoRA Training
Multimodal Models
Customization
Custom Model
Custom Dataset
Pluginization
Best Practices
Complete GRPO Experiment Process
Complete Multimodal GRPO Experiment Workflow
Code Training with GRPO
Qwen3 Best Practices
Qwen3-VL Best Practices
Best Practices for Registering Multimodal Models
Embedding Training
Reranker Training
Best Practices for Rapidly Training Vision-Language (VL) Models
NPU Support
More Best Practices
swift
GRPO
Developer Guide
View page source
Developer Guide
Multi-turn Training
Multi-Task Training
Reward Function
Reward Model
GYM Environment Training