swift

Get Started

  • SWIFT Installation
  • Quick Start
  • Web-UI

Instruction

  • Command Line Parameters
  • Pre-training and Fine-tuning
  • GRPO
  • GKD
  • RLHF
  • Inference and Deployment
  • Sampling
  • Evaluation
  • Export and Push
  • Ray Support
  • Reinforced Fine-Tuning
  • Agent Support
  • Supported Models and Datasets
  • Using Tuners
  • Frequently-asked-questions

Megatron-SWIFT

  • Quick Start
  • Command Line Arguments
  • LoRA Training
  • Multimodal Models
  • Mcore Bridge
  • Megatron GRPO
  • GKD
  • Ascend NPU
  • NPU Accuracy Data Collection
  • Custom Megatron Model

Customization

  • Architecture Introduction
  • Custom Model
  • Custom Dataset

Best Practices

  • Complete GRPO Experiment Process
  • Complete Multimodal GRPO Experiment Workflow
  • Code Training with GRPO
  • Qwen3 Best Practices
  • Qwen3-VL Best Practices
  • Qwen3.5 Best Practices
  • DeepSeek-V4 Training Support
  • Best Practices for Registering Multimodal Models
  • Embedding Training
  • Reranker Training
  • Best Practices for Rapidly Training Vision-Language (VL) Models
  • NPU Support
  • Metax Support
  • AMD GPU Support
  • More Best Practices
swift
  • Search


© Copyright 2022-2025, Alibaba ModelScope.

Built with Sphinx using a theme provided by Read the Docs.