A Python library for training VLMs using GRPO with Unsloth acceleration.
git clone https://github.com/agentsea/uvg.git
cd uvg
uv sync && uv pip install flash-attn --no-build-isolation && uv pip install -e .To add Qwen-specific deps:
uv pip install -e ".[qwen]"Check the examples/ folder.