Enhance support for VL models #501

chenyushuo · 2026-02-09T04:16:56Z

Description

Modified the tokenizer in TinkerModel to directly retrieve it from the sample client.
Updated the SFTFormatter, vLLMRolloutModel, SimpleMMWorkflow, mm_utils.py, verl_trainer.py and trainer/verl/utils.py to enhance multimodal processing capabilities, enabling compatibility with Qwen-series vision-language models and moonshotai/Kimi-VL-A3B-Thinking.
Remove pad_token_id in config.
Add trust_remote_code to config.
Add get_model_class for fsdp worker and fsdp checkpoint manager.
Bug fix in chord_policy_loss.py.

Checklist

Please check the following items before code is ready to be reviewed.

Code has passed all tests
Docstrings have been added/updated in Google Style
Documentation has been updated
Code is ready for review

…om the sample client. 2. Updated the `SFTFormatter`, `vLLMRolloutModel`, `SimpleMMWorkflow`, `mm_utils.py`, `verl_trainer.py` and `trainer/verl/utils.py` to enhance multimodal processing capabilities, enabling compatibility with Qwen-series vision-language models.

gemini-code-assist · 2026-02-09T04:17:25Z

Summary of Changes

Hello @chenyushuo, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the framework's support for Vision-Language (VL) models, particularly the Qwen-series. The changes streamline how multimodal inputs are processed and handled throughout the system, from data formatting and model interaction to training. By integrating multimodal capabilities directly into core chat and generate methods and updating underlying utility functions, the PR aims to simplify the development and deployment of VL models while ensuring compatibility with newer versions of vLLM and transformers.

Highlights

Enhanced Multimodal Processing: The core framework has been significantly updated to improve support for Vision-Language (VL) models, particularly the Qwen-series. This includes refactoring data formatting and integrating new utility functions for multimodal input handling.
Streamlined API for Multimodal Inputs: Several multimodal-specific API methods (e.g., generate_mm, chat_mm) in vLLMRolloutModel and ModelWrapper have been deprecated. Multimodal input handling is now integrated directly into the general chat and generate methods, simplifying model interaction.
Dependency Updates: The required versions for vllm and transformers have been updated across example configurations and pyproject.toml to ensure compatibility with the latest features and improvements.
Tokenizer Initialization Improvement: The TinkerModel now directly retrieves its tokenizer from the model instance, simplifying the initialization process.
Integrated Multimodal Training Data Handling: The verl_trainer now uses a more integrated approach for processing multimodal training data, leveraging the model's processor to correctly handle position IDs and multimodal inputs during the conversion to DataProto.

Changelog

examples/grpo_vlm/README.md
- Updated vllm requirement from >=0.9.1,<0.10.0 to >=0.10.2.
- Updated transformers requirement from <4.53.0 to >=4.54.0.
examples/mix_vlm/README.md
- Updated vllm requirement from >=0.9.1,<0.10.0 to >=0.10.2.
- Updated transformers requirement from <4.53.0 to >=4.54.0.
pyproject.toml
- Updated tinker dependency to tinker>=0.10.0.
- Added transformers>=4.54.0 to the mm extra dependencies.
trinity/buffer/schema/formatter.py
- Removed fallback to self.tokenizer.chat_template for self.chat_template initialization.
- Removed mm_data parameter from _messages_to_experience method signature.
- Refactored multimodal experience conversion logic, removing load_mm_data and convert_mm_data_to_experiences and integrating new mm_utils functions.
- Modified format method to use build_mm_message for user content when multimodal data is present.
trinity/common/models/mm_utils.py
- Updated comment to include support for Qwen2.5/3 VL series.
- Introduced build_multi_modal_data for preprocessing multimodal data.
- Introduced build_mm_input_for_training for creating multimodal inputs for training.
- Introduced build_mm_message for constructing multimodal messages.
- Introduced has_multi_modal_content to check for multimodal content in messages.
- Deprecated build_multi_modal_inputs and convert_messages_to_mm_format.
trinity/common/models/model.py
- Added @Deprecated decorator to generate_mm, generate_mm_async, chat_mm, and chat_mm_async methods.
trinity/common/models/tinker_model.py
- Modified _initialize_tokenizer to retrieve the tokenizer directly from self.model instead of trainer_client.
trinity/common/models/vllm_model.py
- Updated imports to include Union and new mm_utils functions.
- Made _initialize_processor an async method and ensured it calls _initialize_tokenizer.
- Refactored chat method to handle multimodal messages by dynamically using the processor and preparing multimodal prompts.
- Modified generate method to accept Union[str, Dict] for prompt, allowing it to process both text and multimodal inputs, and to include multi_modal_inputs in the Experience object.
- Added @Deprecated decorator to chat_mm and generate_mm methods.
trinity/common/workflows/simple_mm_workflow.py
- Imported build_mm_message.
- Added format_messages method to construct messages using build_mm_message.
- Simplified reset method to directly access images/videos from the raw task data.
- Removed explicit calls to chat_mm and chat_mm_async, relying on the refactored chat and chat_async methods to handle multimodal content.
trinity/trainer/verl/utils.py
- Imported ProcessorMixin.
- Modified to_data_proto to accept a processor argument.
- Updated to_data_proto to use the provided processor for calculating position_ids and handling multi_modal_inputs for multimodal experiences.
trinity/trainer/verl_trainer.py
- Removed DataProto import.
- Removed self.pad_token_id attribute initialization.
- Updated train_step to pass self.tokenizer.pad_token_id and self.processor to to_data_proto.
- Removed the post_process_batch method.

Activity

The pull request is currently marked as 'Work In Progress' (WIP).
The author has provided a checklist for code readiness, indicating ongoing development and self-review.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request significantly enhances support for Vision-Language models by refactoring the multi-modal data processing pipeline. The changes centralize logic, unify text and multi-modal generation paths, and simplify higher-level components like workflows, which is a great architectural improvement. However, I've identified a critical bug due to a missing import, along with some high-severity concerns regarding potential data loss of attention_mask and inadequate error handling that could lead to an IndexError. Please review the detailed comments for suggestions on how to address these issues.

trinity/common/models/mm_utils.py

trinity/buffer/schema/formatter.py

trinity/common/models/vllm_model.py

2. Fix `GPUMemoryValidator` for VL model

2. Remove `pad_token_id` in config. 3. Add `trust_remote_code` to config. 4. Add `get_model_class` for fsdp worker and fsdp checkpoint manager.

…ance_vl

chenyushuo · 2026-02-11T04:28:32Z

/unittest-module-trainer

github-actions · 2026-02-11T05:14:58Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
27	22	1	4	0	0	44m 1s

Failed Tests

Failed Tests ❌	Fail Message
❌ tests/trainer/trainer_test.py::TestTrainerSFTWarmupGSM8K::test_trainer	The test failed in the call phase due to an assertion error

Skipped

Tests	Status
tests/trainer/trainer_test.py::TestMultiModalSFT::test_trainer	skipped ⏭️
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer	skipped ⏭️
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer_class	skipped ⏭️
tests/trainer/trainer_test.py::AgentScopeTunerTest::test_agentscope_tuner	skipped ⏭️

Tests

Test Name	Status	Duration
tests/trainer/trainer_test.py::TestTrainerCountdown_0_fsdp::test_trainer	✅	4m 5s
tests/trainer/trainer_test.py::TestTrainerCountdown_1_megatron::test_trainer	✅	5m 15s
tests/trainer/trainer_test.py::TestStepAheadAsyncRL::test_trainer	✅	1m 40s
tests/trainer/trainer_test.py::TestTrainerGSM8K_0_fsdp::test_trainer	✅	1m 6s
tests/trainer/trainer_test.py::TestTrainerGSM8K_1_fsdp2::test_trainer	✅	1m 5s
tests/trainer/trainer_test.py::TestTrainerGSM8K_2_fsdp::test_trainer	✅	1m 5s
tests/trainer/trainer_test.py::TestTrainerGSM8K_3_fsdp2::test_trainer	✅	1m 9s
tests/trainer/trainer_test.py::TestTrainerSFTWarmupGSM8K::test_trainer	❌	1.1s
tests/trainer/trainer_test.py::TestTrainerDPO::test_trainer	✅	33.3s
tests/trainer/trainer_test.py::TestTrainerSFT::test_trainer	✅	29.9s
tests/trainer/trainer_test.py::TestTrainerToolsSFT::test_trainer_tools	✅	29.9s
tests/trainer/trainer_test.py::TestFullyAsyncMode_0_fsdp::test_fully_async_mode	✅	1m 35s
tests/trainer/trainer_test.py::TestFullyAsyncMode_1_fsdp::test_fully_async_mode	✅	1m 39s
tests/trainer/trainer_test.py::TestFullyAsyncMode_2_megatron::test_fully_async_mode	✅	2m 24s
tests/trainer/trainer_test.py::TestTrainerCheckpointSave_0_fsdp::test_trainer	✅	2m 53s
tests/trainer/trainer_test.py::TestTrainerCheckpointSave_1_megatron::test_trainer	✅	5m 43s
tests/trainer/trainer_test.py::TestTrainerMIX::test_trainer	✅	1m 52s
tests/trainer/trainer_test.py::TestServeWithTrainer::test_serve_with_trainer	✅	1m 47s
tests/trainer/trainer_test.py::TestMultiModalGRPO::test_trainer	✅	1m 52s
tests/trainer/trainer_test.py::TestMultiModalSFT::test_trainer	⏭️	1.1s
tests/trainer/trainer_test.py::TestTrainerLoRA::test_trainer	✅	3m 16s
tests/trainer/trainer_test.py::TestOverRollout::test_trainer	✅	1m 2s
tests/trainer/trainer_test.py::TestTrainerPromptTruncation::test_trainer	✅	47.5s
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer	⏭️	1ms
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer_class	⏭️	1ms
tests/trainer/trainer_test.py::AgentScopeTunerTest::test_agentscope_tuner	⏭️	1ms
tests/trainer/trainer_test.py::ColocateModeTest::test_trainer	✅	2m

Github Test Reporter by CTRF 💚

chenyushuo · 2026-02-11T05:26:31Z

/unittest-module-trainer

github-actions · 2026-02-11T06:13:45Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
27	22	2	3	0	0	44m 50s

Failed Tests

Failed Tests ❌	Fail Message
❌ tests/trainer/trainer_test.py::TestTrainerSFTWarmupGSM8K::test_trainer	The test failed in the call phase due to an assertion error
❌ tests/trainer/trainer_test.py::TestMultiModalSFT::test_trainer	The test failed in the call phase due to an assertion error

Skipped

Tests	Status
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer	skipped ⏭️
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer_class	skipped ⏭️
tests/trainer/trainer_test.py::AgentScopeTunerTest::test_agentscope_tuner	skipped ⏭️

Tests

Test Name	Status	Duration
tests/trainer/trainer_test.py::TestTrainerCountdown_0_fsdp::test_trainer	✅	4m
tests/trainer/trainer_test.py::TestTrainerCountdown_1_megatron::test_trainer	✅	5m 16s
tests/trainer/trainer_test.py::TestStepAheadAsyncRL::test_trainer	✅	1m 47s
tests/trainer/trainer_test.py::TestTrainerGSM8K_0_fsdp::test_trainer	✅	1m 9s
tests/trainer/trainer_test.py::TestTrainerGSM8K_1_fsdp2::test_trainer	✅	1m 1s
tests/trainer/trainer_test.py::TestTrainerGSM8K_2_fsdp::test_trainer	✅	1m 7s
tests/trainer/trainer_test.py::TestTrainerGSM8K_3_fsdp2::test_trainer	✅	1m 9s
tests/trainer/trainer_test.py::TestTrainerSFTWarmupGSM8K::test_trainer	❌	1.4s
tests/trainer/trainer_test.py::TestTrainerDPO::test_trainer	✅	32.5s
tests/trainer/trainer_test.py::TestTrainerSFT::test_trainer	✅	29.8s
tests/trainer/trainer_test.py::TestTrainerToolsSFT::test_trainer_tools	✅	29.8s
tests/trainer/trainer_test.py::TestFullyAsyncMode_0_fsdp::test_fully_async_mode	✅	1m 40s
tests/trainer/trainer_test.py::TestFullyAsyncMode_1_fsdp::test_fully_async_mode	✅	1m 37s
tests/trainer/trainer_test.py::TestFullyAsyncMode_2_megatron::test_fully_async_mode	✅	2m 24s
tests/trainer/trainer_test.py::TestTrainerCheckpointSave_0_fsdp::test_trainer	✅	2m 53s
tests/trainer/trainer_test.py::TestTrainerCheckpointSave_1_megatron::test_trainer	✅	5m 53s
tests/trainer/trainer_test.py::TestTrainerMIX::test_trainer	✅	2m 2s
tests/trainer/trainer_test.py::TestServeWithTrainer::test_serve_with_trainer	✅	1m 45s
tests/trainer/trainer_test.py::TestMultiModalGRPO::test_trainer	✅	1m 54s
tests/trainer/trainer_test.py::TestMultiModalSFT::test_trainer	❌	36.2s
tests/trainer/trainer_test.py::TestTrainerLoRA::test_trainer	✅	3m 14s
tests/trainer/trainer_test.py::TestOverRollout::test_trainer	✅	56.1s
tests/trainer/trainer_test.py::TestTrainerPromptTruncation::test_trainer	✅	47.3s
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer	⏭️	1ms
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer_class	⏭️	1ms
tests/trainer/trainer_test.py::AgentScopeTunerTest::test_agentscope_tuner	⏭️	1ms
tests/trainer/trainer_test.py::ColocateModeTest::test_trainer	✅	1m 56s

Github Test Reporter by CTRF 💚

…ance_vl

chenyushuo · 2026-02-11T07:47:58Z

/unittest-module-trainer

github-actions · 2026-02-11T08:39:10Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
27	25	0	2	0	0	48m 51s

Skipped

Tests	Status
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer	skipped ⏭️
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer_class	skipped ⏭️

Tests

Test Name	Status	Duration
tests/trainer/trainer_test.py::TestTrainerCountdown_0_fsdp::test_trainer	✅	4m 8s
tests/trainer/trainer_test.py::TestTrainerCountdown_1_megatron::test_trainer	✅	5m 12s
tests/trainer/trainer_test.py::TestStepAheadAsyncRL::test_trainer	✅	1m 36s
tests/trainer/trainer_test.py::TestTrainerGSM8K_0_fsdp::test_trainer	✅	1m 7s
tests/trainer/trainer_test.py::TestTrainerGSM8K_1_fsdp2::test_trainer	✅	1m 3s
tests/trainer/trainer_test.py::TestTrainerGSM8K_2_fsdp::test_trainer	✅	1m 7s
tests/trainer/trainer_test.py::TestTrainerGSM8K_3_fsdp2::test_trainer	✅	1m 12s
tests/trainer/trainer_test.py::TestTrainerSFTWarmupGSM8K::test_trainer	✅	2m 17s
tests/trainer/trainer_test.py::TestTrainerDPO::test_trainer	✅	35.3s
tests/trainer/trainer_test.py::TestTrainerSFT::test_trainer	✅	29.4s
tests/trainer/trainer_test.py::TestTrainerToolsSFT::test_trainer_tools	✅	31.9s
tests/trainer/trainer_test.py::TestFullyAsyncMode_0_fsdp::test_fully_async_mode	✅	1m 37s
tests/trainer/trainer_test.py::TestFullyAsyncMode_1_fsdp::test_fully_async_mode	✅	1m 38s
tests/trainer/trainer_test.py::TestFullyAsyncMode_2_megatron::test_fully_async_mode	✅	2m 24s
tests/trainer/trainer_test.py::TestTrainerCheckpointSave_0_fsdp::test_trainer	✅	2m 53s
tests/trainer/trainer_test.py::TestTrainerCheckpointSave_1_megatron::test_trainer	✅	5m 53s
tests/trainer/trainer_test.py::TestTrainerMIX::test_trainer	✅	1m 56s
tests/trainer/trainer_test.py::TestServeWithTrainer::test_serve_with_trainer	✅	1m 43s
tests/trainer/trainer_test.py::TestMultiModalGRPO::test_trainer	✅	1m 54s
tests/trainer/trainer_test.py::TestMultiModalSFT::test_trainer	✅	1m 6s
tests/trainer/trainer_test.py::TestTrainerLoRA::test_trainer	✅	3m 9s
tests/trainer/trainer_test.py::TestOverRollout::test_trainer	✅	1m 1s
tests/trainer/trainer_test.py::TestTrainerPromptTruncation::test_trainer	✅	44.8s
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer	⏭️	1ms
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer_class	⏭️	1ms
tests/trainer/trainer_test.py::AgentScopeTunerTest::test_agentscope_tuner	✅	1m 13s
tests/trainer/trainer_test.py::ColocateModeTest::test_trainer	✅	2m 6s

Github Test Reporter by CTRF 💚

trinity/common/patch/kimi.py

trinity/algorithm/policy_loss_fn/chord_policy_loss.py

examples/grpo_vlm/README.md

…ance_vl

chenyushuo · 2026-02-11T10:04:12Z

/unittest-all

chenyushuo · 2026-02-11T11:52:19Z

/unittest-module-algorithm

chenyushuo · 2026-02-11T11:52:27Z

/unittest-module-buffer

chenyushuo · 2026-02-11T11:52:36Z

/unittest-module-cli

chenyushuo · 2026-02-11T11:52:46Z

/unittest-module-common

chenyushuo · 2026-02-11T11:52:57Z

/unittest-module-explorer

chenyushuo · 2026-02-11T11:53:08Z

/unittest-module-manager

chenyushuo · 2026-02-11T11:53:23Z

/unittest-module-service

chenyushuo · 2026-02-11T11:53:30Z

/unittest-module-utils

chenyushuo · 2026-02-11T11:53:35Z

/unittest-module-trainer

github-actions · 2026-02-11T11:54:39Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
27	27	0	0	0	0	3.1s

Tests

Test Name	Status	Duration
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_batch_level_std_grpo	✅	6ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_batch_level_step_wise_grpo_advantage	✅	3ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_duplicate_grpo	✅	5ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_grpo_advantage	✅	3ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_grpo_correct_bias	✅	2ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_grpo_reward_std	✅	1ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_step_wise_grpo_advantage	✅	1ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_step_wise_grpo_with_std_threshold	✅	2ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_abs_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_corrected_k3_fallback	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_corrected_k3_loss	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_corrected_k3_same_policy	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_corrected_k3_with_old_logprob	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_dummy_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_k1_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_k2_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_k3_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_kl_loss_aggregation_modes	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_low_var_kl_fn	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_dpo_policy_loss	✅	2ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_gspo_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_mix_policy_loss	✅	3ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_opmd_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_ppo_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_ppo_policy_loss_with_sequence_masking	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_sapo_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_sft_policy_loss	✅	1ms

Github Test Reporter by CTRF 💚

github-actions · 2026-02-11T11:58:47Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
48	48	0	0	0	0	1m 43s

Tests

Test Name	Status	Duration
tests/buffer/experience_pipeline_test.py::TestExperiencePipeline::test_experience_pipeline	✅	11.5s
tests/buffer/experience_pipeline_test.py::TestExperiencePipeline::test_pass_rate_calculation	✅	5.8s
tests/buffer/experience_storage_test.py::ExperienceStorageTest::test_sql_experience_buffer	✅	2.3s
tests/buffer/experience_storage_test.py::ExperienceStorageTest::test_sql_storage_0_sft	✅	4.3s
tests/buffer/experience_storage_test.py::ExperienceStorageTest::test_sql_storage_1_dpo	✅	4.6s
tests/buffer/file_test.py::TestFileBuffer::test_file_reader	✅	431ms
tests/buffer/file_test.py::TestFileBuffer::test_file_writer	✅	1.5s
tests/buffer/formatter_test.py::TestFormatter::test_dpo_messages_formatter	✅	558ms
tests/buffer/formatter_test.py::TestFormatter::test_dpo_plaintext_formatter	✅	483ms
tests/buffer/formatter_test.py::TestFormatter::test_multi_modal_sft_formatter	✅	841ms
tests/buffer/formatter_test.py::TestFormatter::test_sft_messages_formatter	✅	993ms
tests/buffer/formatter_test.py::TestFormatter::test_sft_plaintext_formatter	✅	733ms
tests/buffer/formatter_test.py::TestFormatter::test_task_formatter	✅	231ms
tests/buffer/queue_test.py::TestQueueBuffer::test_priority_queue_buffer_reuse	✅	6.5s
tests/buffer/queue_test.py::TestQueueBuffer::test_priority_queue_capacity	✅	2.3s
tests/buffer/queue_test.py::TestQueueBuffer::test_priority_queue_reuse_count_control	✅	4.2s
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer_0_queue	✅	3.2s
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer_1_priority_queue	✅	3.1s
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer_capacity	✅	3.8s
tests/buffer/reader_test.py::TestBufferReader::test_buffer_reader_registration	✅	745ms
tests/buffer/reward_shaping_mapper_test.py::TestRewardShapingMapper::test_basic_usage	✅	6ms
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_default_queue_default_sample_strategy	✅	1.8s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_default_queue_staleness_control_sample_strategy	✅	1.7s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_priority_queue_default_sample_strategy	✅	1.8s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_priority_queue_staleness_control_sample_strategy	✅	1.7s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_sql_staleness_control_sample_strategy	✅	4.0s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_default_queue_default_sample_strategy	✅	2.0s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_default_queue_staleness_control_sample_strategy	✅	1.8s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_priority_queue_default_sample_strategy	✅	1.6s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_priority_queue_staleness_control_sample_strategy	✅	1.6s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_sql_staleness_control_sample_strategy	✅	3.1s
tests/buffer/sql_test.py::TestSQLBuffer::test_sql_exp_buffer_read_write_0	✅	5.3s
tests/buffer/sql_test.py::TestSQLBuffer::test_sql_exp_buffer_read_write_1	✅	2.3s
tests/buffer/sql_test.py::TestSQLBuffer::test_sql_task_buffer_read_write	✅	2.7s
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_0	✅	71ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_1	✅	56ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_2	✅	89ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_3	✅	89ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_4	✅	89ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_5	✅	92ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_6	✅	106ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_simple	✅	46ms
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_0_file	✅	262ms
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_1_sql	✅	2.8s
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_2_file	✅	41ms
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_3_sql	✅	2.7s
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_4_file	✅	42ms
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_5_sql	✅	3.1s

Github Test Reporter by CTRF 💚

github-actions · 2026-02-11T12:04:04Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
5	5	0	0	0	0	1m 13s

Tests

Test Name	Status	Duration
tests/cli/launcher_test.py::TestLauncherMain::test_debug_mode	✅	49.9s
tests/cli/launcher_test.py::TestLauncherMain::test_main_run_command	✅	5.8s
tests/cli/launcher_test.py::TestLauncherMain::test_main_run_in_dlc	✅	1.2s
tests/cli/launcher_test.py::TestLauncherMain::test_main_studio_command	✅	914ms
tests/cli/launcher_test.py::TestLauncherMain::test_multi_stage_run	✅	14.4s

Github Test Reporter by CTRF 💚

github-actions · 2026-02-11T12:17:19Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
55	54	0	1	0	0	10m 42s

Skipped

Tests	Status
tests/common/vllm_test.py::TestTinkerAsyncAPIServer::test_api_async	skipped ⏭️

Tests

Test Name	Status	Duration
tests/common/config_test.py::TestConfig::test_all_examples_are_valid	✅	22.7s
tests/common/config_test.py::TestConfig::test_chat_template_path	✅	77ms
tests/common/config_test.py::TestConfig::test_config_flatten	✅	32ms
tests/common/config_test.py::TestConfig::test_continue_from_checkpoint_is_valid	✅	161ms
tests/common/config_test.py::TestConfig::test_default_workflow	✅	76ms
tests/common/config_test.py::TestConfig::test_load_default_config	✅	3.9s
tests/common/config_test.py::TestConfig::test_max_token_len_per_gpu_set_correctly	✅	77ms
tests/common/config_test.py::TestConfig::test_optimizer_config_propagation	✅	77ms
tests/common/config_test.py::TestConfig::test_update_config_from_ray_cluster	✅	1.6s
tests/common/experience_test.py::TestEID::test_eid_properties	✅	1ms
tests/common/experience_test.py::TestExperience::test_action_mask_and_logprobs_type	✅	1ms
tests/common/experience_test.py::TestExperience::test_assertions	✅	1ms
tests/common/experience_test.py::TestExperience::test_dpo_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_gather	✅	1ms
tests/common/experience_test.py::TestExperience::test_gather_with_token_level_reward	✅	1ms
tests/common/experience_test.py::TestExperience::test_hf_datasets_conversion	✅	14ms
tests/common/experience_test.py::TestExperience::test_multi_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_serialize_deserialize	✅	1ms
tests/common/experience_test.py::TestExperience::test_single_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_to_dict	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_batch_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_dpo_experience_batch_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_experience_model_experience_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_gather_experiences_with_custom_fields	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_multiturn_experience_batch_converstion	✅	1ms
tests/common/sudoku_test.py::test_9x9_generator_produces_valid_solution	✅	1ms
tests/common/sudoku_test.py::test_9x9_generator_creates_holes	✅	1ms
tests/common/sudoku_test.py::test_9x9_solution_is_fully_filled	✅	1ms
tests/common/sudoku_test.py::test_judge_allows_incomplete_board	✅	1ms
tests/common/sudoku_test.py::test_judge_detects_row_violation	✅	1ms
tests/common/sudoku_test.py::test_judge_detects_column_violation	✅	1ms
tests/common/sudoku_test.py::test_judge_detects_block_violation	✅	1ms
tests/common/sudoku_test.py::test_4x4_generator_produces_valid_solution	✅	1ms
tests/common/sudoku_test.py::test_4x4_solution_is_fully_filled	✅	1ms
tests/common/sudoku_test.py::test_4x4_judge_detects_row_violation	✅	1ms
tests/common/sudoku_test.py::test_4x4_judge_detects_block_violation	✅	1ms
tests/common/vllm_test.py::ModelWrapperTest_0::test_generate	✅	57.1s
tests/common/vllm_test.py::ModelWrapperTest_1::test_generate	✅	38.9s
tests/common/vllm_test.py::ModelWrapperTest_2::test_generate	✅	37.6s
tests/common/vllm_test.py::TestModelLen_0::test_model_len	✅	32.0s
tests/common/vllm_test.py::TestModelLen_1::test_model_len	✅	25.1s
tests/common/vllm_test.py::TestModelLen_2::test_model_len	✅	27.2s
tests/common/vllm_test.py::TestModelLenWithoutPromptTruncation::test_model_len	✅	27.6s
tests/common/vllm_test.py::TestMessageProcess::test_no_prompt_truncation	✅	27.0s
tests/common/vllm_test.py::TestMessageProcess::test_truncation_status	✅	27.7s
tests/common/vllm_test.py::TestAPIServer::test_api	✅	28.5s
tests/common/vllm_test.py::TestLogprobs::test_logprobs_api	✅	26.7s
tests/common/vllm_test.py::TestAsyncAPIServer::test_api_async	✅	28.0s
tests/common/vllm_test.py::TestTinkerAsyncAPIServer::test_api_async	⏭️	1ms
tests/common/vllm_test.py::TestTokenizer::test_action_mask	✅	265ms
tests/common/vllm_test.py::TestTokenizer::test_action_mask_with_tools	✅	255ms
tests/common/vllm_test.py::TestAPIServerToolCall_0_deepseek_r1::test_api_tool_calls	✅	32.2s
tests/common/vllm_test.py::TestAPIServerToolCall_1::test_api_tool_calls	✅	27.9s
tests/common/vllm_test.py::TestSuperLongGeneration::test_generate	✅	2m 5s
tests/common/vllm_test.py::TestTinkerAPI::test_tinker_api	✅	41.0s

Github Test Reporter by CTRF 💚

github-actions · 2026-02-11T12:33:03Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
49	49	0	0	0	0	13m 11s

Tests

Test Name	Status	Duration
tests/explorer/explorer_test.py::TestExplorerCountdownEval::test_explorer	✅	1m 49s
tests/explorer/explorer_test.py::TestExplorerEvalDetailedStats::test_explorer	✅	1m 12s
tests/explorer/explorer_test.py::TestExplorerGSM8KRULERNoEval::test_explorer	✅	58.1s
tests/explorer/explorer_test.py::TestExplorerGSM8k::test_explorer	✅	2m 59s
tests/explorer/explorer_test.py::ServeTest::test_serve	✅	55.1s
tests/explorer/proxy_test.py::RecorderTest::test_recorder	✅	64ms
tests/explorer/scheduler_test.py::SchedulerTest::test_async_workflow	✅	5.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_concurrent_operations	✅	4.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_dynamic_timeout	✅	13.0s
tests/explorer/scheduler_test.py::SchedulerTest::test_get_results	✅	29.1s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_0	✅	4.8s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_1	✅	4.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_0	✅	4.8s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_1	✅	4.6s
tests/explorer/scheduler_test.py::SchedulerTest::test_multi_step_execution	✅	5.1s
tests/explorer/scheduler_test.py::SchedulerTest::test_non_repeatable_workflow	✅	4.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_over_rollout_min_wait	✅	12.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_all_methods	✅	14.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_restart_after_stop	✅	8.8s
tests/explorer/scheduler_test.py::SchedulerTest::test_split_tasks	✅	8.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_stepwise_experience_eid	✅	24.8s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all	✅	7.9s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all_timeout_with_multi_batch	✅	13.6s
tests/explorer/scheduler_test.py::TestRunnerStateCollection::test_runner_state_collection	✅	9.5s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_0	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_1	✅	602ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_0	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_1	✅	1.0s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_raise_error	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_stop_at_max_env_steps	✅	1.0s
tests/explorer/workflow_test.py::WorkflowTest::test_gsm8k_workflow	✅	26ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_boxed_workflow	✅	16ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_complex_workflow	✅	128ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_eval_workflow	✅	3ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_fraction_workflow	✅	11ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_workflow	✅	7ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_0	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_1	✅	100ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_0	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_1	✅	201ms
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_0::test_multi_turn_workflow	✅	22.4s
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_1::test_multi_turn_workflow	✅	22.8s
tests/explorer/workflow_test.py::TestWorkflowStateRecording::test_workflow_state_recording	✅	4.0s
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter_v0	✅	772ms
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter_v1	✅	14ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner	✅	137ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner_get_state	✅	8.0s
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_with_openai	✅	27.1s
tests/explorer/workflow_test.py::TestConcurrentWorkflowRunner::test_concurrent_workflow_runner	✅	44.2s

Github Test Reporter by CTRF 💚

github-actions · 2026-02-11T12:57:56Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
16	16	0	0	0	0	22m 3s

Tests

Test Name	Status	Duration
tests/manager/synchronizer_test.py::TestSynchronizerExit_0::test_synchronizer	✅	2m 51s
tests/manager/synchronizer_test.py::TestSynchronizerExit_1::test_synchronizer	✅	2m 29s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_0::test_synchronizer	✅	2m 8s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_1::test_synchronizer	✅	1m 34s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_2::test_synchronizer	✅	2m 2s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_3::test_synchronizer	✅	2m 39s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_4::test_synchronizer	✅	2m 24s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_5::test_synchronizer	✅	2m 35s
tests/manager/synchronizer_test.py::TestNCCLBasedSynchronizer_0::test_synchronizer	✅	1m 8s
tests/manager/synchronizer_test.py::TestNCCLBasedSynchronizer_1::test_synchronizer	✅	1m 2s
tests/manager/synchronizer_test.py::TestNCCLBasedSynchronizer_2::test_synchronizer	✅	1m 3s
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_no_new_version_logs_warning	✅	5ms
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_pull_latest_weights_0	✅	2ms
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_pull_latest_weights_1	✅	4ms
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_pull_latest_weights_2	✅	2ms
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_pull_latest_weights_3	✅	2ms

Github Test Reporter by CTRF 💚

github-actions · 2026-02-11T13:01:48Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
4	4	0	0	0	0	1m 7s

Tests

Test Name	Status	Duration
tests/service/data_juicer_test.py::TestDataJuicer::test_config	✅	1.3s
tests/service/data_juicer_test.py::TestDataJuicer::test_server_start	✅	20.8s
tests/service/data_juicer_test.py::TestDataJuicerExperiencePipeline::test_data_juicer_operators	✅	24.1s
tests/service/data_juicer_test.py::TestDataJuicerTaskPipeline::test_data_juicer_task_pipeline	✅	16.4s

Github Test Reporter by CTRF 💚

github-actions · 2026-02-11T13:05:02Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
28	27	0	1	0	0	40.6s

Skipped

Tests	Status
tests/utils/swanlab_test.py::TestSwanlabMonitor::test_swanlab_monitor_smoke	skipped ⏭️

Tests

Test Name	Status	Duration
tests/utils/eval_utils_test.py::TestComputeScore::test_both_boxed_and_equivalent	✅	10ms
tests/utils/eval_utils_test.py::TestComputeScore::test_both_boxed_and_not_equivalent	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_empty_ground_truth	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_empty_solution_string	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_multiple_boxed_answers_in_solution	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_solution_boxed_truth_raw_and_equivalent	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_solution_boxed_truth_raw_and_not_equivalent	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_solution_not_boxed	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_solution_raw_and_ground_truth_boxed_equivalent	✅	1ms
tests/utils/eval_utils_test.py::TestMathEvalUtils::test_extract_answer	✅	3ms
tests/utils/eval_utils_test.py::TestMathEvalUtils::test_verify_math_answer	✅	72ms
tests/utils/eval_utils_test.py::TestEvalUtils::test_is_equiv	✅	4ms
tests/utils/log_test.py::LogTest::test_actor_log	✅	5.5s
tests/utils/log_test.py::LogTest::test_group_by_node	✅	1.8s
tests/utils/log_test.py::LogTest::test_no_actor_log	✅	607ms
tests/utils/plugin_test.py::TestPluginLoader::test_load_plugins_local_0__workspace_tests_utils_plugins	✅	81ms
tests/utils/plugin_test.py::TestPluginLoader::test_load_plugins_local_1_tests_utils_plugins	✅	78ms
tests/utils/plugin_test.py::TestPluginLoader::test_load_plugins_remote_0__workspace_tests_utils_plugins	✅	8.6s
tests/utils/plugin_test.py::TestPluginLoader::test_load_plugins_remote_1_tests_utils_plugins	✅	8.7s
tests/utils/plugin_test.py::TestPluginLoader::test_passing_custom_class_0__workspace_tests_utils_plugins	✅	4.8s
tests/utils/plugin_test.py::TestPluginLoader::test_passing_custom_class_1_tests_utils_plugins	✅	4.9s
tests/utils/registry_test.py::TestRegistryWithRay::test_dynamic_import	✅	2.1s
tests/utils/registry_test.py::TestRegistry::test_algorithm_registry_mapping	✅	24ms
tests/utils/registry_test.py::TestRegistry::test_buffer_module_registry_mapping	✅	20ms
tests/utils/registry_test.py::TestRegistry::test_common_module_registry_mapping	✅	370ms
tests/utils/registry_test.py::TestRegistry::test_register_module	✅	1ms
tests/utils/registry_test.py::TestRegistry::test_utils_module_registry_mapping	✅	1ms
tests/utils/swanlab_test.py::TestSwanlabMonitor::test_swanlab_monitor_smoke	⏭️	1ms

Github Test Reporter by CTRF 💚

gemini-code-assist bot reviewed Feb 9, 2026

View reviewed changes

trinity/common/models/mm_utils.py Outdated Show resolved Hide resolved

trinity/common/models/mm_utils.py Show resolved Hide resolved

trinity/buffer/schema/formatter.py Outdated Show resolved Hide resolved

trinity/common/models/vllm_model.py Outdated Show resolved Hide resolved

chenyushuo added 3 commits February 10, 2026 17:32

1. Bug fix in chord_policy_loss.py

571402f

2. Fix `GPUMemoryValidator` for VL model

1. Support moonshotai/Kimi-VL-A3B-Thinking.

60bea1c

2. Remove `pad_token_id` in config. 3. Add `trust_remote_code` to config. 4. Add `get_model_class` for fsdp worker and fsdp checkpoint manager.

Merge branch 'main' of github.com:modelscope/Trinity-RFT into fix/enh…

7099bb8

…ance_vl

chenyushuo changed the title ~~[WIP] Enhance support for VL models~~ Enhance support for VL models Feb 11, 2026

apply suggestions from reviews

b1fc21c

chenyushuo added 2 commits February 11, 2026 15:47

fix unittest

b79a146

Merge branch 'main' of github.com:modelscope/Trinity-RFT into fix/enh…

942a903

…ance_vl

pan-x-c reviewed Feb 11, 2026

View reviewed changes

trinity/common/patch/kimi.py Show resolved Hide resolved

trinity/algorithm/policy_loss_fn/chord_policy_loss.py Show resolved Hide resolved

garyzhang99 reviewed Feb 11, 2026

View reviewed changes

trinity/algorithm/policy_loss_fn/chord_policy_loss.py Show resolved Hide resolved

pan-x-c reviewed Feb 11, 2026

View reviewed changes

examples/grpo_vlm/README.md Show resolved Hide resolved

chenyushuo added 3 commits February 11, 2026 17:33

apply reviews

e14fe76

Merge branch 'main' of github.com:modelscope/Trinity-RFT into fix/enh…

4073acb

…ance_vl

add qwen3 vl model

4974a02

chenyushuo added 2 commits February 11, 2026 19:14

add doc string

773fd9f

fix unittest

0c7d5bb

Enhance support for VL models #501

Are you sure you want to change the base?

Enhance support for VL models #501

Uh oh!

Conversation

chenyushuo commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

gemini-code-assist bot commented Feb 9, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

github-actions bot commented Feb 11, 2026

Summary

Failed Tests

Skipped

Tests

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

github-actions bot commented Feb 11, 2026

Summary

Failed Tests

Skipped

Tests

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

github-actions bot commented Feb 11, 2026

Summary

Skipped

Tests

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

chenyushuo commented Feb 11, 2026

Uh oh!

github-actions bot commented Feb 11, 2026

Summary

Tests

Uh oh!

github-actions bot commented Feb 11, 2026

Summary

Tests

Uh oh!

github-actions bot commented Feb 11, 2026

Summary

chenyushuo commented Feb 9, 2026 •

edited

Loading