Pull requests: InternLM/lmdeploy

Fix performance regression for prefix caching
#4270 opened Jan 13, 2026 by lzhangzz
test: add mixing guided and non-guided tests
#4267 opened Jan 12, 2026 by windreamer
Refactor Engine & ModelAgent interact
#4265 opened Jan 11, 2026 by grimoire
docs: add cli docs
#4264 opened Jan 9, 2026 by windreamer
Update benchmark serving script for proxy_server
#4173 opened Dec 1, 2025 by lvhan028
Update installation.md
#4095 opened Nov 3, 2025 by krescent
Add step_map to track token decoding order in DLLM
#4057 opened Oct 21, 2025 by Auraithm
4 tasks done
[POC] Encoder Disaggregation
#4047 opened Oct 17, 2025 by CUHKSZzxy Draft
2 of 7 tasks
quant blocked fp8 (label: enhancement)
#4018 opened Sep 29, 2025 by CUHKSZzxy
4 of 5 tasks
Add reasoning parser for GPT-OSS style channels.
#3998 opened Sep 21, 2025 by GY19A
[PD Disaggregation] remote recomputation preemption
#3854 opened Aug 18, 2025 by JimyMa
add ppu quick start doc (label: documentation)
#3841 opened Aug 14, 2025 by guozixu2001
support pp in turbomind
#3768 opened Jul 24, 2025 by irexyc Draft
1 task
fix: make project PEP 517 compliant.
#3738 opened Jul 17, 2025 by windreamer
5 tasks done
Add dp rank into proxy node status
#3720 opened Jul 8, 2025 by RunningLeon
[ascend] support lora (label: enhancement)
#3715 opened Jul 7, 2025 by tangzhiyi11 Draft
expert distributions
#3709 opened Jul 4, 2025 by CUHKSZzxy