Pull requests: sgl-project/sglang
fix: adding fallback for incomplete tokenizer cache
#14762
opened Dec 9, 2025 by
dougyster
1 task
fix: race condition between validation and download locks
#14761
opened Dec 9, 2025 by
alisonshao
2 tasks
[Auto Sync] Update data_parallel_controller.py, detokenizer... (20251209)
run-ci
#14759
opened Dec 9, 2025 by
merrymercy
fix: adding rate limit warning at verify token permission stage
#14756
opened Dec 9, 2025 by
dougyster
1 task
[Tool Call][DSV32] Streamline function call parameters
deepseek
#14750
opened Dec 9, 2025 by
Muqi1029
2 of 6 tasks
[Tool Call] Add Parallel tool calls request param
#14749
opened Dec 9, 2025 by
Muqi1029
2 of 6 tasks
fix: handle Jinja2 template errors as client errors in OpenAIServingChat
run-ci
#14748
opened Dec 9, 2025 by
JustinTong0323
6 tasks
Fix spec info's filter when reqs are finished right after prefill
#14742
opened Dec 9, 2025 by
hnyls2002
[NPU] Support w4a8 with activation clip
npu
#14736
opened Dec 9, 2025 by
jiaming1130
6 tasks
Fix: ReasonerGrammarBackend can handle tokenizer that does not have "think_end_id" as attribute. (ex: GLM 4.6)
#14731
opened Dec 9, 2025 by
gmlwns2000
[Feat] Async KV transfer for PD disaggregation
amd
blackwell
SM100/SM120
deepseek
dependencies
Pull requests that update a dependency file
diffusion
SGLang Diffusion
documentation
Improvements or additions to documentation
hicache
Hierarchical Caching for SGLang
lora
model-gateway
Multi-modal
multi-modal language model
npu
quant
LLM Quantization
sgl-kernel
#14730
opened Dec 9, 2025 by
Kevin-XiongC
Draft
Use bootstrap server to sync prefill dp rank
documentation
Improvements or additions to documentation
#14726
opened Dec 9, 2025 by
changhuaixin
1 of 6 tasks
[GLM-4.6V] Support Pipeline Parallelism for GLM-4.6V & GLM-4.1V
Multi-modal
multi-modal language model
performance
run-ci
vlm
#14720
opened Dec 9, 2025 by
yuan-luo
6 tasks
Bugfix: Writing to storage when write-back method is chosen
#14718
opened Dec 9, 2025 by
MMuzzammil1
2 of 6 tasks
[diffusion] kernel fusion: add layernorm scale shift fusion for Wan and Qwen-Image models
diffusion
SGLang Diffusion
sgl-kernel
#14717
opened Dec 9, 2025 by
jianyingzhu
6 tasks
Tiny support customizing Prometheus duration buckets
model-gateway
run-ci
#14716
opened Dec 9, 2025 by
fzyzcjy
6 tasks