This website requires JavaScript.
Explore
Help
Sign In
Rain-Bus
/
nano-vllm
Watch
1
Star
0
Fork
0
You've already forked nano-vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
02a95fdc66c15d24d3fa36c72aaf9088f0af2ac5
nano-vllm
/
nanovllm
T
History
Xingkai Yu
02a95fdc66
Merge pull request
#203
from Anai-Guo/fix-row-parallel-bias-crash
...
fix RowParallelLinear weight_loader crash when bias is enabled
2026-04-13 21:26:07 +08:00
..
engine
Merge pull request
#200
from KinglittleQ/fix-scheduler-typing
2026-04-13 21:13:37 +08:00
layers
fix RowParallelLinear weight_loader crash when bias is enabled
2026-04-12 23:10:52 -07:00
models
support qwen2
2025-11-04 01:44:42 +08:00
utils
enable slots=True for dataclasses
2026-02-08 23:46:01 -05:00
__init__.py
better
2025-06-15 10:36:45 +08:00
config.py
enable slots=True for dataclasses
2026-02-08 23:46:01 -05:00
llm.py
support tensor parallel
2025-06-15 01:31:24 +08:00
sampling_params.py
enable slots=True for dataclasses
2026-02-08 23:46:01 -05:00