Rain-Bus
ffd2defdfc
add Chinese annotations to all source files for learning purposes
...
Annotated 16 source files covering the full architecture:
engine (scheduler, block manager, model runner), layers (attention,
linear, sampler, etc.), model (qwen3), and utils.
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com >
2026-05-25 21:33:15 +08:00
GeekExplorer
9fa256a56d
fix cache hit
2026-04-26 03:49:14 +08:00
GeekExplorer
f64d821c20
fix chunked prefill bugs and refactor
2026-04-26 02:53:06 +08:00
GeekExplorer
8d63a98c03
support chunked prefill and fix minor bug
2026-04-14 03:05:35 +08:00
GeekExplorer
9e8507ef41
minor simplify
2026-04-13 22:09:46 +08:00
Mengqi
82f5ca244f
fix bug for tp
2025-12-18 01:28:25 +08:00
GeeeekExplorer
df99418f7d
simplify
2025-08-31 20:02:51 +08:00
GeeeekExplorer
cde3fc22c2
simplify
2025-06-21 17:19:15 +08:00
GeeeekExplorer
bc0ad5a116
better
2025-06-17 23:33:38 +08:00
GeeeekExplorer
b6136383c9
support fast pickle
2025-06-14 13:36:57 +08:00
GeeeekExplorer
59aa3ff57c
better
2025-06-13 13:07:33 +08:00
GeeeekExplorer
ec3c60d96f
update bench
2025-06-12 22:54:51 +08:00
GeeeekExplorer
f16adb729e
refactor
2025-06-12 09:41:12 +08:00
GeeeekExplorer
386290d69e
refactor
2025-06-11 21:12:57 +08:00
GeeeekExplorer
a5a4909e6a
init commit
2025-06-10 00:27:01 +08:00