Commit Graph

  • ffd2defdfc add Chinese annotations to all source files for learning purposes main Rain-Bus 2026-05-25 21:33:15 +08:00
  • bb823b3e06 Merge pull request #218 from GeeeekExplorer/chunked-prefill-refactor Xingkai Yu 2026-04-26 13:10:12 +08:00
  • 9fa256a56d fix cache hit GeekExplorer 2026-04-26 03:49:14 +08:00
  • f64d821c20 fix chunked prefill bugs and refactor GeekExplorer 2026-04-26 02:53:06 +08:00
  • 44a51afc8a Merge pull request #207 from DestineG/fix-prefill-index-out-of-range Xingkai Yu 2026-04-25 18:12:43 +08:00
  • 5df84934f3 Merge pull request #213 from Anai-Guo/fix/prepare-prefill-seqlen-k-chunked-prefill Xingkai Yu 2026-04-25 17:43:24 +08:00
  • 25794a1f29 fix(model_runner): correct seqlen_k to chunk boundary in prepare_prefill Tai An 2026-04-22 15:13:19 -07:00
  • 77dd709ca1 fix(scheduler): recalculate num_tokens after allocate to prevent IndexError six 2026-04-20 16:34:27 +08:00
  • 812eb1c1e4 Merge pull request #204 from GeeeekExplorer/chunked-prefill Xingkai Yu 2026-04-14 03:06:27 +08:00
  • 8d63a98c03 support chunked prefill and fix minor bug GeekExplorer 2026-04-14 02:47:35 +08:00
  • 9e8507ef41 minor simplify GeekExplorer 2026-04-13 22:09:46 +08:00
  • 02a95fdc66 Merge pull request #203 from Anai-Guo/fix-row-parallel-bias-crash Xingkai Yu 2026-04-13 21:26:07 +08:00
  • a4f94cb38b Merge pull request #200 from KinglittleQ/fix-scheduler-typing Xingkai Yu 2026-04-13 21:13:37 +08:00
  • 00eea73176 Merge pull request #172 from IceCreamMilkyTea/main Xingkai Yu 2026-04-13 20:45:50 +08:00
  • 52d2215911 Merge pull request #148 from guodongxiaren/main Xingkai Yu 2026-04-13 20:36:32 +08:00
  • 7f967ed6ff Merge pull request #145 from LiaoMengqi/fix/tp Xingkai Yu 2026-04-13 20:34:49 +08:00
  • bf99453d90 fix RowParallelLinear weight_loader crash when bias is enabled Anai-Guo 2026-04-12 23:10:52 -07:00
  • 498f5a1aa8 Fix scheduler.postprocess return type Chengqi Deng 2026-04-11 13:23:49 +08:00
  • f438ce463f enable slots=True for dataclasses IceCreamMilkyTea 2026-02-08 23:46:01 -05:00
  • 55c64e7fdf remove hard code for block_size guodongxiaren 2025-12-30 01:53:55 +08:00
  • 82f5ca244f fix bug for tp Mengqi 2025-12-18 01:28:25 +08:00
  • 2f21442653 support qwen2 GeeeekExplorer 2025-11-04 01:44:09 +08:00
  • db1b49dce4 add logo and trendshift GeeeekExplorer 2025-11-04 00:35:12 +08:00
  • 6ef2a4f630 compile random sampling GeeeekExplorer 2025-08-31 22:55:34 +08:00
  • df99418f7d simplify GeeeekExplorer 2025-08-31 19:44:57 +08:00
  • 6a6d217de7 Merge pull request #67 from PeterDing/fix/decoding-positions Xingkai Yu 2025-08-31 18:05:45 +08:00
  • f5b4840276 fix(model_runner): correct position indexing to be 0-based PeterDing 2025-07-04 14:29:12 +08:00
  • 38baf0bbe4 remove assert shape GeeeekExplorer 2025-06-27 23:00:30 +08:00
  • 2de882a395 Merge pull request #60 from GeeeekExplorer/warmup Xingkai Yu 2025-06-27 22:52:11 +08:00
  • cb0b3dec3f remove rng state GeeeekExplorer 2025-06-27 22:50:33 +08:00
  • 6802cb2f42 Merge pull request #54 from TonyLianLong/patch-1 Xingkai Yu 2025-06-27 22:44:38 +08:00
  • 1caeec8dfa same as vllm GeeeekExplorer 2025-06-27 18:50:56 +08:00
  • 658520b788 warmup and allocate GeeeekExplorer 2025-06-27 01:51:57 +08:00
  • c2ee8b8dff Update pyproject.toml to fix missing files Long(Tony) Lian 2025-06-25 17:57:38 -07:00
  • cfc4cb6710 docs: add manual download instructions papadopoulos Aggelos-Michael 2025-06-24 18:38:28 +03:00
  • 37eb91f890 Merge pull request #39 from xiaohajiayou/main Xingkai Yu 2025-06-24 22:51:58 +08:00
  • 054aec852d Fix: Division-by-Zero Risk and Typo xiaohajiayou 2025-06-24 02:02:33 +08:00
  • 03cfc13bb3 faster pickle GeeeekExplorer 2025-06-23 00:51:52 +08:00
  • 8162578b60 star history Xingkai Yu 2025-06-22 15:13:04 +08:00
  • cde3fc22c2 simplify GeeeekExplorer 2025-06-21 17:04:53 +08:00
  • ad4e95fbdc update .gitignore Xingkai Yu 2025-06-21 07:28:40 +08:00
  • 801365a611 update bench GeeeekExplorer 2025-06-19 23:24:43 +08:00
  • fa0078174e Merge pull request #24 from jinghuan-Chen/fix/Release-CUDA-Graphs-resource-before-exit Xingkai Yu 2025-06-18 17:15:28 +08:00
  • ffafaeb133 Release CUDA Graphs resource before exit. jinghuan-Chen 2025-06-18 16:17:31 +08:00
  • 4fc764f175 Merge pull request #22 from cheunglei/use_spawn Xingkai Yu 2025-06-17 23:53:59 +08:00
  • b5ace32982 use spawn cheunglei 2025-06-17 22:48:44 +08:00
  • bc0ad5a116 better GeeeekExplorer 2025-06-17 23:15:02 +08:00
  • 7e42fa6f63 fix GeeeekExplorer 2025-06-15 13:09:05 +08:00
  • 326b121fad Merge pull request #10 from MARD1NO/refine_return_hint_in_schedule Xingkai Yu 2025-06-15 10:39:51 +08:00
  • ba96387043 Merge pull request #11 from GeeeekExplorer/tp_dev Xingkai Yu 2025-06-15 10:37:21 +08:00
  • fc778a4da9 better GeeeekExplorer 2025-06-15 10:31:48 +08:00
  • c1fd4ea3c2 Merge pull request #9 from cheunglei/tp_dev Xingkai Yu 2025-06-15 10:22:18 +08:00
  • 98bbbefb68 schedule return bool args MARD1NO 2025-06-15 10:15:05 +08:00
  • 53b3ef2e32 support tensor parallel cheunglei 2025-06-15 01:31:24 +08:00
  • b6136383c9 support fast pickle GeeeekExplorer 2025-06-14 13:36:57 +08:00
  • 4a8aa090a7 fix GeeeekExplorer 2025-06-14 00:36:32 +08:00
  • 9b59dae751 Merge pull request #4 from cheunglei/main Xingkai Yu 2025-06-13 23:46:18 +08:00
  • 0ea7414b19 require xxhash cheunglei 2025-06-13 23:40:07 +08:00
  • 59aa3ff57c better GeeeekExplorer 2025-06-13 13:07:33 +08:00
  • 135d1b38a2 release GeeeekExplorer 2025-06-13 00:41:33 +08:00
  • 98a1551a7d support CUDA_VISIBLE_DEVICES GeeeekExplorer 2025-06-12 23:14:01 +08:00
  • ec3c60d96f update bench GeeeekExplorer 2025-06-12 09:47:09 +08:00
  • f16adb729e refactor GeeeekExplorer 2025-06-12 09:41:12 +08:00
  • fee58d44e4 fix GeeeekExplorer 2025-06-11 21:17:23 +08:00
  • 08c84ec08d multi file loader GeeeekExplorer 2025-06-11 22:32:48 +08:00
  • 386290d69e refactor GeeeekExplorer 2025-06-11 21:12:57 +08:00
  • b98e1ca305 fix GeeeekExplorer 2025-06-10 08:52:58 +08:00
  • a5a4909e6a init commit GeeeekExplorer 2025-06-10 00:23:23 +08:00