System Info
Driver Version: 575.57.08, CUDA Driver Version: 12.9, GPU: RTX 5090 (Blackwell)
vllm 0.14.1
transformers 5.0.0.dev0 (with 4.57.6, the error says transformers does not support this model)
Who can help?
@zRzRzRzRzRzRzR
Information
Reproduction
CUDA_VISIBLE_DEVICES=2,3 vllm serve /data2/models/ZhipuAI/GLM-4.7-Flash \
  --tensor-parallel-size 2 \
  --tool-call-parser glm47 \
  --reasoning-parser glm45 \
  --quantization fp8 \
  --max-model-len 32768 \
  --enable-auto-tool-choice \
  --served-model-name glm-4.7-flash
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] File "/home/user/miniconda3/envs/vllm-nightly/lib/python3.12/site-packages/vllm/model_executor/models/utils.py", line 335, in load_weights
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] autoloaded_weights = set(self._load_module("", self.module, weights))
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] File "/home/user/miniconda3/envs/vllm-nightly/lib/python3.12/site-packages/vllm/model_executor/models/utils.py", line 288, in _load_module
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] yield from self._load_module(
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] File "/home/user/miniconda3/envs/vllm-nightly/lib/python3.12/site-packages/vllm/model_executor/models/utils.py", line 288, in _load_module
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] yield from self._load_module(
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] File "/home/user/miniconda3/envs/vllm-nightly/lib/python3.12/site-packages/vllm/model_executor/models/utils.py", line 319, in _load_module
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] raise ValueError(msg)
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] ValueError: There is no module or parameter named 'model.layers.47' in TransformersMoEForCausalLM
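A possible way to narrow this down (a sketch, not a confirmed diagnosis): the error names `model.layers.47`, so it may help to check whether the checkpoint contains layer indices at or beyond `num_hidden_layers` from `config.json`, for example an extra layer that the Transformers fallback backend does not build. The script below assumes a sharded checkpoint with the standard Hugging Face `model.safetensors.index.json` file; the helper names `extra_layer_indices` and `check_checkpoint` are hypothetical and not part of vLLM.

```python
import json
import re
import sys
from pathlib import Path

# Matches weight names such as "model.layers.47.mlp.gate.weight".
LAYER_RE = re.compile(r"^model\.layers\.(\d+)\.")


def extra_layer_indices(weight_names, num_hidden_layers):
    """Return sorted layer indices found in weight_names that are >= num_hidden_layers."""
    found = set()
    for name in weight_names:
        match = LAYER_RE.match(name)
        if match and int(match.group(1)) >= num_hidden_layers:
            found.add(int(match.group(1)))
    return sorted(found)


def check_checkpoint(model_dir):
    """Compare config.json against the safetensors index in model_dir (assumed layout)."""
    model_dir = Path(model_dir)
    config = json.loads((model_dir / "config.json").read_text())
    index = json.loads((model_dir / "model.safetensors.index.json").read_text())
    return extra_layer_indices(index["weight_map"].keys(), config["num_hidden_layers"])


if __name__ == "__main__" and len(sys.argv) > 1:
    # Example: python check_layers.py /data2/models/ZhipuAI/GLM-4.7-Flash
    print("Layer indices beyond num_hidden_layers:", check_checkpoint(sys.argv[1]))
```

If this prints a non-empty list, the checkpoint ships weights for layers that the instantiated model does not have, which would be consistent with the `ValueError` above.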
Expected behavior
The server launches without error.