使用Datawhale的AMD云平台搭建大模型环境

Mark_Aussie

51人浏览 · 2026-06-23 10:55:06

Mark_Aussie · 2026-06-23 10:55:06 发布

注册AMD的账号：AMD AI 开发者计划

进入Datawhale的云平台：Radeon Cloud

打开终端：

查看GPU情况：amd-smi

检测pytorch：python -c "import torch; print('PyTorch:', torch.__version__); print('ROCm available:', torch.cuda.is_available()); print('Device:', torch.cuda.get_device_name(0) if torch.cuda.is_available() else 'N/A')"

下载 Gemma4 模型

切换腾讯源：pip config set global.index-url https://mirrors.cloud.tencent.com/pypi/simple/

安装魔塔：pip install modelscope

下载Gemma4模型：modelscope download --model google/gemma-4-E4B-it --cache_dir "./models"

确认模型下载成功：ls -lh ./models/google/gemma-4-E4B-it/

安装vllm：

uv pip uninstall torchvision torchaudio # 在该云环境中，需卸载重新安装这个库才能正常使用
uv pip install 'vllm==0.23.0+rocm723' torchvision torchaudio 'fastapi[standard]==0.136.0' \
--no-cache \
--index-url https://mirrors.aliyun.com/pypi/simple/ \
--extra-index-url https://wheels.vllm.ai/rocm/ \
-U

启动模型：vllm serve ./models/google/gemma-4-E4B-it/ --served-model-name gemma-4-E4B-it

打开新终端：vllm chat --url http://localhost:8000/v1 --model gemma-4-E4B-it

测试大模型回答效果，在windows可使用ctrl+C终止

参考：

【Day1-2】15分钟部署&运行 Gemma4 大模型，撰写学习笔记任务详情 - Datawhale AI学习中心