Last active
July 3, 2025 00:40
Build vLLM on RTX 5090
docker build . \
  --target vllm-openai \
  --file docker/Dockerfile \
  --tag vllm-5090:latest \
  --build-arg max_jobs=2 \
  --build-arg nvcc_threads=1 \
  --build-arg RUN_WHEEL_CHECK=false \
  --progress=plain 2>&1 | tee build.log
huggingface-cli download tencent/Hunyuan-A13B-Instruct-GPTQ-Int4 --local-dir /var/models/hunyuan_gptq --local-dir-use-symlinks False --resume-download
to download the model
dmesg -T | tail -n 50
to check the kernel log
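If the reason for checking the kernel log is a build that died partway through, one thing worth looking for is OOM-killer activity. That is an assumption; the gist does not say what to look for. A minimal sketch, where `oom_lines` is a hypothetical helper name and the patterns are the usual kernel OOM messages:

```shell
# Hypothetical helper (not part of the gist): keep only lines that look like
# OOM-killer activity. Reads log text on stdin, so it works with dmesg output
# or a saved log file. "|| true" keeps the exit status 0 when nothing matches.
oom_lines() {
  grep -iE 'out of memory|oom-kill|oom_reaper' || true
}

# Usage (dmesg may need sudo, depending on kernel.dmesg_restrict):
#   dmesg -T | oom_lines | tail -n 50
```

If this prints nothing, the OOM killer was probably not involved and the full `dmesg -T | tail -n 50` output is the place to look.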