๐Ÿ› ๏ธAI ๋„๊ตฌ2026-06-26

๋‰ด์Šค - ์›๋ฌธ ๊ธฐ๋ฐ˜ ์š”์•ฝ ํ•„์š”

๐Ÿ’ก ํ•œ์ค„ ์š”์•ฝ|๋‰ด์Šค - ์›๋ฌธ ๊ธฐ๋ฐ˜ ์š”์•ฝ ํ•„์š”


title: "๋ช…๋ น์–ด ํ•œ ์ค„๋กœ HF Jobs์— ํ”„๋ผ์ด๋น— vLLM ์„œ๋ฒ„ ๊ตฌ๋™" description: "๋‰ด์Šค - ์›๋ฌธ ๊ธฐ๋ฐ˜ ์š”์•ฝ ํ•„์š”" date: 2026-06-26 tags: [ai-news] source: "https://dev.to/mlxio_ai/one-command-spins-up-a-private-vllm-server-on-hf-jobs-33a0" sidebar: order: 0

์ œ๋ชฉ(ํ•œ๊ธ€): ๋ช…๋ น์–ด ํ•œ ์ค„๋กœ HF Jobs์— ํ”„๋ผ์ด๋น— vLLM ์„œ๋ฒ„ ๊ตฌ๋™ ์›๋ฌธ ์ œ๋ชฉ(์˜๋ฌธ): One Command Spins Up a Private vLLM Server on HF Jobs ์›๋ฌธ: One Command Spins Up a Private vLLM Server on HF Jobs ์†Œ์Šค: dev-to-ai MD ํŒŒ์ผ: content/2026-06-26/dev-to-ai-one-command-spins-up-a-private-vllm-server-on-hf-j.md

ํ•ต์‹ฌ ๋‚ด์šฉ

Hugging Face Jobs์—์„œ ๋ช…๋ น์–ด ํ•˜๋‚˜๋กœ ํ”„๋ผ์ด๋น— OpenAI ํ˜ธํ™˜ vLLM ์„œ๋ฒ„๋ฅผ ๋„์šธ ์ˆ˜ ์žˆ๊ฒŒ ๋์–ด์š”.

hf jobs run ๋ช…๋ น์–ด์— ๊ณต์‹ vllm/vllm-openai ์ปจํ…Œ์ด๋„ˆ๋ฅผ ์ง€์ •ํ•˜๋ฉด ํฌํŠธ 8000์œผ๋กœ ์—”๋“œํฌ์ธํŠธ๊ฐ€ ์ƒ๊ธฐ๊ฑฐ๋“ ์š”. VM ์„ธํŒ…๋„, Kubernetes๋„ ์—†์ด์š”. ๋น„์šฉ์€ ์žก์ด ์‹คํ–‰๋˜๋Š” ์‹œ๊ฐ„๋งŒํผ๋งŒ ์ดˆ ๋‹จ์œ„๋กœ ์ฒญ๊ตฌ๋˜๋Š” ๊ตฌ์กฐ์˜ˆ์š”.

ํ…Œ์ŠคํŠธ, ํ‰๊ฐ€, ๋ฐฐ์น˜ ์ƒ์„ฑ, ๋น ๋ฅธ ๋ชจ๋ธ ์‹คํ—˜์— ๋”ฑ ๋งž๋Š” ๋ฐฉ์‹์ด์—์š”. ์„œ๋ฒ„ ํ”„๋กœ๋น„์ €๋‹ ์—†์ด HF ์ธํ”„๋ผ ์œ„์—์„œ LLM ์—”๋“œํฌ์ธํŠธ๋ฅผ ๋ฐ”๋กœ ์“ธ ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฒŒ ํ•ต์‹ฌ์ด๊ฑฐ๋“ ์š”.

์žฅ๊ธฐ ์šด์˜์ด ํ•„์š”ํ•˜๋ฉด HF Inference Endpoints๋กœ ๊ฐ€๋ฉด ๋˜๊ณ , ๋‹จ๋ฐœ์„ฑ ์ž‘์—…์—” ์ด ๋ฐฉ์‹์ด ํ›จ์”ฌ ๊ฐ€๋ณ๊ฒŒ ์“ธ ์ˆ˜ ์žˆ์–ด์š”.

์žก๋Œ์Œค์˜ ํ•œ๋งˆ๋””

์„œ๋ฒ„ ์„ธํŒ… ์—†์ด HF ์ธํ”„๋ผ์—์„œ ๋ชจ๋ธ ์‹คํ—˜์„ ๋ฐ”๋กœ ํ•  ์ˆ˜ ์žˆ์–ด์š”. ํ…Œ์ŠคํŠธยทํ‰๊ฐ€ยท๋ฐฐ์น˜ ์ž‘์—…์˜ ์ง„์ž… ์žฅ๋ฒฝ์ด ํฌ๊ฒŒ ๋‚ฎ์•„์ง„ ๊ฑฐ์˜ˆ์š”.


์ถœ์ฒ˜: One Command Spins Up a Private vLLM Server on HF Jobs

์ด ๊ธ€์ด ์–ด๋• ๋‚˜์š”?