Gateway API Inference Extension#
kubernetes-sigs/gateway-api-inference-extension
IGW_LATEST_RELEASE=$(curl -s https://api.github.com/repos/kubernetes-sigs/gateway-api-inference-extension/releases \
| jq -r '.[] | select(.prerelease == false) | .tag_name' \
| sort -V \
| tail -n1)
# Your Hugging Face Token with access to the set of Llama models
kubectl create secret generic hf-token --from-literal=token=$HF_TOKEN
kubectl apply -f "https://raw.githubusercontent.com/kubernetes-sigs/gateway-api-inference-extension/refs/tags/${IGW_LATEST_RELEASE}/config/manifests/vllm/gpu-deployment.yaml"
kubectl apply -f "https://github.com/kubernetes-sigs/gateway-api-inference-extension/releases/download/${IGW_LATEST_RELEASE}/manifests.yaml"
叶王 © 2013-2026 版权所有。如果本文档对你有所帮助,可以请作者喝饮料。