Billy D. a16ffff73f
Some checks failed
Build and Push Images / build-nvidia (push) Failing after 1s
Build and Push Images / build-rdna2 (push) Failing after 1s
Build and Push Images / build-strixhalo (push) Failing after 1s
Build and Push Images / build-intel (push) Failing after 1s
feat: Add GPU-specific Ray worker images with CI/CD
- Add Dockerfiles for nvidia, rdna2, strixhalo, and intel GPU targets
- Add ray-serve modules (embeddings, whisper, tts, llm, reranker)
- Add Gitea Actions workflow for automated builds
- Add Makefile for local development
- Update README with comprehensive documentation
2026-02-01 15:04:31 -05:00
2026-02-01 19:59:37 +00:00

KubeRay Worker Images

GPU-specific Ray worker images for the DaviesTechLabs AI/ML platform.

Images

Image GPU Target Workloads Registry
ray-worker-nvidia NVIDIA CUDA (RTX 2070) Whisper STT, XTTS TTS git.daviestechlabs.io/daviestechlabs/ray-worker-nvidia
ray-worker-rdna2 AMD ROCm (Radeon 680M) BGE Embeddings git.daviestechlabs.io/daviestechlabs/ray-worker-rdna2
ray-worker-strixhalo AMD ROCm (Strix Halo) vLLM, BGE git.daviestechlabs.io/daviestechlabs/ray-worker-strixhalo
ray-worker-intel Intel XPU (Arc) BGE Reranker git.daviestechlabs.io/daviestechlabs/ray-worker-intel

Building Locally

# Build all images
make build-all

# Build specific image
make build-nvidia
make build-rdna2
make build-strixhalo
make build-intel

# Push to Gitea registry (requires login)
docker login git.daviestechlabs.io
make push-all

CI/CD

Images are automatically built and pushed to git.daviestechlabs.io package registry on:

  • Push to main branch
  • Git tag creation (e.g., v1.0.0)

Gitea Actions Secrets Required

Add these secrets in Gitea repo settings → Actions → Secrets:

Secret Description
REGISTRY_USER Gitea username
REGISTRY_TOKEN Gitea access token with package:write scope

Directory Structure

kuberay-images/
├── dockerfiles/
│   ├── Dockerfile.ray-worker-nvidia
│   ├── Dockerfile.ray-worker-rdna2
│   ├── Dockerfile.ray-worker-strixhalo
│   ├── Dockerfile.ray-worker-intel
│   └── ray-entrypoint.sh
├── ray-serve/
│   ├── serve_embeddings.py
│   ├── serve_whisper.py
│   ├── serve_tts.py
│   ├── serve_llm.py
│   ├── serve_reranker.py
│   └── requirements.txt
├── .gitea/workflows/
│   └── build-push.yaml
├── Makefile
└── README.md

Environment Variables

Variable Description Default
RAY_HEAD_SVC Ray head service name ai-inference-raycluster-head-svc
GPU_RESOURCE Custom Ray resource name gpu_nvidia, gpu_amd, etc.
NUM_GPUS Number of GPUs to expose 1

Node Allocation

Node Image GPU Memory
elminster ray-worker-nvidia RTX 2070 8GB VRAM
khelben ray-worker-strixhalo Strix Halo 64GB Unified
drizzt ray-worker-rdna2 Radeon 680M 12GB VRAM
danilo ray-worker-intel Intel Arc 16GB Shared
Description
Where all my kuberay images will go
Readme MIT 328 KiB
Languages
Python 77.3%
Makefile 14.7%
Shell 8%